5.0 / 5
We will cover the following tasks in 36 minutes:
Cloudy Nights is a forum I visit for Astronomy. There are conversations boards on various topics but mostly I use the Classified Section, which has telescope equipment for sale. In this chapter I will show you how I drew some interesting insights from the Classifieds. After that I will give you a quick bio. And then we’ll get going.
IMDB Simple Scrape
In this lesson we will scrape a single rating on the IMDB website. Though simple, this lesson will set the ground work for future lessons and give you a quick understanding of the code. You will want to stay for this one because it really sets the foundation for the future chapters.
It is football season after all - and to some that’s the only season, though I’m a baseball fan. I digress. Let’s scrape historical data on Superbowl winners in Wikipedia in this lesson. After we do this you will know how to scrape all wiki pages. There is a trick to it. I’ll show you. Hint: We don’t use CSS Selectors.
American Ninja Warrior Parsing URLs
This is a fun show. American Ninja Warrior. Stephy Graph. She’s awesome. Anyhow.
In this section and the remaining chapters we’re going to parse the data for a few seasons of the TV show.
Our scraping is getting more and more complex. By the way, web scraping doesn’t have a one size fits all method. But I’m exposing you to different methods so this is good and it will serve you well going forward. It’s like adding tools to a tool belt.
American Ninja Warrior (CSS Selectors)
In this lesson we’re going to pull the Selectors for the data we want to extract. It’s a must to know how to do this using the CSS Selector Gadget by Chrome. I will go over the gadget, how to find the css code and where to put it in the code.
American Ninja Warrior (Rvest)
In this lesson we’re going to set up our code using Rvest. Once you learn how to set this code chunk up your scraping days will be much easier in the future. I’ll also give you a little tidbit on Messier a famous astonomer as we’re typing it out.
Amrican Ninja Warrior (Function)
This is advanced as I wrap our rvest code into a function. If you don’t know about functions it’s ok. Just follow me a long and take a course on functions later. If you do now about functions better yet. This is a time saver. Especially for scripts that you plan on running more than once. Automation baby!
Final Thoughts on the Session and your future scraping projects. Thank you for taking my course. If you liked this course please email me or better yet take another one. You did good on this course. It’s a nice skill set. Go out there and pull down some data.
About the Host (Chris Shockley)
I am a R enthusiast, hiker, and amateur astronomer. My favorite hike is located in Mt. Rainier National Park, my favorite Deep Sky Object is Alberio, and my favorite R package is dplyr (since I use it everyday). I am single too. Maybe because I spend too much time playing? I have a dog named Coog (Lllasa Apso), who would rather be outside than inside, which means I have to take him on a lot of walks. I work as a Data Analyst/Financial Analyst for a Metals Co. located in Seattle, WA. I have been in my current position for 4 years. My hope is that I can help you, even if its with my enthusiasm. Yes you can learn R and the Rhyme Interface will help you. But. You also must take what you learn and practice, practice, practice. So... Let's get after it. See you soon.