Machine Learning in R: Random Forests Using FFTrees

In this project, we are going to use an R package that was developed by Nathaniel Philips called Fast and Furious Trees or FFTrees. This package is an implementation of the Random Forest algorithm. Where it lacks in accuracy (slightly behind the RF), it makes up for speed and understandable data - something that you will see shortly. If you need to make a quick decision, such as - whether or not a mushroom is poisonous, this could be your go-to algorithm to figure that out. In this project, I will show you how to create test and training sets, how to train the model and some of my favorite attributes of the package. This is a great introduction to Random Forest models and machine learning.

Available Through Coursera
Machine Learning in R: Random Forests Using FFTrees

Duration (mins)

Learners

5.0 / 5

Rating

Task List


We will cover the following tasks in 22 minutes:


Introduction

In this task, I am going to outline what the goals of the project are. I will go over some basics with you including a little bit about the Rhyme interface. We will run a few code chunks. At the end I will share a little bit about me.


How many are Poisonous

Whenever you are working with a new dataset, it is always wise to do some exploratory analysis. Often, I find that I want to jump right into the algorithm - perhaps you do too? But that’s not the best way to approach solving the problems and instead, should always start with exploratory data analysis. We will do some exploratory analysis in this task in order to better understand the data. Of course, we could spend a whole project on exploratory analysis (perhaps in another project). However, for the current project, we will only do a quick analysis.


Training and Test Split

It is important to break your datasets into two sets - one for training the model and the another for testing the trained model. And to do this, you often need to take random samples for both sets. Once the model is trained, we can use the test set (data the model hasn’t been trained on) and see how well are model performs. I will show you one technique that you can use to do this important step.


Training the Model

In this task, we are going to train the model. There are hundreds of different machine learning algorithms, but most of them follow the same overall structure. Pay attention to the function calls and especially the tilde since this approach will be used in many other machine learning models and algorithms.


Confusion Matrix

In this task, I am going to show you some of the cool features of the FFTrees package. I will show you a confusion matrix. A confusion matrix can be used to quantify the performance of the model. I will also show you how to extract the most important attributes which can be used in a real world scenario when you need a couple data points to classify a problem.


Predict

In this task, I am going to use the trained model on the test set to see how well it performs. After that I will close the project with a few comments about the project an our learnings from it.

Watch Preview

Preview the instructions that you will follow along in a hands-on session in your browser.

Chris Shockley

About the Host (Chris Shockley)


I am an R enthusiast, hiker, and amateur astronomer. My favorite hike is located in Mt. Rainier National Park, my favorite Deep Sky Object is Alberio, and my favorite R package is dplyr (since I use it everyday). I have a dog named Coog (Lllasa Apso)., I work as a Data Analyst/Financial Analyst for a Metals Co. located in Seattle, WA. I have been in my current position for 5 years. I work in SQL, R, R Shiny, QGIS. Because I have traveled the roads you are on I believe I will be an asset and will add value to your programming repertoire. We will walk through multiple examples and get to know each other through the process. Don't take my word for it though. Come on in and take a Project or two. Regards, Chris Shockley



Frequently Asked Questions


In Rhyme, all projects are completely hands-on. You don't just passively watch someone else. You use the software directly while following the host's (Chris Shockley) instructions. Using the software is the only way to achieve mastery. With the "Live Guide" option, you can ask for help and get immediate response.
Nothing! Just join through your web browser. Your host (Chris Shockley) has already installed all required software and configured all data.
You can go to https://rhyme.com, sign up for free, and follow this visual guide How to use Rhyme to create your own projects. If you have custom needs or company-specific environment, please email us at help@rhyme.com
Absolutely. We offer Rhyme for workgroups as well larger departments and companies. Universities, academies, and bootcamps can also buy Rhyme for their settings. You can select projects and trainings that are mission critical for you and, as well, author your own that reflect your own needs and tech environments. Please email us at help@rhyme.com
Rhyme strives to ensure that visual instructions are helpful for reading impairments. The Rhyme interface has features like resolution and zoom that will be helpful for visual impairments. And, we are currently developing a close-caption functionality to help with hearing impairments. Most of the accessibility options of the cloud desktop's operating system or the specific application can also be used in Rhyme. If you have questions related to accessibility, please email us at accessibility@rhyme.com
We started with windows and linux cloud desktops because they have the most flexibility in teaching any software (desktop or web). However, web applications like Salesforce can run directly through a virtual browser. And, others like Jupyter and RStudio can run on containers and be accessed by virtual browsers. We are currently working on such features where such web applications won't need to run through cloud desktops. But, the rest of the Rhyme learning, authoring, and monitoring interfaces will remain the same.
Please email us at help@rhyme.com and we'll respond to you within one business day.

No sessions available