Data Science With R Course Series - Week 8

Written by David Curry

This is a fun part of the course where you learn how to add value to the business by providing ROI-Driven Data Science!

Week 8 will teach you how to calculate a simple policy change - Implementing a No Overtime Policy. You will calculate the expected savings compared to the current policy of allowing overtime.

Next, develop a more sophisticated policy change - Implementing a Targeted Overtime Policy - wherein you target high-flight risk employees.

We’ll teach you everything you need to know about the Expected Value Framework so you can begin to implement this for ANY BINARY CLASSIFICATION MODEL. This includes Customer Churn, Targeted Advertisements, and more!

Here is a recap of our trajectory and the course overview:

Recap: Data Science With R Course Series

You’re in the Week 8: Link Data Science To Business With Expected Value. Here’s our game-plan over the 10 articles in this series. We’ll cover how to apply data science for business with R following our systematic process.

Week 1: Getting Started
Week 2: Business Understanding
Week 3: Data Understanding
Week 4: Data Preparation
Week 5: Predictive Modeling With H2O
Week 6: H2O Model Performance
Week 7: Machine Learning Interpretability With LIME
Week 8: Link Data Science To Business With Expected Value (You’re here)
Week 9: Expected Value Optimization And Sensitivity Analysis
Week 10: Build A Recommendation Algorithm To Improve Decision Making

Week 8: Link Data Science To Business With Expected Value

Student Feedback

Week 8: Link Data Science To Business With Expected Value

Overview & Setup

This overview will teach you how to quantify the business return on investment (ROI) using the Expected Value Framework.

The Expected Value Framework is way to apply an expected value to a classification model - it connects a machine learning classification model to ROI for the business.

Learn how to combine:

The threshold,
Knowledge of costs and benefits, and
The confusion matrix converted to expected error rates to account for the presence of false positives and false negatives.

We can use this combination to calculate the business savings for implementing policy as a results of your data science work.

Calculating Expected ROI: No Overtime Policy

Over the past few weeks you have created a machine learning model to predict which employees are likely to leave. Through your analysis, you determined the number one cause of employee turnover is working overtime hours.

In this module, you will create a baseline calculation to determine how much the business will save if they completely remove overtime for all employees.

Targeting By Threshold Primer

Targeting by threshold will allow you to target employees above a certain level of turnover risk to pinpoint those who are most likely to leave.

Use the calculation to determine the expected value to find the optimal threshold that will maximize business savings for implementing the policy.

Calculating Expected ROI: Targeted Overtime Policy

Using the threshold from the previous module, learn how to apply the expected value to employees above a certain probability to leave.

You will also compare the targeted overtime policy to the previous no overtime policy to calculate the cost difference between each policy.

This will enable you to clearly communicate the savings for implementing overtime policy.

You Need To Learn R For Business

To be efficient as a data scientist, you need to learn R. Take the course that has cut data science projects in half (see this testimonial from a leading data science consultant) and has progressed data scientists more than anything they have tried before. Over 10-weeks you learn what it has taken data scientists 10-years to learn:

Our systematic data science for business framework
R and H2O for Machine Learning
How to produce Return-On-Investment from data science
And much more.

Start Learning Today!

Next Up

The next article in the Data Science With R Series covers Expected Value Optimization And Sensitivity Analysis.

This is a really fun series of chapters that teach you the skills to align your data science work with business ROI.

This week’s targeted analysis leads into Week 9, where you will perform two advanced analyses that are critical in the course:

Threshold Optimization - A method used to maximize expected saving via iteratively calculating savings at various thresholds
Sensitivity Analysis - A method used to investigate how sensitive the expected savings is to various parameter values that were created based on assumptions

Week 9: Expected Value Optimization And Sensitivity Analysis

New Course Coming Soon: Build A Shiny Web App!

You’re experiencing the magic of creating a high performance employee turnover risk prediction algorithm in DS4B 201-R. Why not put it to good use in an Interactive Web Dashboard?

In our new course, Build A Shiny Web App (DS4B 301-R), you’ll learn how to integrate the H2O model, LIME results, and recommendation algorithm building in the 201 course into an ML-Powered R + Shiny Web App!

Shiny Apps Course Coming in October 2018!!! Sign up for Business Science University Now!