How to Make a Heatmap in R Written by Matt Dancho on October 12, 2021 The ggplot2 package is an essential tool in every data scientists toolkit. Today we show you how to use ggplot2 to make a professional heatmap that organizes customers by their sales purchasing habits. Read More...
3 Ways to Read Multiple CSV Files: For-Loop, Map, List Comprehension Written by Matt Dancho on September 21, 2021 Reading many CSV files is a common task for a data scientist. In this free tutorial, we show you 3 ways to streamline reading CSV files in Python. Read More...
Tidy Parallel Processing in R with furrr Written by Matt Dancho on September 14, 2021 furrr is a critical package to speed up iterative calculations using tidyverse purrr syntax. Read More...
Introducing Iterative (Nested) Forecasting with Modeltime Written by Matt Dancho on August 26, 2021 I'm super excited to introduce the experimental feature for performing iterative forecasting. Read More...
ggalt: Make a Lollipop Plot to Compare Categories in ggplot2 Written by Matt Dancho on August 24, 2021 ggalt is a ggplot2 extension that adds many new ggplot geometries. In this tutorial, we'll learn how to make lollipop plots for comparing categories within our data using geom_lollipop(). Read More...
ggalt: Make a Dumbbell Plot to Visualize Change in ggplot2 Written by Matt Dancho on August 12, 2021 ggalt is a ggplot2 extension that adds many new ggplot geometries. In this tutorial, we'll learn how to make dumbbell plots for visualizing change within our data using geom_dumbbell(). Read More...
SweetViz: Automated Exploratory Data Analysis (EDA) in Python Written by Matt Dancho on August 3, 2021 SweetViz is a Python library that makes exploratory data analysis (EDA) fast and effective. Learn how to investigate feature relationships using correlation and associations in the automated SweetViz report. Read More...
ggforce: Make a Hull Plot to Visualize Clusters in ggplot2 Written by Matt Dancho on July 27, 2021 ggforce is a ggplot2 extension that adds many exploratory data analysis features. In this tutorial, we'll learn how to make hull plots for visualizing clusters or groups within our data. Read More...
ggdist: Make a Raincloud Plot to Visualize Distribution in ggplot2 Written by Matt Dancho on July 22, 2021 The ggdist package is a ggplot2 extension that is made for visualizing distributions and uncertainty. We'll show see how ggdist can be used to make a raincloud plot. Read More...
Forecasting Many Time Series (Using NO For-Loops) Written by Matt Dancho on July 19, 2021 I'm super excited to introduce the new panel data forecasting functionality in modeltime. It's perfect for making many forecasts at once without for-loops. Read More...
easystats: Quickly investigate model performance Written by Matt Dancho on July 13, 2021 The easystats performance R package makes it easy to investigate the relevant assumptions for regression models. Simply use the check_model() function to produce a visualization that combines 6 tests for model performance. Read More...
R is for Research, Python is for Production Written by Matt Dancho and Jarrell Chalmers on July 12, 2021 Both R and Python are great. We’ll showcase some of the strengths of each language in this article by showcasing where the major development efforts are within each ecosystem. Read More...