Introducing portfoliodown: The Data Science Portfolio Website Builder Written by Matt Dancho on December 20, 2021 I'm super excited to introduce a new R package that makes it painless for data scientists to create a professional. Read More...
Tidy Time Series Forecasting in R with Spark Written by Matt Dancho on October 20, 2021 I'm super excited to introduce the new Modeltime Backend for Spark. Let's use it to perform forecasting with tidymodels. Read More...
How to Make a Heatmap in R Written by Matt Dancho on October 12, 2021 The ggplot2 package is an essential tool in every data scientists toolkit. Today we show you how to use ggplot2 to make a professional heatmap that organizes customers by their sales purchasing habits. Read More...
3 Ways to Read Multiple CSV Files: For-Loop, Map, List Comprehension Written by Matt Dancho on September 21, 2021 Reading many CSV files is a common task for a data scientist. In this free tutorial, we show you 3 ways to streamline reading CSV files in Python. Read More...
Tidy Parallel Processing in R with furrr Written by Matt Dancho on September 14, 2021 furrr is a critical package to speed up iterative calculations using tidyverse purrr syntax. Read More...
Introducing Iterative (Nested) Forecasting with Modeltime Written by Matt Dancho on August 26, 2021 I'm super excited to introduce the experimental feature for performing iterative forecasting. Read More...
ggalt: Make a Lollipop Plot to Compare Categories in ggplot2 Written by Matt Dancho on August 24, 2021 ggalt is a ggplot2 extension that adds many new ggplot geometries. In this tutorial, we'll learn how to make lollipop plots for comparing categories within our data using geom_lollipop(). Read More...
ggalt: Make a Dumbbell Plot to Visualize Change in ggplot2 Written by Matt Dancho on August 12, 2021 ggalt is a ggplot2 extension that adds many new ggplot geometries. In this tutorial, we'll learn how to make dumbbell plots for visualizing change within our data using geom_dumbbell(). Read More...
SweetViz: Automated Exploratory Data Analysis (EDA) in Python Written by Matt Dancho on August 3, 2021 SweetViz is a Python library that makes exploratory data analysis (EDA) fast and effective. Learn how to investigate feature relationships using correlation and associations in the automated SweetViz report. Read More...
ggforce: Make a Hull Plot to Visualize Clusters in ggplot2 Written by Matt Dancho on July 27, 2021 ggforce is a ggplot2 extension that adds many exploratory data analysis features. In this tutorial, we'll learn how to make hull plots for visualizing clusters or groups within our data. Read More...
ggdist: Make a Raincloud Plot to Visualize Distribution in ggplot2 Written by Matt Dancho on July 22, 2021 The ggdist package is a ggplot2 extension that is made for visualizing distributions and uncertainty. We'll show see how ggdist can be used to make a raincloud plot. Read More...
Forecasting Many Time Series (Using NO For-Loops) Written by Matt Dancho on July 19, 2021 I'm super excited to introduce the new panel data forecasting functionality in modeltime. It's perfect for making many forecasts at once without for-loops. Read More...