5-Day Free Course · Statistics · R

R Programming for Data and Statistics

The tidyverse, ggplot2, statistical modeling, tidymodels for machine learning, and R Markdown for reproducible reporting. The R skills data scientists and researchers actually use — not base R syntax lectures.

Start Day 1 → See Syllabus

5 days self-paced

Free forever

Text + external video refs

No signup required

Days

30+

Code Examples

External Videos

Forever Free

How This Course Works

No videos. On purpose.

This is a text-first course that links out to the best supporting material on the internet instead of trying to replace it. The goal is to make this the best course on r programming you can find — even without producing a single minute of custom video.

Practitioner-tested, not vendor marketing

This course is built by engineers who ship r programming systems for a living. It reflects how these tools actually behave in production — not how the documentation describes them.

Code you can run, not demos to watch

Every day includes working code examples you can copy, run, and modify right now. The goal is understanding through doing, not passive reading.

Links to the canonical sources

Instead of re-explaining existing documentation, this course links to the definitive open-source implementations and the best reference material on r programming available.

Completes in 5 one-hour sessions

Each day is designed to finish in about an hour of focused reading plus hands-on work. Do the whole course over a week of lunch breaks. No calendar commitment, no live classes.

Syllabus

The 5 Days

Each day stands alone. Read them in order for the full picture, or jump straight to the day that answers the question you have today.

01Day One

R and the Tidyverse

RStudio setup, vectors and data frames, dplyr for data manipulation (filter, select, mutate, summarize, group_by), piping with |>, and why the tidyverse dialect makes R more readable.

tidyversedplyrpipesdata frames

→

02Day Two

Data Visualization with ggplot2

The grammar of graphics, layers, aesthetics, geoms, scales, and facets. Building publication-quality charts from scratch — histograms, scatter plots, line charts, and faceted grids.

ggplot2grammar of graphicsgeomsfacets

→

03Day Three

Statistical Analysis in R

Descriptive statistics, hypothesis testing (t-test, ANOVA, chi-square), correlation, linear regression with lm(), model summaries, and the statistical output R produces vs what it actually means.

hypothesis testinglinear regressionlm()p-values

→

04Day Four

Machine Learning with tidymodels

The tidymodels meta-framework — recipes for preprocessing, parsnip for model interfaces, rsample for cross-validation, yardstick for metrics, and running logistic regression, random forest, and xgboost through the same interface.

tidymodelsparsnipcross-validationrandom forest

→

05Day Five

R Markdown and Reproducible Research

R Markdown documents, code chunk options, knitting to HTML and PDF, parameterized reports, version control for R projects with renv, and connecting R outputs to Quarto for modern publishing.

R MarkdownQuartorenvreproducibility

→

Supporting Videos

The best external videos on this topic.

Instead of shooting our own videos, we link to the best deep-dives already on YouTube. Watch them alongside the course. All external, all free, all from builders who ship this stuff.

YouTube · Search

R and the Tidyverse

Complete tutorials on dplyr, tidyr, and the pipe — the tidyverse dialect that makes R genuinely readable for data manipulation.

YouTube · Search

ggplot2 Data Visualization

The grammar of graphics applied with ggplot2 — from basic scatter plots to publication-quality faceted charts.

YouTube · Search

tidymodels Machine Learning

The tidymodels ecosystem for consistent ML workflows in R — preprocessing, model fitting, cross-validation, and evaluation in a unified interface.

YouTube · Search

Statistical Analysis in R

Hypothesis testing, regression, and statistical inference in R — with clear explanations of when to use each test.

YouTube · Search

R Markdown and Quarto

Reproducible reporting in R — from basic R Markdown documents to parameterized reports and Quarto publishing.

Open-Source Implementations

Read the source.

The best way to deepen understanding is to read the canonical open-source implementations. Clone them, trace the code, understand how the concepts in this course get applied in production.

github.com/tidyverse

tidyverse

Meta-package and organization for the tidyverse. The individual package repos (dplyr, ggplot2, tidyr) are the best source code references for understanding R data manipulation.

github.com/tidymodels

tidymodels

The tidymodels meta-package. The /vignettes directory has the canonical examples for every component of the ML workflow.

github.com/tidyverse

ggplot2

The ggplot2 source. Understanding the layer and aes() internals explains why the grammar-of-graphics approach is so flexible.

github.com/quarto-dev

quarto-cli

Quarto is the modern successor to R Markdown — multi-language, multi-format scientific publishing. The examples directory covers every output format.

Who This Is For

Three kinds of people read this.

Researchers and Academic Data Scientists

R remains the dominant language for statistical research. This course covers the modern R workflow that academic and industry researchers actually use.

Data Analysts Moving Beyond Excel

ggplot2 and dplyr make R the most powerful spreadsheet you've ever used. This course is the fastest path from Excel to publication-quality R analysis.

Python Data Scientists Adding R

R's statistical libraries and ggplot2 still lead Python for certain research tasks. This course gives Python practitioners enough R to read and run existing R code.

Want to Go Deeper In Person?

The 2-day in-person Precision AI Academy bootcamp covers data science and statistical programming in depth — hands-on, with practitioners who build AI systems for a living. 5 U.S. cities. $1,490. 40 seats max. June–October 2026 (Thu–Fri).

Reserve Your Seat