| output |
|---|
github_document |
Data Analysis and Prediction Algorithms with R, Third Edition
By Daniel Paredes Inilupu
https://dparedesi.github.io/Data-Science-with-R-book/
| Part | Topics |
|---|---|
| I. Fundamentals | R objects, functions, data frames, tidyverse |
| II. Visualization | ggplot2, gapminder case study |
| III. Statistics | Probability, distributions, inference |
| IV. Data Wrangling | Importing data, text mining |
| V. Machine Learning | tidymodels, classification, regression, clustering |
| VI. Applied Cases | Real estate analysis, Google Analytics |
| VII. Generative AI | LLM APIs, AI-assisted coding |
| Appendix | Ethics checklist |
- Modern Tidyverse: Uses the native pipe (
|>) and moderndplyrverbs - Tidymodels: Full ML chapters using the
tidymodelsframework - Real-World Cases: Includes new case studies with real datasets
- GenAI Integration: Covers using LLMs and APIs within R workflow
- Ethics: Dedicated sections on algorithmic bias and ethics checklist
- R: 4.5.2+
- Framework: bookdown
- ML: tidymodels
- Visualization: ggplot2
# Install dependencies
source("setup_dependencies.R")
# Build the book
bookdown::render_book("index.Rmd")The output will be generated in the docs/ folder.
All datasets used in this book are hosted directly in this repository and served via GitHub Pages. you can browse them in the docs/data/ directory.
To access them in R, you can use the URL pattern:
https://dparedesi.github.io/Data-Science-with-R-book/data/[filename]
Example:
library(readr)
url <- "https://dparedesi.github.io/Data-Science-with-R-book/data/student-grades.csv"
grades <- read_csv(url)├── 01.intro/ Introduction
├── 02.fundamentals/ Objects, Functions, Data Frames
├── 03.visualization/ ggplot2, Gapminder Case Study
├── 04.statistics/ Probability and Inference
├── 05.wrangling/ Data Import, Text Mining
├── 06.machine-learning/ Supervised & Unsupervised Learning
├── 07.real-cases/ Applied Case Studies
├── 08.genai/ Generative AI with R
├── 09.appendix/ Ethics Checklist
If you use this book in your work, please cite:
@book{paredes2024datascience,
author = {Paredes Inilupu, Daniel},
title = {Data Science with R: Data Analysis and Prediction Algorithms},
year = {2025},
edition = {3rd},
publisher = {Leanpub},
url = {https://leanpub.com/data-science-with-r}
}You can support this effort by purchasing the PDF version on Leanpub. The purchase includes access to future updates.
See CONTRIBUTING.md for guidelines on how to report errors, suggest improvements, or submit fixes.
This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.
Questions or suggestions? Email: dparedesi@uni.pe
