Skip to content

intro-to-data-science-23-workshop/01-data-wrangling-dreyer-ramirez-kakade

Repository files navigation

Cleaning and wrangling data with janitor and forcats

Summary

This repository provides materials for our session on how to clean and wrangle data using janitor and forcatsas part of the I2DS Tools for Data Science workshop run at the Hertie School, Berlin in October 2023. The student-run workshop is part of the course Introduction to Data Science taught by Simon Munzert at the Hertie School, Berlin, in Fall 2023.

Session contents

This session will introduce you to the intricacies of factor management with R using the "forcats" package, as well as data cleaning and tidying with the "janitor" package. Both packages are essential for efficient data manipulation and ensuring clean and consistent datasets. Have a look at our presentation to learn more!

Main learning objectives

The goals of this session are to:

  1. Equip you with conceptual knowledge about the "forcats" and "janitor" packages.
  2. Demonstrate various functions and utilities provided by both packages.
  3. Provide you with practice material on how to efficiently wrangle and clean data with both packages.

Instructors

  • Elena Dreyer
  • Luis Fernando Ramirez Ruiz
  • Shruti Kakade

Resources

janitor

For original material on the janitor package

forcats

Further resources

Quiz and practice materials

License

The material in this repository is made available under the MIT license.

Statement of contributions

Elena Dreyer prepared the presentation slides for forcats, recorded the session on forcats and created the Mentimeter quiz.

Luis Fernando Ramirez Ruiz prepared the presentation slides for janitor, recorded the session on janitor and created the practice material.

About

Data Wrangling using 'janitor' and 'forcats'

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published