Skip to content

This repository provides materials for the session on Regular Expressions with Stringr that is part of the I2DS Tools for Data Science workshop run at the Hertie School, Berlin in November 2021.

intro-to-data-science-21-workshop/05-AngelaDuarte-RegExp

Repository files navigation

Regular Expressions (RegExp)

Summary

This repository provides materials for the session on Regular Expressions with Stringr that is part of the I2DS Tools for Data Science workshop run at the Hertie School, Berlin in November 2021.

The student-run workshop is part of the course Introduction to Data Science taught by Simon Munzert in Fall 2021.

Session contents

This session will introduce you to the basic Regular Expressions and how to use them with some basic Stringr functions. Regular expressions are conventions used to describe and find patterns in strings (text data) that work across programming languages.

To learn how to use them, this session will include:

  • an overview of what these expressions allow us to do,
  • an explanation of the key conventions,
  • examples on how to combine them to solve problems,
  • a guide to solve your own exercises.

Main learning objectives

This session on Regexps has 3 main objectives:

  1. Equip participants with an understanding of what Regexps are and what they are useful for.

  2. Identify and use key regular expressions and what patterns do they account for.

  3. Provide participants practice material as well as some further readings on Regexps.

Instructors

Further resources

  1. R for Data Science book -part on StringR and Regular Expressions
  2. Automated Data Collection with R. A Practical Guide to Web Scraping and Text Mining. Chapter8
  3. Interactive Tutorial on Regular Expressions
  4. Hands-on Tutorial on Regular Expressions on R
  5. R Regex Tester

Statement of contributions

Ana Deniz prepared the main presentation material, the slides and the video.

Ángela Duarte Pardo prepared the main presentation material, the tutorial exercises, and the video.

A note on the documents in the repo.

In this repo, there are several documents related to the session on Regexps with Stringr. Specially interesting are:

  1. Regexps (.rmd, .html) has all the materials (explanation and solved exercises).

  2. RegexpExercises (.rmd) contains the instructions to the exercises for the audience to try out (no solutions!).

  3. Regular Expressions with Stringr (.pdf) contains the presentation.

License

The material in this repository is made available under the MIT license.

About

This repository provides materials for the session on Regular Expressions with Stringr that is part of the I2DS Tools for Data Science workshop run at the Hertie School, Berlin in November 2021.

Topics

Resources

Stars

Watchers

Forks

Languages