Skip to content

Repository for course description of JHU Data Science in R Specialization

License

Notifications You must be signed in to change notification settings

anshabhi/JHU-R-Data-Science

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

3 Commits
 
 
 
 
 
 

Repository files navigation

Forks Stargazers Issues MIT License LinkedIn


Logo

Data Science Specialization

This repository talks about my learnings through coursework and projects that I completed in JHU data science with R specialization.

Table of contents

About The Specialization

Certificate

This Data Science with R specialization is an introductory level 10-course specialization offered by renowned professors from the Johns Hopkins University. This specialization teaches one the fundamental concepts and techniques to provide the perfect platform for getting started with Data Science.

Built With

I learned and made use of the following R packages in this specialization:

List of Courses and Things Learnt

  • Overview of data, questions, and tools that data scientists work with.
  • Introduction to version control, markdown, git, GitHub, R, and RStudio.

Course 2. R Programming

  • Installation and configuration of R
  • Programming in R, reading data into R, accessing R packages, writing R functions, debugging, profiling R code, and organizing and commenting R code
  • Obtaining data from the web, from APIs, from databases in various formats
  • Basics of data cleaning and data sharing
  • Data components such as raw data, processing instructions, codebooks, and processed data.
  • Essential exploratory techniques for summarizing data
  • Plotting systems in R, basic principles of constructing data graphics
  • Multivariate statistical techniques used to visualize high-dimensional data
  • Reporting data analyses in a reproducible manner
  • knitr R library
  • Drawing conclusions about scientific truths from data.
  • Bayesian and likelihood theories
  • Resolving missing data, observed and unobserved confounding, biases
  • Regression analysis, least squares and inference using regression models
  • ANOVA and ANCOVA, analysis of residuals and variability
  • Model selection methods
  • Scatterplot Smoothing
  • Basic components of building and applying prediction functions for real world applications.
  • Training and tests sets, overfitting, error rates, feature creation and evaluation.
  • Regression, classification trees, Naive Bayes, and random forests.
  • Drawing conclusions about scientific truths from data.
  • Bayesian and likelihood theories
  • Resolving missing data, observed and unobserved confounding, biases

Capstone Project

Capstone Presentation

Live Deployment

Certificate

The Capstone Project was building a next word prediction app using a commerical dataset provided by SwiftKey.

The app is available with live demo at the above URL. This is a R Shiny app.

The Capstone Presentation contains more details about the how the model works. This presentation was made using R Markdown.

Further Recommended Readings

About

Repository for course description of JHU Data Science in R Specialization

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published