Skip to content

Make-School-Courses/DS-1.1-Data-Analysis

Repository files navigation

Make School Logo

DS 1.1 Data Analysis & Visualization

Course Description

In this course, students learn the foundational skills of data science, including data collection, scrubbing, analysis, and visualization with modern tools and libraries. Students gain a strong grounding in statistical concepts, utilize statistical techniques and master the science and art of data exploration and visualization to tell stories and persuade decision makers with data-driven insights.

Why should you take this class?

The skill set gained in this course will jump start your career as a Data Scientist, Data Engineer or Data Analyst, if that is your goal. If you are a Software Engineer, these skills will enable you to effectively interface with a Data Team.

Prerequisites

CS 1.2
QL-1.1

Course Specifics

Course Delivery: Synchronous | 7 weeks | 19 sessions

Course Credits: 3 units | 37.5 Contact Hours/Term | 92 Non-Contact Hours/Term | 129.5 Total Hours/Term

Learning Outcomes

By the end of this course, students will be able to...

  1. Perform exploratory data analysis and visualization of a data set using histograms, correlations and other mathematical tools
  2. Understand and use descriptive statistics, probability distributions and confidence intervals
  3. Carry out hypothesis tests, and understand when to reject or accept a null hypothesis
  4. Describe and implement a plan for cleaning and standardizing a data set
  5. Implement a machine learning pipeline for a simple model

Schedule

Course Dates: Monday, May 31 – Friday, July 16, 2021 (7 weeks)

Class Times: Monday, Wednesday, Friday at 9:30am–11:15am (19 class sessions)

Class Date Topics Assignments and Quizzes
- Mon, May 31 No Class - Memorial Day
1 Wed, June 2 Course Overview
Introduction to Numpy
2 Fri, June 4 Data Manipulation with Pandas, Part 1 Pandas and Numpy Challenge assigned
3 Mon, June 7 Data Manipulation with Pandas, Part 2
4 Wed, June 9 Data Visualization, Part 1
5 Fri, June 11 Data Visualization, Part 2 Pandas and Numpy Challenge due
Data Visualization Challenge assigned
6 Mon, June 14 Probability and Statistics, Part 1
7 Wed, June 16 Probability and Statistics, Part 2
8 Fri, June 18 Probability Distributions, Part 1 Data Visualization Challenge due
Probability and Statistics Challenge assigned
9 Mon, June 21 Probability Distributions, Part 2
10 Wed, June 23 Counting Problems
11 Fri, June 25 Outliers, Boxplots, and Correlation, Part 1 Probability and Statistics Challenge due
Probability Distributions and Outliers Challenge assigned
12 Mon, June 28 Outliers, Boxplots, and Correlation, Part 2
13 Wed, June 30 Conditional Probability
14 Fri, July 2 Bayes' Rule Probability Distributions and Outliers Challenge due
Hypothesis Testing Challenge assigned
- Mon, July 5 No Class - Independence Day Observed
15 Wed, July 7 Hypothesis Testing, Part 1
16 Fri, July 9 Hypothesis Testing, Part 2 Hypothesis Testing Challenge due
Machine Learning Challenge assigned
17 Mon, July 12 Intro to Machine Learning, Part 1
18 Wed, July 14 Intro to Machine Learning, Part 2
19 Fri, July 16 Lab Day Machine Learning Challenge due

Class Assignments

We will be using Gradescope, which allows us to provide fast and accurate feedback on your work. All assigned work will be submitted through Gradescope, and assignment and exam grades will be returned through Gradescope.

As soon as grades are posted, you will be notified immediately so that you can log in and see your feedback. You may also submit regrade requests if you feel we have made a mistake.

Your Gradescope login is your Make School email, and your password can be changed at https://gradescope.com/reset_password. The same link can be used if you need to set your password for the first time.

Projects

  • Pandas and Numpy
  • Data Visualization
  • Probability and Statistics
  • Probability Distributions and Outliers
  • Hypothesis Testing
  • Machine Learning project: You will choose a dataset to clean, investigate, and make predictions on
  • All projects will be submitted on Gradescope

Evaluation

To pass this course you must meet the following requirements:

  • Actively participate in class and abide by the Attendance Policy
  • Complete and pass all Assignments and Projects with a score of above 70%
  • Make up classwork from all absences

Information Resources

Data Science Technical Interview Topics Covered in this course

  • Data Exploration
  • Data Visualization
  • Data Cleaning
  • Outlier identification
  • Data Preprocessing
  • Applied Probability and statistics
  • Probability distributions
  • Hypothesis Testing
  • Confidence intervals
  • Machine Learning Model
  • Machine Learning Pipeline

Students are expected to practice academic integrity in all of its forms, including abstaining from plagiarism, cheating, and other forms of academic misconduct. Make School reserves the right to determine in any given instance what action constitutes a violation of academic honesty and integrity. Plagiarism, defined as the practice of presenting another's work or ideas as one’s own, is an act of academic dishonesty and is a serious ethical and scholarly violation.

Copying text or ideas, whether verbatim or by paraphrasing from a source without using proper citation, is not accepted at Make School. Any materials incorporated into your work, regardless of format, must be properly acknowledged using a citation style appropriate for the discipline of the course.

Though plagiarism may be the most common form, other violations of scholarly integrity also constitute cheating, including:

  • Using or copying information from another student’s code or written work;
  • Copying information from another student’s test or using unauthorized materials during an examination, whether an in-class or take-home exam;
  • Buying, selling, or stealing test questions, answers, or term papers;
  • Doing work or taking tests on behalf of another student or submitting work done by another person;
  • Falsifying data or laboratory results; and
  • Submitting the same work for more than one course without explicit instructor approval.

If an incident of plagiarism or cheating occurs, the instructor will investigate the incident and consult with the Dean. If the student has been found to have committed an act of academic dishonesty, an Academic Misconduct Report will be filed and the student will be placed on a Participation Improvement Plan (PIP). A student who believes they have been wrongly accused of plagiarism or cheating, or that the instructor’s resolution of the alleged incident is unjust, may file a Request for Appeal of Disciplinary Action.

Make School Course Policies