Introduction to Machine Learning

Machine learning is an application of artificial intelligence (AI) that provides systems the ability to automatically learn and improve from experience without being explicitly programmed.

Learning objectives

In this workshop, you will learn the following skills:

How to use skills from the NLTK workshop to build features for a classification task
How to build a text classification system that can predict whether sentences belong to one category ("news") or another ("romance")
How to model the topics in a corpus based on the distributions of words across the documents
How to group data and perform calculations on the aggregations
How to prepare data for machine learning using pandas, a package for Python that helps to organize your data
How to use the scikit-learn package for Python to perform different types of machine learning on the data
How to evaluate the results of machine learning algorithms
How to visualize observations, aggregations, and algorithmic results

This workshop will review key concepts for understanding how machine learning works, and walk participants through the process of analyzing data using statistical and machine learning methods.

Much gratitude to Kelsey Chatlosh, Lisa Rhody, and Michael Grossberg for substantive feedback that worked its way into content.

Get Started >>>

Introduction
Installation and Setup
What Is Classification?
Getting Our Data
Extracting Features
Supervised Machine Learning
Supervised Classification Algorithm with sklearn
Unsupervised Machine Learning
Review
Resources

Appendices:

Session leaders: Rachel Rakov and Hannah Aizenman
Based on previous work by: Rachel Rakov and Hannah Aizenman

Digital Research Institute (DRI) Curriculum by Graduate Center Digital Initiatives is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License. Based on a work at https://github.com/DHRI-Curriculum. When sharing this material or derivative works, preserve this paragraph, changing only the title of the derivative work, or provide comparable attribution.

Name		Name	Last commit message	Last commit date
Latest commit History 194 Commits
notebooks		notebooks
sections		sections
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
df_news_romance.csv		df_news_romance.csv
df_nouns_adjs.csv		df_nouns_adjs.csv

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

notebooks

notebooks

sections

sections

.gitignore

.gitignore

LICENSE

LICENSE

README.md

README.md

df_news_romance.csv

df_news_romance.csv

df_nouns_adjs.csv

df_nouns_adjs.csv

Repository files navigation

Introduction to Machine Learning

Learning objectives

About

Releases

Packages

Languages

License

DHRI-Curriculum/machine-learning

Folders and files

Latest commit

History

Repository files navigation

Introduction to Machine Learning

Learning objectives

About

Resources

License

Stars

Watchers

Forks

Languages