This repository contains exercises for the “Lede 12” summer program at Columbia University.
- Python dictionaries, US presidents
- Dictionaries, 2 (movies)
- List of dictionaries, loops
- Programmer, ...
- NYT and Spotify APIs
- Forecast API
- Pandas: Cats & dogs, Millionaires, Train stations; `df['column'].value_counts()`, `df['column'].idxmax()`, `df.groupby()`
- Pandas, my own data sets; `df[df['Station'].str.match('^A')]`
- Python (≠Jupyter), Quakebot; `def`, `getPOSIX()`
- “Scraping and Saving” (Scraper.py, Mailer.py, `crontab -e`)
- Parking violations (985 MB CSV)
- (No exercise)
- Date index: `resample`, `df.groupby`
- Term Frequency-Inverse Document Frequency!
- (Short paper)
- Queries through pg8000
- Basic Scraping
- List slices, `[sqrt(i) for i in numbers if i < 100]`, parse menu using regexp
- Insert with pg8000: cat-cafes
- Web Application using Flask (lakes)
- (Twitter bot - sent via email)
Submission: link
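
The list-slice and comprehension exercise named in the list above can be tried out with a minimal sketch like the following. The contents of `numbers` are hypothetical sample values chosen for illustration (the exercise's own data is not shown here), and `sqrt` is assumed to come from the standard `math` module.

```python
from math import sqrt

# Hypothetical sample data; the exercise's real list is not shown in this README.
numbers = [4, 9, 16, 100, 144, 25]

# List slices: the start index is inclusive, the stop index is exclusive.
first_three = numbers[:3]    # the first three items
last_two = numbers[-2:]      # the last two items
every_other = numbers[::2]   # every second item, starting at index 0

# The comprehension from the exercise list: square roots of the values below 100.
roots = [sqrt(i) for i in numbers if i < 100]

print(first_three, last_two, every_other)
print(roots)
```

The `if i < 100` clause filters before `sqrt` is applied, so 100 and 144 never reach the function call.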
- Mean, Mayoral excuses, prime numbers link
- [No homework]
- Sorting, search link
- Statistical analysis (mode, quantile, IQR), correlation link
- Slope, Intercept; Correlation; FiveThirtyEight article about Obama
- No homework
- Tree classifier (breast cancer set)
- Group work, no homework; made these slides about decision trees (with node numbers)
- Write 5-fold classification function; RandomForestClassifier; Group work about the iris dataset link
- Classification: newsgroups (class: about Naive Bayes)
- Class: clustering
- Class: face recognition, Google BigQuery
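
The statistics items in the list above (mode, quantile, IQR, slope, intercept, correlation) can be sketched with the standard library alone. This is an illustrative sketch, not the course's solution: the sample values and the helper names `fit_line` and `pearson_r` are assumptions made up for this example.

```python
import math
import statistics

# Hypothetical sample values, not data from the course.
values = [1, 2, 2, 3, 4, 5, 6, 7, 8]

mode = statistics.mode(values)                       # most frequent value
q1, median, q3 = statistics.quantiles(values, n=4)   # the three quartile cut points
iqr = q3 - q1                                        # interquartile range


def fit_line(xs, ys):
    """Return the least-squares (slope, intercept) for paired samples."""
    mean_x = statistics.mean(xs)
    mean_y = statistics.mean(ys)
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


def pearson_r(xs, ys):
    """Return the Pearson correlation coefficient for paired samples."""
    mean_x = statistics.mean(xs)
    mean_y = statistics.mean(ys)
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    return sxy / math.sqrt(sxx * syy)


# Hypothetical paired data lying exactly on the line y = 2x + 1.
xs = [1, 2, 3, 4, 5]
ys = [3, 5, 7, 9, 11]
slope, intercept = fit_line(xs, ys)
r = pearson_r(xs, ys)

print(mode, q1, median, q3, iqr)
print(slope, intercept, r)
```

`statistics.quantiles` uses its default `method='exclusive'` here, so the quartiles can differ slightly from those reported by other tools, which often default to a different interpolation rule.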
Good kinds of pancakes:
- Okonomiyaki
- Dorayaki
- Silver dollars
- Injera

We are doing some stuff right here

We are doing some other stuff