Welcome to my Data Science with Python course! I am happy to share my work and I am even happier if it can be useful.
- Data Structures
- Lists
- Tuples
- Sets
- Dictionaries
- Numpy arrays
- Pandas DataFrames
- Pyspark DataFrames
- Data Exploration
- Import, export data
- Descriprives and summary statistics
- Pivot tables and aggregation
- Data Types
- Numerical data
- String data
- Time data
- Missing data
- Data Wrangling
- Rows: sorting, indexing, ....
- Columns: renaming, ordering, ....
- Collapse and aggregate
- Reshape
- Concatenate and merge
- Plotting
- Distributions
- Time Series
- Correlations
- Regression
- Geographical data
- Machine Learning Pipeline
- Data exploration
- Encoding and normalization
- Missing values
- Weighting
- Prediction
- Cross-validation
- Web Scraping
- Pandas
- APIs
- Static Webscraping
- Dynamic Webscraping
- TBD
- What is missing? Let me know!
Please, if you find any typos or mistakes, open a new issue. Or even better, fork the repo and submit a pull request. All feedback is greatly appreciated!