This repository contains comprehensive educational materials for an introductory Python course (IND216) designed for analytics students. The course emphasizes practical data wrangling skills using Python's technical and scientific computing ecosystem, making it ideal for students with minimal or no programming background.
Rather than attempting an exhaustive coverage of Python's features, this course focuses on building a solid foundation in:
- Core Python programming concepts
- Data manipulation fundamentals
- Scientific computing with NumPy
- Data analysis with Pandas
- Location:
/notes/
- Format: LaTeX source files and compiled PDF
- Content: Complete course notes covering six modules of Python programming for data analysis
- Building: Run
pdflatex -shell-escape notes.tex
from the notes directory
The repository also hosts a Quarto-based website featuring a five-part introduction to Python for MSc students in analytics, providing interactive examples and exercises for hands-on learning.
Upon completion of this course, students will be able to:
- Write Python scripts for data processing tasks
- Manipulate and analyze data using NumPy arrays
- Perform data wrangling operations with Pandas DataFrames
- Develop reproducible data analysis workflows
No prior programming experience is required. The course is designed to take students from zero coding knowledge to practical data analysis skills.
-
Clone the repository:
bash git clone https://github.com/simoneSantoni/intro-to-Python-IND216.git
-
Set up Python environment (recommended):
bash conda create -n ind216 python=3.9 conda activate ind216 conda install numpy pandas matplotlib jupyter
-
Access course materials:
- PDF notes are available in
/notes/notes.pdf
after compilation - Interactive notebooks and examples are throughout the repository
- PDF notes are available in
- Organization of the Notes and IND216 - Course overview and learning approach
- Getting Started with Python - Installation, environments, and running programs
- Python Objects - Data types, control flow, and basic programming constructs
- Technical & Scientific Computation with NumPy - Array operations and numerical computing
- Data Management with Pandas - DataFrames and data manipulation
- Applied Examples and Best Practices - Real-world data analysis workflows
This is an educational repository. If you find errors or have suggestions for improvements, please open an issue or submit a pull request.
This educational material is provided under the MIT License. See the LICENSE file for details.
For questions about the course content or materials, please open an issue in this repository.