**Development Environment Setup and Usage**



**1. Introduction**



In this course, we will use **Python** as the primary programming language for statistical modeling and machine learning. To keep everyone's development environment consistent, we recommend **Anaconda**, a Python distribution that bundles the conda package and environment manager with libraries commonly used in data science and machine learning. This handout explains how to install and configure the development environment with Anaconda and how to use the tools and libraries needed in the course.



**2. Anaconda Installation**



**2.1 Install Anaconda**

​	1.	Visit the Anaconda official website: https://www.anaconda.com/products/individual and download the Anaconda installer for your operating system.

​	2.	Choose the **Python 3.x** version, click the download button, and run the installer.

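​	3.	During installation on Windows, you may select the option **“Add Anaconda to my PATH environment variable”** so that the conda command is available in any command line. The installer marks this option as not recommended because it can conflict with other software; if you leave it unchecked, run conda from the **Anaconda Prompt** instead.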



**2.2 Verify Installation**



Once the installation is complete, open the command line tool (e.g., cmd or terminal) and run the following commands:

​	•	**Check Anaconda installation**: conda --version

​	•	**Check Python version**: python --version



If both commands print a version number, Anaconda has been installed successfully.
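The same check can be made from inside Python. The snippet below is a minimal sketch: the helper name `python_version_ok` is hypothetical, and the minimum of 3.8 is an assumption taken from the example version used later in this handout.

```python
import sys


def python_version_ok(minimum=(3, 8)):
    """Return True if the running interpreter is at least `minimum` (major, minor)."""
    return sys.version_info[:2] >= minimum


# Print the interpreter version and whether it meets the assumed minimum.
print("Python", sys.version.split()[0])
print("Meets minimum:", python_version_ok())
```

Run it with `python` from the same command line you used for the checks above.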



**3. Create and Manage Virtual Environments**



To avoid conflicts between dependencies in different projects, it is recommended to use **virtual environments** for managing dependencies in each project.



**3.1 Create a Virtual Environment**

​	1.	To create a new virtual environment and specify the Python version (e.g., Python 3.8), run the following command:

```
conda create --name myenv python=3.8
```



​	2.	To activate the virtual environment, run the following command, which is the same on Windows, macOS, and Linux:

​	•	conda activate myenv

​	3.	If you no longer need the virtual environment, you can delete it by running:

```
conda remove --name myenv --all
```
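To confirm which environment a Python session is actually running in, a short check like the following may help. It is a sketch, not part of conda itself: the function name `describe_environment` is hypothetical, and it relies on the `CONDA_DEFAULT_ENV` variable that `conda activate` sets in the shell.

```python
import os
import sys


def describe_environment():
    """Collect basic facts about the interpreter and the active conda environment."""
    return {
        # Name of the activated conda environment, or None if none is active.
        "conda_env": os.environ.get("CONDA_DEFAULT_ENV"),
        # Directory the running interpreter was installed into.
        "prefix": sys.prefix,
        # Full path of the running Python executable.
        "executable": sys.executable,
    }


info = describe_environment()
for key, value in info.items():
    print(f"{key}: {value}")
```

After `conda activate myenv`, the `conda_env` entry should read `myenv` and the prefix should point inside that environment's directory.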





**3.2 Install Dependencies**



With the virtual environment activated, libraries installed using **conda** go into that environment only, so each project keeps its own dependencies. For example, to install NumPy, Pandas, and Matplotlib:

```
conda install numpy pandas matplotlib
```

To install machine learning libraries like scikit-learn, run:

```
conda install scikit-learn
```

**3.3 View Installed Libraries**



To view the libraries and versions installed in the virtual environment:

```
conda list
```
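Installed versions can also be queried from Python with the standard-library `importlib.metadata` module (available from Python 3.8). The sketch below assumes the packages expose standard distribution metadata under the names shown; the helper name `installed_versions` is hypothetical.

```python
from importlib.metadata import PackageNotFoundError, version


def installed_versions(packages):
    """Map each distribution name to its installed version, or None if missing."""
    report = {}
    for name in packages:
        try:
            report[name] = version(name)
        except PackageNotFoundError:
            report[name] = None
    return report


# Distribution names as used by the install commands in this handout.
wanted = ["numpy", "pandas", "matplotlib", "scikit-learn"]
for name, ver in installed_versions(wanted).items():
    print(f"{name}: {ver or 'not installed'}")
```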

**4. Install and Use Jupyter Notebook**



**4.1 Install Jupyter Notebook**



To install **Jupyter Notebook** in the virtual environment, run:

```
conda install jupyter
```

**4.2 Launch Jupyter Notebook**



Once installed, launch Jupyter Notebook by running the following command in the command line:

```
jupyter notebook
```

This will automatically open the Jupyter interface in your browser. You can create new notebooks, write code, and view the output directly.



**4.3 Using Jupyter Notebook**

​	•	**Create a New Notebook**: Click on the “New” button at the top right of the page and select Python 3 to create a new notebook.

​	•	**Run Code Cells**: Select a cell and press **Shift + Enter**, or click the **Run** button at the top to execute the code.

​	•	**Save the Notebook**: Go to **File** -> **Save and Checkpoint** to save your current notebook.
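As a first cell to try, something like the following could be pasted into a new notebook and run with Shift + Enter. It uses only the standard library, and the data values are made up for illustration.

```python
import statistics

# A small made-up sample, used only to try out a notebook cell.
data = [2, 4, 4, 4, 5, 5, 7, 9]

mean = statistics.mean(data)        # arithmetic mean
spread = statistics.pstdev(data)    # population standard deviation

print(f"mean = {mean}, population std = {spread}")
```

The printed line appears directly below the cell, which is the basic edit-and-run loop used throughout the course.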



**5. Install and Use Common Python Libraries**



With Anaconda, you can install the Python libraries most commonly used for data analysis and machine learning. If you already installed them in Section 3.2, conda will simply report that they are present.



**5.1 Install NumPy**



NumPy is a Python library for numerical computing, providing support for large, multi-dimensional arrays and matrices.

```
conda install numpy
```
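Once NumPy is installed, a short session such as the one below can confirm that it works. This is only an illustrative sketch with made-up numbers.

```python
import numpy as np

# A 2 x 3 array of made-up scores: two students, three assignments.
scores = np.array([[82, 90, 77],
                   [65, 70, 88]])

print(scores.shape)

# Mean of each column (assignment), computed across the rows.
col_means = scores.mean(axis=0)
print(col_means)

# Broadcasting subtracts the column means from every row.
centered = scores - col_means
print(centered)
```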

**5.2 Install Pandas**



Pandas is a powerful library for data manipulation and analysis, especially for working with tabular data.

```
conda install pandas
```
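To check that Pandas is working, a small tabular example like the following may be useful. The column names and values are invented for illustration.

```python
import pandas as pd

# A tiny made-up table with a grouping column and a numeric column.
df = pd.DataFrame({
    "group": ["a", "a", "b", "b"],
    "value": [1.0, 3.0, 10.0, 20.0],
})

# Average of `value` within each group.
group_means = df.groupby("group")["value"].mean()
print(group_means)
```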

**5.3 Install Matplotlib**



Matplotlib is a plotting library that helps you create static, animated, and interactive visualizations.

```
conda install matplotlib
```
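A minimal plotting sketch is shown below. It selects the non-interactive `Agg` backend so that it also runs in a plain terminal, and it writes to a hypothetical file name, `squares.png`; in a Jupyter notebook the backend line can be dropped and the figure is displayed inline.

```python
import matplotlib

matplotlib.use("Agg")  # non-interactive backend, so no display is needed
import matplotlib.pyplot as plt

xs = [0, 1, 2, 3, 4]
ys = [x ** 2 for x in xs]

fig, ax = plt.subplots()
ax.plot(xs, ys, marker="o")
ax.set_xlabel("x")
ax.set_ylabel("x squared")

# Hypothetical output file name, written to the current directory.
fig.savefig("squares.png")
```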

**5.4 Install Scikit-learn**



Scikit-learn is a machine learning library that provides simple and efficient tools for data mining and data analysis.

```
conda install scikit-learn
```
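As a quick check that scikit-learn is usable, the sketch below fits a linear regression to made-up data that follows y = 2x + 1 exactly, so the fitted slope and intercept should come out close to 2 and 1.

```python
import numpy as np
from sklearn.linear_model import LinearRegression

# Made-up, noise-free data on the line y = 2x + 1.
X = np.array([[0.0], [1.0], [2.0], [3.0]])
y = 2.0 * X.ravel() + 1.0

model = LinearRegression().fit(X, y)
print("slope:", model.coef_[0])
print("intercept:", model.intercept_)
```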

**6. Recommended Development Environment Setup**



**6.1 Install VS Code**



**VS Code** is a lightweight, powerful development tool that supports Python development and is highly customizable.

​	1.	Visit the official website: https://code.visualstudio.com/ to download and install.

​	2.	In VS Code, install the **Python extension**:

​	•	Open VS Code, click on the Extensions icon on the left side, search for the **Python** extension, and install it.

​	3.	Configure the Python interpreter:

​	•	Open VS Code, press **Ctrl+Shift+P** (**Cmd+Shift+P** on macOS), type and select **Python: Select Interpreter**, then choose the Python interpreter from your virtual environment.



**6.2 Install PyCharm**



**PyCharm** is a professional IDE for Python development, particularly suited for larger projects.

​	1.	Visit the official website: https://www.jetbrains.com/pycharm/ to download and install.

​	2.	Configure the virtual environment:

​	•	In PyCharm, open the **Python Interpreter** settings for your project and select the conda environment you created in Section 3 as the project interpreter.



**7. Version Control and Collaboration**



**7.1 Install and Use Git**



**Git** is a distributed version control tool widely used for team collaboration and code management.

​	1.	Download and install Git from: https://git-scm.com/downloads

​	2.	Configure Git:

After installation, run the following commands in the command line to configure your username and email:

```
git config --global user.name "Your Name"
git config --global user.email "youremail@example.com"
```





**7.2 Use GitHub or GitLab**

​	•	Create an account on GitHub or GitLab to host and manage your code.

​	•	Learn basic Git operations like git init, git clone, git add, git commit, git push, etc.



**8. Conclusion**



In this handout, we introduced how to set up the Python development environment using **Anaconda**, how to use **Jupyter Notebook** for interactive programming, and how to install common Python libraries. We also covered how to configure development tools like **VS Code** and **PyCharm**, and how to set up **Git** for version control. With these tools, you will be able to perform data processing, statistical modeling, and machine learning tasks effectively.



