This repository serves as a tutorial for beginner level data science practitioners to acquaint with some of the packages and tools that most data science projects use.
- NumPy ndarray
- Array Creation
- Basic Operation
- Iterating, Indexing and Slicing
- Manipulating ndarray
- Exercise for Array Creation & Basic Operation
- Exercise for Iterating, Indexing and Slicing & Manipulating ndarray
- Fundamentals of Pandas
- Basic Functions
- Common Operations
- Data Cleanup and Missing Data
- Exercise for Fundamentals of Pandas & Basic Functions
- Exercise for Common Operations & Data Cleanup and Missing Data
- Getting Started with PyPlot
- Getting Started with PyPlot (Exercise)
- Creating Different Types of Graphs with PyPlot
- Creating Different Types of Graphs with PyPlot (Exercise)
-
Ensure that you have Anaconda or Miniconda installed in your system.
-
Clone this repository into your local machine.
git clone https://github.com/CertifaiAI/basics-of-data-analytics.git
Alternatively, you may just "Download ZIP" and unzip the folder locally.
-
Open terminal and cd to the
basics-of-data-analytics
folder.cd <your-path>/basics-of-data-analytics
-
Create a new Conda environment using the the following command line.
conda create --name basics-of-data-analytics python=3.7
Alternatively, you may create a new
conda
environment usingenvironment.yml
.conda env create -f environment.yml
-
Activate the newly created environment
basics-of-data-analytics
.conda activate basics-of-data-analytics
-
Install Jupyter Notebook using conda command.
conda install -c conda-forge jupyterlab
You may skip this step if you've create the environment using
environment.yml
file in step 4. -
Open Jupyter Notebook using
jupyter notebook
-
Every package folder (Eg: NumPy) has an "Exercise" folder and a "Solution" folder. Follow along the notebooks in the "Exercise" folder.
But if you couldn't resolve certain exercises or tasks within those notebooks in the "Exercise" folder, refer to the solution counterpart of that notebook located in the "Solution" folder (has the same file name)
Eg:Numpy/exercise/01 - NumPy ndarray.ipynb
->Numpy/solution/01 - NumPy ndarray.ipynb