boakyejeff/first_python_stats

Python Statistical Models & Basics

This repository contains a collection of Python scripts covering fundamental data science, probability, and machine learning concepts. The code has been refactored for clarity and follows the PEP 8 style guide.

Project Structure

The repository is organized by topics and modules, encompassing:

  • Core Python Concepts: Functions, classes, and object-oriented programming.
  • Data Manipulation: Vectorized data formatting and numerical computation with pandas and NumPy.
  • Data Visualization: Charts and plots for presenting results.
  • Statistical Modeling: Multivariate analyses using scikit-learn and statsmodels.
  • System Performance: Benchmarking scripts, including iterative brute-force efficiency tests.
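
As a rough illustration of the System Performance topic, the sketch below times an iterative brute-force computation against a closed-form alternative using only the standard library. It is not taken from the repository's scripts: the function names (`sum_squares_loop`, `sum_squares_formula`, `benchmark`) and the choice of problem are hypothetical, picked only to show one way such a comparison might be set up.

```python
import timeit


def sum_squares_loop(n):
    """Brute force: add up i*i for i in range(n), one term at a time."""
    total = 0
    for i in range(n):
        total += i * i
    return total


def sum_squares_formula(n):
    """Closed form for 0^2 + 1^2 + ... + (n-1)^2."""
    return (n - 1) * n * (2 * n - 1) // 6


def benchmark(func, n, repeat=5, number=10):
    """Return the best per-call time in seconds over several repeats."""
    timer = timeit.Timer(lambda: func(n))
    return min(timer.repeat(repeat=repeat, number=number)) / number


if __name__ == "__main__":
    n = 100_000
    # Both approaches must agree before their timings are worth comparing.
    assert sum_squares_loop(n) == sum_squares_formula(n)
    for func in (sum_squares_loop, sum_squares_formula):
        print(f"{func.__name__}: {benchmark(func, n):.6f} s per call")
```

Taking the minimum over several repeats, rather than the mean, tends to reduce noise from other processes on the machine. Actual timings depend on hardware and Python version.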

Setup & Installation

To run these scripts locally, make sure Python 3 is installed. Using a virtual environment is recommended.

  1. Clone the repository:
    git clone https://github.com/boakyejeff/first_python_stats.git
    cd first_python_stats
  2. Create and activate a virtual environment:
    python -m venv venv  # use `python3` if `python` does not point to Python 3
    source venv/bin/activate  # On Windows, use `venv\Scripts\activate`
  3. Install dependencies:
    pip install -r requirements.txt

Requirements

Dependencies are tracked in requirements.txt. Key libraries include:

  • pandas
  • numpy
  • scikit-learn
  • scipy
  • statsmodels
  • matplotlib / seaborn
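
After installing, a quick way to confirm that the key libraries are available is to check whether each one can be located for import. The sketch below is an assumption-laden helper, not part of the repository: the `KEY_LIBRARIES` mapping and the `find_missing` function are hypothetical names. Note that scikit-learn is installed as `scikit-learn` but imported as `sklearn`.

```python
import importlib.util

# Maps pip package names to import names (hypothetical helper mapping).
# scikit-learn is the one entry where the two names differ.
KEY_LIBRARIES = {
    "pandas": "pandas",
    "numpy": "numpy",
    "scikit-learn": "sklearn",
    "scipy": "scipy",
    "statsmodels": "statsmodels",
    "matplotlib": "matplotlib",
    "seaborn": "seaborn",
}


def find_missing(libraries=KEY_LIBRARIES):
    """Return the pip names of libraries that cannot be located for import."""
    return [
        pip_name
        for pip_name, module_name in libraries.items()
        if importlib.util.find_spec(module_name) is None
    ]


if __name__ == "__main__":
    missing = find_missing()
    if missing:
        print("Missing:", ", ".join(missing))
    else:
        print("All key libraries are importable.")
```

Run it inside the activated virtual environment so that it checks the same interpreter the scripts will use.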

License

MIT License.
