Statistics for Data Science

Welcome to the "Statistical for Data Science" repository – your go-to resource for mastering essential statistical concepts in data science. This repository provides concise code implementations, analyses, and practical exercises, offering a seamless blend of theory and application. Whether you're a beginner seeking a solid foundation or an experienced practitioner aiming to enhance your statistical skills, this repository covers crucial topics like descriptive statistics, hypothesis testing, regression analysis, and more. Elevate your data science journey with clear explanations and hands-on coding. Happy exploring!

Data:

Descriptive Statistics

Statistical Experiments

Statistical experiments involve systematically collecting and analyzing data to test hypotheses or answer research questions.

A/B Testing: Compares two versions of a variable to determine which performs better.
Resampling: Technique that involves repeatedly drawing samples to obtain additional information about a population.
Power and Sample Size: Ensures experiments have sufficient sensitivity to detect meaningful effects.
Multi-Arm Bandit Algorithm: Optimization method for decision-making in scenarios with multiple treatment options.

Statistical Inference

Statistical inference involves making generalizations about populations based on sample data.

Hypothesis Tests: Evaluates evidence against a null hypothesis to make inferences about populations.
Statistical Significance and p-values: Determines the probability of observed results by chance.
t-Tests: Compares means between two groups.
Multiple Testing: Adjusts for increased risk of false positives when conducting multiple hypothesis tests.
ANOVA (Analysis of Variance): Compares means among three or more groups.
Chi-Square Test: Assesses independence between categorical variables.

Statistical Learning

Resampling methods
Machine learning and Deep Leraning for times series.

Stadistical modeling

Statistical Models for Time Series: Autoregressive Models, Moving Average Models, Autoregressive Integrated Average Models

Programming

R
Python
Julia

Name		Name	Last commit message	Last commit date
Latest commit History 14 Commits
.ipynb_checkpoints		.ipynb_checkpoints
data		data
.gitignore		.gitignore
ANOVA.ipynb		ANOVA.ipynb
Chi-Square Test.ipynb		Chi-Square Test.ipynb
README.md		README.md
Resampling.ipynb		Resampling.ipynb
Statistical Experiments-Inference.ipynb		Statistical Experiments-Inference.ipynb
Statistical Models for Time Series.ipynb		Statistical Models for Time Series.ipynb
p-values and t-test.ipynb		p-values and t-test.ipynb
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Statistics for Data Science

Descriptive Statistics

Statistical Experiments

Statistical Inference

Statistical Learning

Stadistical modeling

Programming

About

Releases

Packages

Languages

galois1915/Statistics-For-Data-Science

Folders and files

Latest commit

History

Repository files navigation

Statistics for Data Science

Descriptive Statistics

Statistical Experiments

Statistical Inference

Statistical Learning

Stadistical modeling

Programming

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages