Statsmodels: statistical modeling and econometrics in Python



About Statsmodels

Statsmodels is a Python package that complements SciPy for statistical computations, including descriptive statistics, and estimation and inference for statistical models.


Documentation

The documentation for the latest release is at

The documentation for the development version is at

Recent improvements are highlighted in the release notes

Backups of documentation are available at and

Main Features

  • Linear regression models:
    • Ordinary least squares
    • Generalized least squares
    • Weighted least squares
    • Least squares with autoregressive errors
    • Quantile regression
    • Recursive least squares
  • Mixed Linear Model with mixed effects and variance components
  • GLM: Generalized linear models with support for all of the one-parameter exponential family distributions
  • Bayesian Mixed GLM for Binomial and Poisson
  • GEE: Generalized Estimating Equations for one-way clustered or longitudinal data
  • Discrete models:
    • Logit and Probit
    • Multinomial logit (MNLogit)
    • Poisson and Generalized Poisson regression
    • Negative Binomial regression
    • Zero-Inflated Count models
  • RLM: Robust linear models with support for several M-estimators.
  • Time Series Analysis: models for time series analysis
    • Complete StateSpace modeling framework
      • Seasonal ARIMA and ARIMAX models
      • VARMA and VARMAX models
      • Dynamic Factor models
      • Unobserved Component models
    • Markov switching models (MSAR), also known as Hidden Markov Models (HMM)
    • Univariate time series analysis: AR, ARIMA
    • Vector autoregressive models, VAR and structural VAR
    • Vector error correction model, VECM
    • Exponential smoothing, Holt-Winters
    • Hypothesis tests for time series: unit root, cointegration and others
    • Descriptive statistics and process models for time series analysis
  • Survival analysis:
    • Proportional hazards regression (Cox models)
    • Survivor function estimation (Kaplan-Meier)
    • Cumulative incidence function estimation
  • Multivariate:
    • Principal Component Analysis with missing data
    • Factor Analysis with rotation
    • MANOVA
    • Canonical Correlation
  • Nonparametric statistics: Univariate and multivariate kernel density estimators
  • Datasets: Datasets used for examples and in testing
  • Statistics: a wide range of statistical tests
    • diagnostics and specification tests
    • goodness-of-fit and normality tests
    • functions for multiple testing
    • various additional statistical tests
  • Imputation with MICE, regression on order statistic and Gaussian imputation
  • Mediation analysis
  • Graphics includes plot functions for visual analysis of data and model results
  • I/O
    • Tools for reading Stata .dta files, but pandas has a more recent version
    • Table output to ascii, latex, and html
  • Miscellaneous models
  • Sandbox: statsmodels contains a sandbox folder with code in various stages of development and testing which is not considered "production ready". This covers, among others:
    • Generalized method of moments (GMM) estimators
    • Kernel regression
    • Various extensions to scipy.stats.distributions
    • Panel data models
    • Information theoretic measures

How to get it

The master branch on GitHub has the most up-to-date code.

Source downloads of release tags are available on GitHub.

Binaries and source distributions are available from PyPI.

Binaries can be installed in Anaconda:

conda install statsmodels
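
After installing, one way to confirm that the package is visible to the current Python environment is to query its installed version. The sketch below uses only the standard library (`importlib.metadata`, available in Python 3.8 and later); the helper name `installed_version` is hypothetical and not part of statsmodels.

```python
from importlib import metadata


def installed_version(package):
    """Return the installed version string of `package`, or None if it is absent."""
    try:
        return metadata.version(package)
    except metadata.PackageNotFoundError:
        return None


version = installed_version("statsmodels")
if version is None:
    print("statsmodels is not installed in this environment")
else:
    print(f"statsmodels {version} is installed")
```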

Installing from sources

See INSTALL.txt for requirements or see the documentation


License

Modified BSD (3-clause)

Discussion and Development

Discussions take place on our mailing list.

We are very interested in feedback about usability and suggestions for improvements.

Bug Reports

Bug reports can be submitted to the issue tracker at