Statsmodels: statistical modeling and econometrics in Python


About Statsmodels

Statsmodels is a Python package that complements scipy for statistical computations, including descriptive statistics and estimation and inference for statistical models.

Documentation

The documentation for the latest release is at

https://www.statsmodels.org/stable/

The documentation for the development version is at

https://www.statsmodels.org/dev/

Recent improvements are highlighted in the release notes

https://www.statsmodels.org/stable/release/version0.9.html

Backups of documentation are available at https://statsmodels.github.io/stable/ and https://statsmodels.github.io/dev/.

Main Features

  • Linear regression models:
    • Ordinary least squares
    • Generalized least squares
    • Weighted least squares
    • Least squares with autoregressive errors
    • Quantile regression
    • Recursive least squares
  • Mixed Linear Model with mixed effects and variance components
  • GLM: Generalized linear models with support for all of the one-parameter exponential family distributions
  • Bayesian Mixed GLM for Binomial and Poisson
  • GEE: Generalized Estimating Equations for one-way clustered or longitudinal data
  • Discrete models:
    • Logit and Probit
    • Multinomial logit (MNLogit)
    • Poisson and Generalized Poisson regression
    • Negative Binomial regression
    • Zero-Inflated Count models
  • RLM: Robust linear models with support for several M-estimators.
  • Time Series Analysis: models for time series analysis
    • Complete StateSpace modeling framework
      • Seasonal ARIMA and ARIMAX models
      • VARMA and VARMAX models
      • Dynamic Factor models
      • Unobserved Component models
    • Markov switching models (MSAR), also known as Hidden Markov Models (HMM)
    • Univariate time series analysis: AR, ARIMA
    • Vector autoregressive models, VAR and structural VAR
    • Vector error correction model, VECM
    • Exponential smoothing, Holt-Winters
    • Hypothesis tests for time series: unit root, cointegration and others
    • Descriptive statistics and process models for time series analysis
  • Survival analysis:
    • Proportional hazards regression (Cox models)
    • Survivor function estimation (Kaplan-Meier)
    • Cumulative incidence function estimation
  • Multivariate:
    • Principal Component Analysis with missing data
    • Factor Analysis with rotation
    • MANOVA
    • Canonical Correlation
  • Nonparametric statistics: Univariate and multivariate kernel density estimators
  • Datasets: Datasets used for examples and in testing
  • Statistics: a wide range of statistical tests
    • diagnostics and specification tests
    • goodness-of-fit and normality tests
    • functions for multiple testing
    • various additional statistical tests
  • Imputation with MICE, regression on order statistics, and Gaussian imputation
  • Mediation analysis
  • Graphics: plot functions for visual analysis of data and model results
  • I/O
    • Tools for reading Stata .dta files (pandas has a more recent version)
    • Table output to ascii, latex, and html
  • Miscellaneous models
  • Sandbox: statsmodels contains a sandbox folder with code in various stages of development and testing which is not considered "production ready". This covers, among others:
    • Generalized method of moments (GMM) estimators
    • Kernel regression
    • Various extensions to scipy.stats.distributions
    • Panel data models
    • Information theoretic measures

How to get it

The master branch on GitHub has the most up-to-date code

https://www.github.com/statsmodels/statsmodels

Source downloads of release tags are available on GitHub

https://github.com/statsmodels/statsmodels/tags

Binaries and source distributions are available from PyPI

https://pypi.org/project/statsmodels/

Binaries can be installed in Anaconda

conda install statsmodels
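
After installing, a quick way to check that the package can be imported is to print its version from a Python session. This is only a sketch of a sanity check, not an official verification procedure.

```python
import statsmodels

# The installed version string; the value depends on your environment
version = statsmodels.__version__
print(version)
```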

Installing from sources

See INSTALL.txt for requirements or see the documentation

https://statsmodels.github.io/dev/install.html

License

Modified BSD (3-clause)

Discussion and Development

Discussions take place on our mailing list.

http://groups.google.com/group/pystatsmodels

We are very interested in feedback about usability and suggestions for improvements.

Bug Reports

Bug reports can be submitted to the issue tracker at

https://github.com/statsmodels/statsmodels/issues