New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
FAQ: related and complementary packages #6400
Comments
https://github.com/pysal Python Spatial Analysis Library |
There are also stalled or abandoned packages that includes features that are not available in other packages https://github.com/RJT1990/pyflux time series including GAS models |
@bashtage I think you have permission to edit my comments, i.e. add to first comment |
https://github.com/MaxHalford/Prince Python factor analysis library (PCA, CA, MCA, MFA, FAMD) found link on french (ENSAE) teaching site http://www.xavierdupre.fr/app/ensae_teaching_cs/helpsphinx/ml2a/td2a_mlbasic_acp_acm_anova.html |
a bit specialized: dynamic discrete choice models and similar in several packages |
extreme value analysis |
functional data analysis with machine learning focus, BSD-3 not much statistics in it, but should have the basics for functional data analysis, also depth measures similar to ours (?) (based on 10 minutes browsing) update |
cubic spline smoothing https://github.com/espdev/csaps https://github.com/cjekel/piecewise_linear_fit_py |
some stats packages, I was searching for nonparametric https://github.com/aschleg/hypothetical https://github.com/citiususc/stac related scikit-posthoc linked to in first message above. https://pypi.org/project/pingouin/ is becoming popular because it has a nice interface, nice returns, but is GPL |
https://github.com/sdv-dev/Copulas Copulas including vine copulas, MIT licensed, currently work in progress, pre-alpha https://github.com/blent-ai/pycopula development seems to have stopped, Apache license |
structural equation modelling https://pypi.org/project/semopy/ requirements.txt numpy pandas scipy sympy sklearn statsmodels numdifftools |
outlier detection https://github.com/yzhao062/pyod |
https://github.com/DoubleML/doubleml-for-py Bach, Philipp, Victor Chernozhukov, Malte S. Kurz, and Martin Spindler. 2021. “DoubleML -- An Object-Oriented Implementation of Double Machine Learning in Python.” ArXiv:2104.03220 [Cs, Econ, Stat], April. http://arxiv.org/abs/2104.03220. Chernozhukov, Victor, Denis Chetverikov, Mert Demirer, Esther Duflo, Christian Hansen, Whitney Newey, and James Robins. 2018. “Double/Debiased Machine Learning for Treatment and Structural Parameters.” The Econometrics Journal 21 (1): C1–68. https://doi.org/10.1111/ectj.12097. I didn't look what this does, |
variation on granger causality Edinburgh, Tom, Stephen J. Eglen, and Ari Ercole. 2021. “Causality Indices for Bivariate Time Series Data: A Comparative Review of Performance.” ArXiv:2104.00718 [Math, Stat], April. http://arxiv.org/abs/2104.00718. |
tools: find nearest correlation matrix, new package with several algorithms ours is mostly find nearest positive (semi-)definite matrix. |
causal machine learning, treatment effect estimation, package by Microsoft research, MIT licensed https://github.com/microsoft/EconML https://econml.azurewebsites.net/index.html |
another survival/lifetime analysis package, MIT licensed good collection of parametric distribution methods, e.g. hazard rates, ... for standard scipy distributions used in survival analysis |
a handbook, online MIT, with notebook |
epidemiology Smith, Matthew J., Mohammad A. Mansournia, Camille Maringe, Paul N. Zivich, Stephen R. Cole, Clémence Leyrat, Aurélien Belot, Bernard Rachet, and Miguel A. Luque-Fernandez. “Introduction to Computational Causal Inference Using Reproducible Stata, R, and Python Code: A Tutorial.” Statistics in Medicine 41, no. 2 (2022): 407–32. https://doi.org/10.1002/sim.9234. |
This should be a curated list of related packages, especially to statistics and econometrics topics that statsmodels does not (yet) provide.
(not complete adding links as found, not sorted and organized yet)
https://github.com/scikit-learn/scikit-learn The closest machine learning library. It is easy to find lists of popular machine learning libraries in Python.
Below are packages that are mainly oriented towards more traditional statistics and econometrics topics.
https://github.com/scipy/scipy statsmodels depends on many algorithm in scipy, scipy.stats has distributions and hypothesis tests (and is our older brother)
https://github.com/bashtage/linearmodels panel data and multivariate linear models
https://github.com/bashtage/arch GARCH models, unit root tests and bootstrap
https://github.com/maximtrp/scikit-posthocs Pairwise multiple comparisons (post hoc) tests
https://github.com/CamDavidsonPilon/lifelines survival analysis
https://github.com/pysal Python Spatial Analysis Library
https://github.com/lmfit/lmfit-py Non-Linear Least Squares
The text was updated successfully, but these errors were encountered: