Releases: anovos/anovos
v1.1.0
Overview
Today we are releasing v1.1.0 for Anovos !!
✨ Added Features
- New submodule Geospatial Analyzer has been added to Anovos !
- MLFlow Integration
- Drift & Stability modules optimization
- Introduced an innovative, scalable Variable clustering methodology
- Updated all the modules example notebooks to reflect the latest changes.
- Added a new use case demo showcasing Geo-spatial module capabilities.
- Detailed Documentation update
- Updates in dependent package versions
- Improvements in CI/CD Pipelines
🔴 Fixes
- Imputation sklearn bugfix fixed #337, #346
- Broken pipeline fix #300
- Fixed bug when no geo-data passed in and updated geo-functions comments #344
- Updated reports in use_case_demo/ to pre_saved_assets
🔨 Merged Pull Requests
- Bump pylint from 2.15.0 to 2.15.2 by @dependabot in #294
- Bump actions/setup-python from 3 to 4 by @dependabot in #292
- Pass Docker credentials when running as reusable workflow by @ionicsolutions in #299
- Update local execution script & instructions by @ionicsolutions in #302
- Update pandas requirement from <=1.4.1,>=1.2.5 to >=1.2.5,<1.5.1 by @dependabot in #304
- Unique count computation rsd value correction by @cshekhar17 in #291
- Bump pylint from 2.15.2 to 2.15.3 by @dependabot in #305
- Bump datapane from 0.14.0 to 0.15.3 by @dependabot in #303
- Bump actions/cache from 3.0.8 to 3.0.9 by @dependabot in #308
- Bump pytest-cov from 3.0.0 to 4.0.0 by @dependabot in #309
- Bump pytest-mock from 3.8.2 to 3.9.0 by @dependabot in #310
- Update tutorial image by @VishnuGowthem in #313
- Reset mistakenly overwritten config & output report when running workloads locally in Docker container by @ionicsolutions in #307
- Bump pytest-mock from 3.9.0 to 3.10.0 by @dependabot in #317
- Bump black from 22.8.0 to 22.10.0 by @dependabot in #315
- Bump actions/cache from 3.0.9 to 3.0.10 by @dependabot in #314
- Update sympy requirement from <1.11,>=1.6 to >=1.6,<1.12 by @dependabot in #282
- Bump actions/cache from 3.0.10 to 3.0.11 by @dependabot in #320
- Bump docker/login-action from 2.0.0 to 2.1.0 by @dependabot in #319
- Bump pylint from 2.15.3 to 2.15.4 by @dependabot in #322
- select num_cols only before initiating VarClusHi by @ZhuLi99 in #323
- changes to docker image updated in changelog by @VishnuGowthem in #324
- basic report header update by @sourjyasen07 in #326
- Bump pytest from 7.1.3 to 7.2.0 by @dependabot in #333
- Bump pylint from 2.15.4 to 2.15.5 by @dependabot in #327
- Geospatial combined by @ranjanravish in #277
- Update workflow.py by @kajanansangar in #340
- Debug variable clustering for null by @ZhuLi99 in #330
- Drift function optimization by @dattranm in #306
- Update pandas requirement from <1.5.1,>=1.2.5 to >=1.2.5,<1.5.2 by @dependabot in #328
- Update pyarrow requirement from <9.0.0 to <11.0.0 by @dependabot in #332
- stability_index_computation optimization by @dattranm in #318
- Doc string changes by @sourjyasen07 in #343
- Feature Stability Improvement by @dattranm in #325
- Update numpy requirement from <1.23.4,>=1.20.0 to >=1.20.0,<1.23.5 by @dependabot in #321
- Drift Stability Bug Fix by @dattranm in #345
- Bump pylint from 2.15.5 to 2.15.6 by @dependabot in #348
- Update LICENSE by @ionicsolutions in #352
- feature/mlflow-integration by @mathiaspet in #229
- Bump flake8 from 5.0.4 to 6.0.0 by @dependabot in #353
- Update numpy requirement from <1.23.5,>=1.20.0 to >=1.20.0,<1.23.6 by @dependabot in #347
- Bump geopy from 2.2.0 to 2.3.0 by @dependabot in #342
- updated geo_format_conversion filter conditions by @ZhuLi99 in #358
- Update pandas requirement from <1.5.2,>=1.2.5 to >=1.2.5,<1.5.3 by @dependabot in #354
- Update scipy requirement from <=1.8.0 to <1.10.0 by @dependabot in #251
- Bump statsmodels from 0.13.2 to 0.13.5 by @dependabot in #339
- added sh file for directly installing anovos and modified README by @ranjanravish in #360
- Drift module notebook change by @dattranm in #341
- Update use case demo & example notebooks by @ZhuLi99 in #359
- Update report htmls by @ZhuLi99 in #361
- mlfow_log_artifact changes moved under mlflow config is true by @VishnuGowthem in #362
Contributors
Our heartfelt thanks to @nisha20verma, @kajanansangar, @VishnuGowthem, @sourjyasen07, @sinuochen, @ranjanravish , @sumitgaurav19, @dattranm, @ionicsolutions, @mwjinjin, @mathiaspet, @cshekhar17, @miker2241, @AnganSamadder, @ziedbouf,@ZhuLi99 for their contributions !
Full Changelog: v1.0.1...v1.1.0
v1.0.1
Overview
Today we are releasing v1.0.1 for Anovos !! It is a patch release
✨ Added Features
- Updated the notebooks to reflect the latest changes (Association Evaluator & Quality Checker Example Notebooks)
- Optimized cat_to_num_unsupervised method
🔴 Fixes
- Fixed demo Dockerfile for spark which was failing
- Use the version of docker-stacks known to work
- Updated central tendency minor bug
- Bump black to 22.8.0 in .pre-commit-config.yaml
New Contributors
Contributors
Our heartfelt thanks to @nisha20verma, @kajanansangar, @VishnuGowthem, @sourjyasen07, @sinuochen, @ranjanravish , @sumitgaurav19, @dattranm, @ionicsolutions, @mwjinjin, @mathiaspet, @cshekhar17, @miker2241, @AnganSamadder, @ziedbouf,@ZhuLi99 for their contributions !
Full Changelog: v1.0.0...v1.0.1
v1.0.0
Overview
Today we are releasing v1.0.0 for Anovos !
✨ Added Features
- Performance improvements across all the modules (Correlation_matrix, IV & IG, Measures of shape, cardinality, etc..)
- Rigorous Scale tests after the performance improvement
- Removed PhiK and introduced Spark based correlation matrix computation (only for numerical attributes)
- Integrated with Feature Store (Feast)
- Support to run Anovos on Azure Kubernetes (AKS)
- Updation of Feature Corpus with 1700+ Features covering new industries/usecases (Agriculture, Information Technology, Aviation, Philosophy, Construction)
- Detailed Documentation update
- Updates in dependent package versions
- Improvements in CI/CD Pipelines
🔴 Fixes
- Fix for running Anovos on Databricks with Azure Blob storage
- Fix in Anovos examples notebooks #208
- Fix for Feature Recommender Lazy Load Model not working properly #117
- Fix for Data report module visualizations
🔨 Merged Pull Requests
- Update latest release version by @kajanansangar in #204
- Adding parameter for basic_report by @VishnuGowthem in #182
- Transformer Module Fix by @sinuochen in #206
- Bump actions/setup-python from 3 to 4 by @dependabot in #213
- Bump actions/cache from 3.0.2 to 3.0.4 by @dependabot in #212
- Fix #208: Notebook Issues by @sinuochen in #209
- Dynamically import Tensorflow based on M1 Machine status by @dattranm in #185
- Merge Correlation Methods by @dattranm in #207
- Run push_to_branch workflow on PR events by @ionicsolutions in #222
- Fully change push_to_branch to a pull_request workflow (fixes #221) by @ionicsolutions in #223
- Fix #205 - updated version.py to also 0.3.0 by @VishnuGowthem in #215
- Update sympy requirement from <=1.9,>=1.6 to >=1.6,<1.11 by @dependabot in #124
- Update pyarrow requirement from <=7.0.0 to <9.0.0 by @dependabot in #174
- Update matplotlib requirement from <=3.5.1,>=3.4.3 to >=3.4.3,<3.5.3 by @dependabot in #176
- Bump black from 22.3.0 to 22.6.0 by @dependabot in #228
- Bump pylint from 2.13.8 to 2.14.4 by @dependabot in #231
- Rename the feature_recommender to feature_mapper changes by @dattranm in #147
- Updated one condition to fix dp issue with lesser object count by @sourjyasen07 in #233
- Adding support for Azure k8s by @VishnuGowthem in #188
- Outlier detection optimization by @sinuochen in #235
- Changes in documentation and Dockerfile for tutorial/ folder by @VishnuGowthem in #241
- Update feature recommender database by @AnganSamadder in #238
- Update Feature Mapper Output and Notebooks by @sinuochen in #245
- Add multiple geospatial functions by @dattranm in #216
- Optimization for attribute_binning and IV_calculation/ IG_calculation by @dattranm in #234
- Bump pylint from 2.14.4 to 2.14.5 by @dependabot in #243
- Bump actions/cache from 3.0.4 to 3.0.5 by @dependabot in #242
- Bump pre-commit from 2.19.0 to 2.20.0 by @dependabot in #239
- Fix failing notebook builds by @ionicsolutions in #246
- Upgrade Spark versions in workflow to latest releases by @ionicsolutions in #249
- Return to latest docker-stacks images for notebook/examples builds after upstream fix by @ionicsolutions in #248
- Bump flake8 from 4.0.1 to 5.0.1 by @dependabot in #252
- Optimize measure_of_shape, remove for loops by @mwjinjin in #253
- Optimize mode_optimization and measure_of_centralTendency by @dattranm in #244
- Feast Integration for simple FileSource repos by @mathiaspet in #168
- outlier_categories optimization by @dattranm in #250
- Bump actions/cache from 3.0.5 to 3.0.6 by @dependabot in #256
- Bump flake8 from 5.0.1 to 5.0.4 by @dependabot in #259
- Bump pytest-mock from 3.7.0 to 3.8.2 by @dependabot in #260
- Removed file cfg element from feast cfg by @mathiaspet in #262
- Use specific version of flake8 when linting by @ionicsolutions in #261
- Persist memory for iv_ig to optimize runtime/memory by @dattranm in #263
- Add feature_store package to setup.py by @ionicsolutions in #265
- Update stats_generator example notebook by @sinuochen in #264
- Bump black from 22.1.0 to 22.6.0 by @dependabot in #273
- Bump actions/cache from 3.0.6 to 3.0.7 by @dependabot in #269
- Build release versions of anovos-notebook for Spark 3.2.2 by @ionicsolutions in #276
- Update numpy requirement from <=1.22.2,>=1.20.0 to >=1.20.0,<1.23.3 by @dependabot in #270
- Update jinja2 requirement from <3.1.0 to <3.2.0 by @dependabot in #226
- Correlation matrix optimization by @cshekhar17 in #254
- Update matplotlib requirement from <3.5.3,>=3.4.3 to >=3.4.3,<3.5.4 by @dependabot in #271
- Optimize duplicate_detection, assign .count() to variable and… by @mwjinjin in #274
- Measures of cardinality performance optimisation by @kajanansangar in #255
- Revert Dockerfile for running tests locally to previous version by @ionicsolutions in #279
- Optimize measures_of_centralTendency by @mwjinjin in #278
- Bump actions/cache from 3.0.7 to 3.0.8 by @dependabot in #281
- Bump pylint from 2.14.5 to 2.15.0 by @dependabot in #283
- Bug fix for outlier_detection, with parquet input and decimal. format cols by @dattranm in #280
- Optimize cat_to_num_supervised by @sinuochen in #275
- Bump black from 22.6.0 to 22.8.0 by @dependabot in #285
- Bump pytest from 7.1.2 to 7.1.3 by @dependabot in #284
- Local execution of Anovos workloads by @ionicsolutions in #266
- updating README and version to 1.0.0 by @VishnuGowthem in #289
New Contributors
- @mwjinjin made their first contribution in #253
- @mathiaspet made their first contribution in #168
- @cshekhar17 made their first contribution in #254
Contributors
Our heartfelt thanks to @nisha20verma, @kajanansangar, @VishnuGowthem, @sourjyasen07, @sinuochen, @ranjanravish , @sumitgaurav19, @dattranm, @ionicsolutions, @mwjinjin, @mathiaspet, @cshekhar17, @miker2241, @AnganSamadder, @ziedbouf for their contributions !
Full Changelog: v0.3.0...v1.0.0
v0.3.0
Overview
Today we are releasing v0.3.0 for Anovos
- Added detailed tutorial for Anovos (Using Credit Risk modelling Use Case)
- Resolved issue related to negative Information Gain values
- Upgraded the DataPane Version (For Visualization purposes)
- Minor bug fixes across all the modules and Unit Tests
- Improvements in CI/CD Pipelines
✨ Added Features
- Tested Anovos to run on Databricks (Python way & Spark-Submit way)
- Moved main workflow script inside the package
- Improved data sampling mechanism (Random and Stratified)
🔴 Fixes
- Bug fixes related to data sampling
- Bug fixes related to visualising pie-chart in the executive summary
- Bug issues pertaining to transformer Unit Tests
🔨 Merged Pull Requests
- Negative IG fix in #145
- Bump actions/cache from 3.0.1 to 3.0.2 in #157
- Datapane Version Upgrade in #158
- Bump github/codeql-action from 1 to 2 in #165
- Bump pytest from 7.1.1 to 7.1.2 by in #162
- Bump pylint from 2.12.2 to 2.13.7 by in #163
- Bump black from 22.1.0 to 22.3.0 by in #166
- Add caching for Python dependencies to GitHub Actions in #128
- Add data_sample function and unit test in #146
- Bump pylint from 2.13.7 to 2.13.8 in #175
- Bump docker/login-action from 1.14.1 to 2.0.0 in #173
- Bump docker/build-push-action from 2 to 3 in #172
- Bump docker/metadata-action from 3 to 4 in #171
- Added Dockerfile for local testing in #184
- Bug fix for data_sample by in #178
- Anovos tutorial by in #181
- Delete type check in data_ingest/write_dataset in #193
- Build up unit tests for the
data_ingest
module in #179 - Chain GitHub Actions workflows in #191
- Fix GitHub Actions workflow chaining in #194
- Use workflow names in triggering workflows in #195
- Always call from current commit SHA in #196
- Bug: date column on cast as double in #148
- Always call workflows from current commit SHA in #199
- Add recommend_type function and unit test by #170
- Introduce pre-commit hooks for streamlined development workflow in #180
- Move the main workflow script inside the package by in #186
- Readme file Update in #201
- Added transformers functions in Azure_Databricks config file in #202
- Fixing the Pie Chart Issue in #187
- Debug and fix failing transformer tests in #192
- Updated Setup.py for v0.3.0 release #203
v0.2.2
Overview
Today we are releasing v0.2.2 for Anovos with minor additions to v0.2.1
- Updated Jinja2 release breaking report generation
- Unit Testing for Feature Recommender Module
- Unit Testing for Time Series Analyzer module addition
- Minor bug fixes in Data ingest module and One Hot Encoding
✨ Added Features
- Latest jinja2 release breaks report generation on all current versions of Anovos in #141
- Feature Recommender Unit Tests in #127
- unit testing script for ts_analyzer in #125
🔴 Fixes
- Data ingest error check in #137
- changed error check to also satisfy multiple join columns in #138
- Minor Fixes @ TS Module in #134
- Optimize One Hot Encoding in #140
🔨 Merged Pull Requests
- Bump actions/download-artifact from 2 to 3 in #151
- Bump actions/upload-artifact from 2 to 3 in #152
- Bump actions/cache from 2.1.7 to 3.0.1 in #143
- Bump pytest from 7.0.1 to 7.1.1 by @dependabot in #136
Full Changelog: v0.2.1...v0.2.2
v0.2.1
Overview
Today we are releasing v0.2.1 for Anovos with minor additions to v0.2.0
- Updated all Anovos example notebooks and references
- Unit Testing for Data Transformers module added
- Unit Testing for Time Series Analyzer module added
✨ Added Features
- Update Anovos v0.2.0 example notebooks in #119
- Unit testing for transformers in #76
- Unit testing data_ingest_ts auto detection in #126
🔴 Fixes
- Feature Recommendation get method bugfix in #118
- Update drift validations in #120
- Add root path for saving intermediate data in #104
🔨 Merged Pull Requests
- Bump actions/setup-python from 2 to 3 in #122
- Bump docker/login-action from 1.13.0 to 1.14.1 in #123
- Release 021 update in #129
- changes for version in #130
Full Changelog: v0.2.0...v0.2.1
v0.2.0
Overview
Today we are releasing v0.2.0 for Anovos with some major developments!
- Data Transformation - Scaling, Imputation, Latent Features, Feature Transformation, Supervised Categorical Attribute Encoding and Expression Parser
- Semantic Enabled Feature Exploration & Recommendation
- Time Series Analyzer
- Feature Stability Estimation
- Reporting & Visualization v0.2.0
- Support for AWS EMR run
- Support for latest Spark & Python Versions
- Automated Workflows in Github Actions for Unit testing, Demo, Full Demo, Code Quality Check, CodeQL Analysis, Docker Images and PyPI release updates
👥 Contributors
Our heartfelt thanks to @ziedbouf, @ionicsolutions, @nisha20verma, @kajanansangar, @sourjyasen07, @VishnuGowthem , @ranjanravish , @sinuochen , @sumitgaurav19 , @dattranm for their contributions !
🐳 Docker Images
Apache Spark Demo Image
Example Image
- anovos/anovos-examples-2.4.8:0.2.0
- anovos/anovos-examples-3.1.3:0.2.0
- anovos/anovos-examples-3.2.1:0.2.0
✨ New Features:
- Transformers in #43, #82
- Feature Explorer & Recommender in #90, #59
- Feature Stability Estimation in #72
- Time Series Module in #89, #113
🔴 Fixed Issues:
- Issues in handling 'double' datatype in #18
- Spark demo takes about an hour to run in #27
- Unit tests can not access data #28
- Unit test cases missed out for #23 #31
- Provide the Getting-Started-Container through Docker Hub and/or GHCR #36
- Doing read_dataset in getting_started_with_anovos.ipynb gives error in #39
- One Hot Encoding Bug in #55
- confused with the following approach data_ingest read_dataset function in #66
- Getting started guide basic report generation is giving an exception in #84
🔨 Merged pull requests:
- Update README.md in #85
- Refactor drift detection in #71
- Bump docker/login-action from 1.12.0 to 1.13.0 in #88
- Fix column validator in #87
- Update findspark requirement from <2.0.0 to <3.0.0 in #50
- Update pyarrow requirement from <6.0.0 to <8.0.0 in #65
- Docker related readme in #91
- Convert all docstrings to numpydoc in #68
- Consolidate modules overview and API docs in #93
- Clean up docstrings formatting in #96
- Turn
anovos
into a proper Python package in #95 - Stats & Transformer module fix in #92
- Bump Spark 3.x to latest releases in #98
- Update READMEs in the repo in #99
- Bump actions/setup-java from 2.5.0 to 3 in #102
- attribute_binning: bug if max_value is 0 in #100
- Feature Recommender minor structure changes in #97
- Clean up some common issues in the code in #103
- docstrings: updated, formatted, deduplicated in #101
- Truly lazy load model (fixes #105) in #108
- Transfer feature recommendation documentation in #109
- Feature Recommender Data file locally save in #107
- Clean requirements bootstrap files in #86
- imputation_matrixFactorization: remove created id column if necessary in #110
- updated README.md in #114
- Ts databricks change in #112
- Anovos docstrings update in #111
- Additional TS Module Changes in #113
- Ensure that input data is found in all scenarios in #115
Full Changelog: v0.1.3...v0.2.0
Pre-Release 0.1.3
This is a pre-release for v0.2.0
Pre-Release 0.1.2
This is a pre-release for the upcoming v0.2.0
Pre-Release 0.1.1
This is a pre-release for the upcoming v0.2.0