Skip to content

Releases: capitalone/DataProfiler

0.10.9

06 Mar 14:28
f8b3e5d
Compare
Choose a tag to compare

Profiler

  • Version.py update 0.10.9 #1107
  • Staging into main from dev #1106
  • Hot fix json bug #1105

Documentation

  • Docs update 0.10.9 #1108
  • Add downloads tile to README #1085

Full Changelog: 0.10.8...0.10.9

What's Changed

Full Changelog: 0.10.8...0.10.9

0.10.8

11 Jan 17:32
a92ab1e
Compare
Choose a tag to compare

Profiler

  • Staging/main/0.10.8 #1081
  • Depedency: matplotlib version bump #1072
  • Make _assimilate_histogram() not use self #1071
  • Feature: added parquet sampling #1070

Documentation

  • Update: Documentation 0.10.8 #1084
  • Docs update to include option for sample_nrows for parquet files #1082

Miscellaneous

  • Bump actions/setup-python from 4 to 5 #1078

Full Changelog: 0.10.7...0.10.8

What's Changed

Full Changelog: 0.10.7...0.10.8

0.10.7

14 Nov 19:47
ad2eb80
Compare
Choose a tag to compare

Profiler

  • Staging/main/0.10.7 #1068
  • Hot Fix: Plugin Testing #1067

Documentation

  • Update: Documentation 0.10.7 #1069

Full Changelog: 0.10.6...0.10.7

What's Changed

Full Changelog: 0.10.6...0.10.7

0.10.6

13 Nov 21:24
302a458
Compare
Choose a tag to compare

Profiler

  • Staging/main/0.10.6 #1065
  • Update: Version 0.10.6 #1064
  • Feature: Plugins #1060
  • Hot Fix: Contribution Doc #1059
  • Rename references to degree of freedom from df to deg_of_free #1056
  • add_s3_connection_remote_loading_s3uri_feature #1054
  • feat: add null ratio to column stats #1052
  • Delay transforming priority_order into ndarray #1045
  • Fix Codeowners List #1043

Documentation

  • Update: Documentation 0.10.6 #1066
  • Docs: AWS S3 Data Reading #1063
  • Update docs to reflect renamed output of deg_of_free #1057

Full Changelog: 0.10.5...0.10.6

What's Changed

Full Changelog: 0.10.5...0.10.6

0.10.5

25 Sep 15:37
9bca4e7
Compare
Choose a tag to compare

Profiler

Documentation

  • Update docs 0.10.5 #1042
  • Update docs 0.10.5 #1041

Full Changelog: 0.10.4...0.10.5

What's Changed

Full Changelog: 0.10.4...0.10.5

0.10.4

22 Sep 12:01
02b7070
Compare
Choose a tag to compare

Profiler

  • version bump (#1032) #1036
  • Staging/main/0.10.4 #1029
  • added psi calculation to categorical columns #1027
  • Bump actions/checkout from 3 to 4 #1024
  • Minor: Profiler Path Fix in Example Notebook #1021
  • modified the assignees for issue creation #1016
  • Make sure random_state is a list before indexed assignment #968

Documentation

  • Update docs 0.10.4 #1038
  • Update docs 0.10.4 #1037
  • update install instructions for mac #1026

Full Changelog: 0.10.3...0.10.4

What's Changed

Full Changelog: 0.10.3...0.10.4

0.10.3

07 Aug 19:10
b0b8510
Compare
Choose a tag to compare

Profiler

  • Staging: main 0.10.3 #1004
  • Fix ProfilerOptions() documentation #1002

Feature: Multiprocess

  • Staging: into dev feature/multiprocess #998
  • Multiprocess automation feature into staging/dev. #997
  • Syncing feature/multiprocess into staging/dev/multiprocess #992
  • Automate multiprocess option #984

Feature: num_quantiles option

  • Staging: into dev feature/num-quantiles #990
  • Fix Scipy Mend Issue #988
  • HistogramAndQuantilesOption sync with dev branch #987

Documentation

  • Update docs to 0.10.3 #1012
  • Update docs to 0.10.3 #1011
  • fixed snappy install issue on Mac #1010
  • Staging: into dev-gh-pages the docs for multiprocess. #1001
  • Add docs to multiprocess option in StructuredOptions. #999
  • Staging: into dev-gh-pages the docs for num_quantiles. #993
  • Add docs for num_quantiles option for histogram_and_quantiles. #991

Full Changelog: 0.10.2...0.10.3

What's Changed

Full Changelog: 0.10.2...0.10.3

0.10.2

28 Jul 16:18
ec47d45
Compare
Choose a tag to compare

Profiler

  • hotfix[0.10.2]: cat vs float bug #973

Documentation

  • Staging: Update docs to 0.10.2 #978
  • Update docs to 0.10.2 #979

Full Changelog: 0.10.1...0.10.2

What's Changed

Full Changelog: 0.10.1...0.10.2

0.10.1

17 Jul 18:21
6cb789a
Compare
Choose a tag to compare

Profiler

  • Hot Fix: .astype("bool") #960

Documentation

  • Staging: Update docs 0.10.1 #961
  • Update docs 0.10.1 #962

Full Changelog: 0.10.0...0.10.1

What's Changed

Full Changelog: 0.10.0...0.10.1

0.10.0

30 Jun 15:04
77ddb29
Compare
Choose a tag to compare

Profiler

  • Forking workflow directions CONTRIBUTING.md #857
  • Fixing diagram rendering in CONTRIBUTING.md #862
  • Fix initial value of processor_type #863
  • fix: test bug due to bad mocks #878
  • added differences section to unstructured data example #877
  • Reservoir sampling refactor #910
  • feat: add dev to workfow for testing #897
  • Cms for categorical #892
  • Hotfix: fix post feature serialization merge #942
  • Update version to 0.10.0 #944
  • Staging/main/0.10.0 #943

Profiler: Profile Serialization

  • Staging/dev/profile serialization #940
  • fix: order bug #939
  • fix: null_rep mat should calculate even if datetime #933
  • Profiler: load_method hotfix #932
  • Top level hotfix: save / load .lower() #931
  • Notebook Example save/load Profile #930
  • refactor: use seed for sample for consistency #927
  • Profile Builder load() serialization #925
  • Reuse passed labeler #924
  • BaseProfiler save() for json #923
  • Added testing for values for test_json_decode_after_update #915
  • UnstructuredProfiler: Added NoImplementationError #907
  • fix: bug and add tests for structuredcolprofiler #904
  • Stuctured profiler encode decode #903
  • refactor: allow options to go through all #902
  • StructuredColProfiler Encode / Decode #901
  • Decode options #894
  • Quick Test update #893
  • Deserialization of datalabeler #891
  • ColumnDataLabelerCompiler: serialize / deserialize #888
  • Add Serialization and Deserialization Tests for Stats Compiler, plus refactors for order Typing #887
  • Adds deserialization for compilers and validates tests for Primitive; fixes numerical deserialization #886
  • Adds tests validating serialization with Primitive type for compiler #885
  • feat: add test and compiler serialization #884
  • ready datalabeler for deserialization and improvement on serializatio… #879
  • Encode Options #875
  • Encode/Decode TextColumnProfiler #870
  • Created encoder for the datalabelercolumn #869
  • Added test to ensure order attribute for ordered column profiler functions correctly after deserialization #868
  • Added decoding for encoding of ordered column profiles #864
  • Json decode date time column #861
  • Float column profiler encode decode #854
  • hot fixes for encode and decode of numeric stats mixin and intcol pro… #852

Profiler: Options

  • staging/dev/options #909
  • RowStatisticsOptions: Implementing option #871
  • New preset implementation and test #867
  • RowStatisticsOptions: Add option #865

Documentation

  • Staging update docs 0.10.0 #945
  • Documentation: Fix Req #922
  • Documentation: Update for Reservoir Sampling #919
  • documentation update for cms specific options to category #917
  • Add forking / branch workflow image #858

Documentation: Profile Serialization

  • Merge staging/dev-gh-pages/profile-serialization into dev-gh-pages #937
  • Docs: Profiler Serialization Clean Up #936
  • Docs: Profiler Serialization #928

Documentation: Options

  • Documentation: feature/options branch docs updates #921
  • Row statistics option documentation #883
  • updating docs for preset name #882
  • Add documentation for median_abs_deviation option #881
  • Preset test updated w new names and different toggles #880
  • reset ignore, update .gitignore, update documentation on presets #874
  • Fixed documentation for sampling_ratio option #873

Full Changelog: 0.9.0...0.10.0

What's Changed

Full Changelog: 0.9.0...0.10.0