Skip to content

Commit

Permalink
add summary of statistics collected
Browse files Browse the repository at this point in the history
  • Loading branch information
lalmei committed May 2, 2021
1 parent 0d6a178 commit e1922f1
Showing 1 changed file with 7 additions and 0 deletions.
7 changes: 7 additions & 0 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -91,6 +91,13 @@ with session.logger(dataset_name="my_dataset") as logger:
#images
logger.log_images("path/to/image.png")
```
whyLogs collects approximate statistics and sketches of data on a column-basis into a statistical profile. These metrics include:

- Simple counters: boolean, null values, data types.
- Summary statistics: sum, min, max, variance.
- Unique value counter or cardinality: tracks an approximate unique value of your feature using HyperLogLog algorithm.
- Histograms for numerical features. whyLogs binary output can be queried to with dynamic binning based on the shape of your data.
- Top frequent items (default is 128). Note that this configuration affects the memory footprint, especially for text features.

Check the examples below for visualization and other use cases

Expand Down

0 comments on commit e1922f1

Please sign in to comment.