Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add 'csv' format to the 'transform' reporter #241

Merged
merged 1 commit into from
Oct 31, 2022

Conversation

pablogsal
Copy link
Member

@pablogsal pablogsal commented Oct 31, 2022

When there are a lot of different allocations, some of the reporters that
we currently offer do not allow for a very granular inspection of the
data. This can happen for example when certain libraries like TensorFlow
or sklearn or pandas are imported as the number of allocations
goes easily to several thousand. In these situations, reporters like the
summary reporter are not helpful as the number of rows to display needs
to be really high and is not easy to analyze.

To allow other more specialized data analysis tools to work with the
data, add a new 'csv' output format to the 'transform' reporter that
will dump a comma-separated-value file with every allocation in the high
watermark as a row with columns representing different properties
of every allocation. The stack trace will be represented as a string
joined with characters that are illegal in function names.

Closes: #239

Signed-off-by: Pablo Galindo pablogsal@gmail.com

When there is a lot of different allocations, some of the reporters that
we currently offer do not allow for very granular inspection of the
data. This can happen for example when certain libraries like tensorflow
or sklearn or pandas are imported as the number of allocations
goes easily to several thousand. In these situations reporters like the
summary reporter are not helpful as the number of rows to display needs
to be really high and is not easy to analyze.

To allow other more specialized data analysis tools to work with the
data, add a new 'csv' output format to the 'transform' reporter that
will dump a comma-separated-value file with every allocation in the high
water mark as a row with the columns representing different properties
of every allocation. The stack trace will be represented as a string
joined with characters that are illegal in function names.

Signed-off-by: Pablo Galindo <pablogsal@gmail.com>
Copy link
Contributor

@godlygeek godlygeek left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM

@pablogsal pablogsal merged commit 849ae4c into bloomberg:main Oct 31, 2022
@pablogsal pablogsal deleted the csv branch October 31, 2022 22:06
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

Converting the output of summary reporter to csv/table/dataframe
2 participants