Add 'csv' format to the 'transform' reporter #241

pablogsal · 2022-10-31T13:21:31Z

When there are a lot of different allocations, some of the reporters that
we currently offer do not allow for a very granular inspection of the
data. This can happen for example when certain libraries like TensorFlow
or sklearn or pandas are imported as the number of allocations
goes easily to several thousand. In these situations, reporters like the
summary reporter are not helpful as the number of rows to display needs
to be really high and is not easy to analyze.

To allow other more specialized data analysis tools to work with the
data, add a new 'csv' output format to the 'transform' reporter that
will dump a comma-separated-value file with every allocation in the high
watermark as a row with columns representing different properties
of every allocation. The stack trace will be represented as a string
joined with characters that are illegal in function names.

Closes: #239

Signed-off-by: Pablo Galindo pablogsal@gmail.com

When there is a lot of different allocations, some of the reporters that we currently offer do not allow for very granular inspection of the data. This can happen for example when certain libraries like tensorflow or sklearn or pandas are imported as the number of allocations goes easily to several thousand. In these situations reporters like the summary reporter are not helpful as the number of rows to display needs to be really high and is not easy to analyze. To allow other more specialized data analysis tools to work with the data, add a new 'csv' output format to the 'transform' reporter that will dump a comma-separated-value file with every allocation in the high water mark as a row with the columns representing different properties of every allocation. The stack trace will be represented as a string joined with characters that are illegal in function names. Signed-off-by: Pablo Galindo <pablogsal@gmail.com>

godlygeek

LGTM

pablogsal requested review from godlygeek and lkollar October 31, 2022 13:21

pablogsal mentioned this pull request Oct 31, 2022

Converting the output of summary reporter to csv/table/dataframe #239

Closed

1 task

pablogsal force-pushed the csv branch from 24f98c6 to e009c4b Compare October 31, 2022 13:29

godlygeek force-pushed the csv branch from e009c4b to d72b561 Compare October 31, 2022 18:37

pablogsal enabled auto-merge (rebase) October 31, 2022 18:42

godlygeek force-pushed the csv branch from d72b561 to 20890c7 Compare October 31, 2022 18:46

godlygeek approved these changes Oct 31, 2022

View reviewed changes

pablogsal merged commit 849ae4c into bloomberg:main Oct 31, 2022

pablogsal deleted the csv branch October 31, 2022 22:06

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add 'csv' format to the 'transform' reporter #241

Add 'csv' format to the 'transform' reporter #241

pablogsal commented Oct 31, 2022 •

edited

godlygeek left a comment

Add 'csv' format to the 'transform' reporter #241

Add 'csv' format to the 'transform' reporter #241

Conversation

pablogsal commented Oct 31, 2022 • edited

godlygeek left a comment

Choose a reason for hiding this comment

pablogsal commented Oct 31, 2022 •

edited