First version of the report generator. Add further tools. by perezjosibm · Pull Request #320 · ceph/cbt

perezjosibm · 2024-12-10T12:24:35Z

This PR introduces some new tools:

report_gen.py - script that traverses the dir tree to select .JSON entries to produce a report in .tex (uses some existing templates with contents to add, like the header, table of contents, etc.)
diskstat_diff.py - script to filter disk utilisation, producing a .JSON as result, ready to plot, etc.
top-parser.py - script to filter .JSON files from top (translated by jc), calculating averages for CPU and MEM utilisation, used to combine this data with .JSON from FIO.
gnuplot_plate.py - thin wrapper around gnuplot to generate Response latency curves (hockey stick performance charts).
gen_json_xtractor.py, test_run_spec.py - refactoring of fio-parse-jsons.py into modules that can be reused by other tools.

Usage:

All the scripts have a --help option to provide guide of usage.

parse-top.py:

    cat ${TEST_RESULT}_top.out | jc --top --pretty > ${TEST_RESULT}_top.json
    python3 /root/bin/parse-top.py --config=${TEST_RESULT}_top.json --cpu="${OSD_CORES}" --avg=${OSD_CPU_AVG} \
          --pids=${TOP_PID_JSON} 2>&1 > /dev/null

will produce a ${TEST_RESULT}_cpu_avg.json file as output (in the specified dir).

diskstat_diff.py: the following snippet illustrates taking two samples, one before and one after the test execution, the script calculates the difference and produces the result updating the given file name as argument:

# Take diskstats measurements before FIO instances
      jc --$pretty /proc/diskstats > ${DISK_STAT}

# Measure the diskstats after the completion of FIO
      jc --pretty /proc/diskstats | python3 /root/bin/diskstat_diff.py -a ${DISK_STAT}

Signed-off-by: Jose J Palacios-Perez <perezjos@uk.ibm.com>

perezjosibm · 2024-12-10T12:25:23Z

Hi @sseshasa I'd appreciate if you could review my PR please. Many thanks in advance.

perezjosibm · 2025-01-10T12:37:21Z

Hi @sseshasa when convenient, could you please review my PR? Many thanks in advance.

sseshasa · 2025-01-13T11:42:19Z

Hi @sseshasa when convenient, could you please review my PR? Many thanks in advance.

@perezjosibm Apologies, Have been busy with other things. I will try and look into it this week.

perezjosibm · 2025-01-27T14:13:54Z

Hi @sseshasa it would be great if you could please provide some feedback, when you find it convenient please. I understand you are busy, so I appreciate your time. Thanks

sseshasa

LGTM! Apologies again for taking time. I Just left a couple of questions. This is shaping up to be quite a comprehensive set of tools! I haven't gone through each and every line and just tried to understand the purpose behind the set of tools and the general mechanism you employ to chart out the metrics.

sseshasa · 2025-01-28T10:25:45Z

tools/report_gen.py

+    OSD_LIST = [1,3,8]
+    REACTOR_LIST = [1,2,4]
+    ALIEN_LIST = [7,14,21]


My understanding of crimson is limited and so this question:
Are these lists not going to change for the foreseeable future? Can these instead be passed as an input to the script?

Hi @sseshasa thanks for your feedback! Yes, the plan is that such list is going to be received from the input test plan .yaml. That example is temporarily, only for the good practice of initializing Python data structures 👍

sseshasa · 2025-01-28T10:31:48Z

tools/parse-top.py

+- (input/output) and a _cpu_avg.json file name, average over a range (typically for Response latency curves).
+
+Example of usage:
+    cat ${TEST_RESULT}_top.out | jc --top --pretty > ${TEST_RESULT}_top.json


Can the script itself be made to generate the _top.json file? I mean the script itself be made to execute the top command and saving it in json format and then using that to extract the metrics.

I see your point. I decided to make it separated to not depend on the underlying system, since often one needs to test on cold data taken from the target box whilst running another set of tests 👍
Thanks

perezjosibm added 6 commits November 29, 2024 11:34

First version of the report generator in Python

a4fd400

Signed-off-by: Jose J Palacios-Perez <perezjos@uk.ibm.com>

tools: add parse-top.py to produce avg CPU util gnuplot script from top

dff829a

Signed-off-by: Jose J Palacios-Perez <perezjos@uk.ibm.com>

Minor updates to fio-parse-jsons.py

0206ba4

Signed-off-by: Jose J Palacios-Perez <perezjos@uk.ibm.com>

tools: Add first version of gnuplot_plate.py

04d5964

Signed-off-by: Jose J Palacios-Perez <perezjos@uk.ibm.com>

tools: Add first version diskstat_diff.py

0e266d3

Signed-off-by: Jose J Palacios-Perez <perezjos@uk.ibm.com>

tools: Add first version of gen_json_xtractor.py and test_run_spec.py

bfe3dce

Signed-off-by: Jose J Palacios-Perez <perezjos@uk.ibm.com>

perezjosibm requested a review from sseshasa December 10, 2024 12:25

perezjosibm self-assigned this Dec 11, 2024

perezjosibm added the enhancement label Dec 11, 2024

sseshasa approved these changes Jan 28, 2025

View reviewed changes

perezjosibm merged commit 84d0176 into ceph:master Jan 28, 2025

harriscr mentioned this pull request Jul 18, 2025

Updating post processing #337

Merged

harriscr mentioned this pull request Aug 29, 2025

Improvements to the report generation #339

Merged

harriscr mentioned this pull request Sep 30, 2025

Code quality improvements for workloads and post_processing #343

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

First version of the report generator. Add further tools.#320

First version of the report generator. Add further tools.#320
perezjosibm merged 6 commits intoceph:masterfrom
perezjosibm:wip.report_gen

perezjosibm commented Dec 10, 2024 •

edited

Loading

Uh oh!

perezjosibm commented Dec 10, 2024

Uh oh!

perezjosibm commented Jan 10, 2025

Uh oh!

sseshasa commented Jan 13, 2025

Uh oh!

perezjosibm commented Jan 27, 2025

Uh oh!

sseshasa left a comment

Uh oh!

sseshasa Jan 28, 2025

Uh oh!

perezjosibm Jan 28, 2025

Uh oh!

sseshasa Jan 28, 2025

Uh oh!

perezjosibm Jan 28, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

perezjosibm commented Dec 10, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Usage:

Uh oh!

perezjosibm commented Dec 10, 2024

Uh oh!

perezjosibm commented Jan 10, 2025

Uh oh!

sseshasa commented Jan 13, 2025

Uh oh!

perezjosibm commented Jan 27, 2025

Uh oh!

sseshasa left a comment

Choose a reason for hiding this comment

Uh oh!

sseshasa Jan 28, 2025

Choose a reason for hiding this comment

Uh oh!

perezjosibm Jan 28, 2025

Choose a reason for hiding this comment

Uh oh!

sseshasa Jan 28, 2025

Choose a reason for hiding this comment

Uh oh!

perezjosibm Jan 28, 2025

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

perezjosibm commented Dec 10, 2024 •

edited

Loading