Add profiler #333

qwe661234 · 2024-01-21T07:32:26Z

Based on our observations, a high percentage of true hotspots involve loops or backward jumps, but the number of IRs within these hotspots is unstable, so it is not a reliable indicator on its own.

Therefore, we believe our profiler can use three indices to detect hotspots:

Backward jump
Loop
Used frequency
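
As a rough illustration of how the three indices might be combined, here is a minimal Python sketch. The `Block` fields, the `FREQ_THRESHOLD` value, and the policy of halving the threshold for loop-like blocks are all assumptions made for this example; the PR description does not say how the indices are weighted, and a backward jump is simplified here to "the branch target is at or before the block's start address".

```python
from dataclasses import dataclass

# Hypothetical threshold; the PR does not give a value.
FREQ_THRESHOLD = 4096


@dataclass
class Block:
    pc_start: int       # address of the first instruction in the block
    branch_target: int  # address the block's final branch jumps to
    has_loop: bool      # True if the block was marked as part of a loop
    invoke_count: int   # how many times the block has been executed


def has_backward_jump(block: Block) -> bool:
    """Simplified test: the target is at or before the block's start."""
    return block.branch_target <= block.pc_start


def is_hotspot(block: Block, threshold: int = FREQ_THRESHOLD) -> bool:
    """Combine the three indices: backward jump, loop, and used frequency."""
    if has_backward_jump(block) or block.has_loop:
        # Assumed policy: loop-like blocks qualify at half the usual count.
        return block.invoke_count >= threshold // 2
    return block.invoke_count >= threshold
```

With these assumed values, a block that jumps back to itself qualifies after 2048 executions, while a straight-line block needs 4096.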

Close: #189

jserv

Check #189 (comment) carefully. You should implement some of the following:

Allow Linux perf to profile JITted code, as QEMU does.
Generate profiling logs and use a report program to view them, as Mono does.
Display the data in a tree-based view, as vmprof does.

It is important to provide interactive ways to browse profiling data with external tools. See "All my favorite tracing tools: eBPF, QEMU, Perfetto, new ones I built and more".

qwe661234 · 2024-01-21T07:50:57Z

Check #189 (comment) carefully. You should implement some of the following:

Allow Linux perf to profile JITted code, as QEMU does.

Generate profiling logs and use a report program to view them, as Mono does.

Display the data in a tree-based view, as vmprof does.

It is important to provide interactive ways to browse profiling data with external tools. See "All my favorite tracing tools: eBPF, QEMU, Perfetto, new ones I built and more".

This runtime profiler serves as a trigger for the tier-1 JIT compiler. Do you mean we should provide a log mode for users to dump the profiling information?

jserv · 2024-01-21T08:02:51Z

This runtime profiler serves as a trigger for the tier-1 JIT compiler. Do you mean we should provide a log mode for users to dump the profiling information?

If you are going to close #189, you should consolidate the profiling facilities.

qwe661234 · 2024-01-23T00:45:59Z

Currently, we only provide information about the taken branch, the untaken branch, and the IRs of a basic block. I think we can open a separate issue for visualizing the graph IR.

jserv

Separate the processes of generating and rendering profiling data. Specifically, rv32emu (activated with the -p option) will be responsible for generating profiling data, while the rendering will be handled by the newly added tool rv_profile.
Ensure that block information, including the instructions (referencing ELF files for disassembly as needed), is captured as part of the profiling data. This approach will make the data more informative and valuable for analysis.
The proposed changes aim to equip the system with adequate capabilities for depicting runtime profiling information, a crucial aspect for ongoing JIT development.

jserv · 2024-01-23T11:09:09Z

The rationale behind segregating the generation and rendering of profiling data includes:

Enabling graphical representations without complicating the main rv32emu program.
Facilitating the use of tools like objdump for disassembling specific blocks, exemplified by commands such as objdump --start-address=0x50c40 --stop-address=0x50c60 -d.
Allowing for the profiling of RISC-V executables in a separate, offline run to collect profiling data. This data can then be utilized to assign accurate weights to the predecessors and successors in superoptimizers.

qwe661234 · 2024-01-23T11:53:31Z

The rationale behind segregating the generation and rendering of profiling data includes:

Enabling graphical representations without complicating the main rv32emu program.

Facilitating the use of tools like objdump for disassembling specific blocks, exemplified by commands such as objdump --start-address=0x50c40 --stop-address=0x50c60 -d.

Allowing for the profiling of RISC-V executables in a separate, offline run to collect profiling data. This data can then be utilized to assign accurate weights to the predecessors and successors in superoptimizers.

We can achieve this by writing a Python script to parse the generated profiling data.
Do we need another debug mode for this? We don't have a debug mode currently.

jserv · 2024-01-23T12:12:17Z

We can achieve this by writing a Python script to parse the generated profiling data.

Yes, you can write a Python-based renderer for profiling data.

Do we need another debug mode for this? We don't have a debug mode currently.

No, you can specify the start/stop address for objdump -d without any debugging symbols.
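
For a renderer written in Python, the objdump invocation could be assembled as in the sketch below. The helper name `objdump_command` and the ELF path are hypothetical. Note that objdump also needs the ELF file as an operand, which the example command in this thread leaves out, and that a RISC-V-aware objdump (for instance one from a cross toolchain) is probably required for RISC-V binaries.

```python
import shlex


def objdump_command(elf_path: str, start: int, stop: int,
                    tool: str = "objdump") -> list[str]:
    """Build an argument list that disassembles only [start, stop)."""
    return [
        tool,
        f"--start-address={start:#x}",
        f"--stop-address={stop:#x}",
        "-d",
        elf_path,
    ]


# Hypothetical ELF path, using the addresses quoted earlier in the thread.
cmd = objdump_command("build/hello.elf", 0x50c40, 0x50c60)
print(shlex.join(cmd))
```

The resulting list can be passed directly to `subprocess.run(cmd, capture_output=True, text=True)` to capture the disassembly of a single block.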

qwe661234 · 2024-01-24T03:18:54Z

Facilitating the use of tools like objdump for disassembling specific blocks, exemplified by commands such as objdump --start-address=0x50c40 --stop-address=0x50c60 -d.

Do you mean this function should provide the instruction sequence of the basic block starting at start-address, without any other profiling data?

jserv · 2024-01-24T03:40:49Z

Do you mean this function should provide the instruction sequence of the basic block starting at start-address, without any other profiling data?

It depends on the format/layout of profiling data recorded during the execution of RISC-V programs. You can simply record the entry/exit address of each block, so that you can consult binutils.
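
To make the idea concrete, the sketch below assumes a plain-text log with one line per block in the form `entry,exit,invocations`, with addresses in hexadecimal. This layout is invented for illustration and is not the format rv32emu actually emits. With only the entry and exit addresses recorded, each block can later be handed to binutils for disassembly.

```python
from dataclasses import dataclass
from typing import Iterable


@dataclass(frozen=True)
class BlockRecord:
    entry: int        # address of the first instruction in the block
    exit_addr: int    # address just past the last instruction
    invocations: int  # how often the block was executed


def parse_records(lines: Iterable[str]) -> list[BlockRecord]:
    """Parse 'entry,exit,invocations' lines; skip blanks and '#' comments."""
    records = []
    for line in lines:
        line = line.strip()
        if not line or line.startswith("#"):
            continue
        entry, exit_addr, invocations = line.split(",")
        records.append(
            BlockRecord(int(entry, 16), int(exit_addr, 16), int(invocations))
        )
    return records


def select_range(records: Iterable[BlockRecord], start: int,
                 stop: int) -> list[BlockRecord]:
    """Keep only blocks that lie entirely within [start, stop]."""
    return [r for r in records if r.entry >= start and r.exit_addr <= stop]
```

A renderer could call `select_range` first and then disassemble each remaining block by passing its `entry` and `exit_addr` to objdump as the start and stop addresses.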

qwe661234 · 2024-01-25T09:29:49Z

Currently, the profiler script supports:

printing profiling data from start_address to stop_address
visualizing the graph IR of the specified program counter
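
One possible way to render such a graph is to emit Graphviz DOT text, as in the sketch below. The input mapping (block start PC to its taken and untaken successors) and the function name `to_dot` are assumptions for this example, not the data structure or interface of the actual profiler script.

```python
from typing import Optional

# Hypothetical input: block start PC -> (taken successor, untaken successor).
Successors = dict[int, tuple[Optional[int], Optional[int]]]


def to_dot(blocks: Successors, root_pc: int) -> str:
    """Emit Graphviz DOT for every block reachable from root_pc."""
    lines = ["digraph blocks {"]
    seen: set[int] = set()
    stack = [root_pc]
    while stack:
        pc = stack.pop()
        if pc in seen or pc not in blocks:
            continue  # already expanded, or no profiling data for this PC
        seen.add(pc)
        for label, succ in zip(("taken", "untaken"), blocks[pc]):
            if succ is not None:
                lines.append(f'  "{pc:#x}" -> "{succ:#x}" [label="{label}"];')
                stack.append(succ)
    lines.append("}")
    return "\n".join(lines)
```

The returned text can be written to a file and rendered with the Graphviz `dot` tool, which keeps the drawing step outside the emulator.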

jserv requested changes Jan 21, 2024

qwe661234 requested a review from jserv January 21, 2024 12:15

jserv reviewed Jan 21, 2024: src/riscv.c (outdated)

jserv reviewed Jan 21, 2024: src/riscv.h (outdated)

qwe661234 force-pushed the add_profiler branch 2 times, most recently from 16adbc9 to 473babf, January 22, 2024 14:24

jserv reviewed Jan 22, 2024: src/emulate.c (outdated)

jserv reviewed Jan 22, 2024: src/riscv_private.h (outdated)

qwe661234 force-pushed the add_profiler branch 2 times, most recently from 1b0be9d to 49938f0, January 23, 2024 00:41

jserv reviewed Jan 23, 2024: src/main.c (outdated)

jserv requested changes Jan 23, 2024

qwe661234 force-pushed the add_profiler branch from 49938f0 to 6bf43c3, January 23, 2024 07:11

jserv reviewed Jan 23, 2024: src/main.c (outdated)

jserv reviewed Jan 23, 2024: src/cache.h (outdated)

jserv reviewed Jan 23, 2024: src/cache.c

qwe661234 force-pushed the add_profiler branch from 6bf43c3 to 1ef4680, January 23, 2024 11:50

qwe661234 force-pushed the add_profiler branch from 1ef4680 to b8b8db8, January 23, 2024 11:56

qwe661234 force-pushed the add_profiler branch 3 times, most recently from 4cb78b9 to ecd2ef6, January 25, 2024 09:25

qwe661234 requested a review from jserv January 25, 2024 16:02

jserv reviewed Jan 27, 2024: README.md (outdated)

qwe661234 added 3 commits January 27, 2024 18:14

Correctly update the used frequency of BB (844e9fc)

Add profiling data log (6d0e0ef)

qwe661234 force-pushed the add_profiler branch from ecd2ef6 to 61a9152, January 27, 2024 10:22

Add profiler script (ee8ac93)

qwe661234 force-pushed the add_profiler branch from 61a9152 to ee8ac93, January 27, 2024 10:24

jserv merged commit 6dcfd84 into sysprog21:master Jan 27, 2024
7 checks passed
