Skip to content

TopMC supports fine-grained continuous acquisition of architectural features during program execution without incurring significant overhead or interfering with program behavior. Fine grained features can mine rich and key features, such as the peak bandwidth requirement of memory.

ASG-ICT/TopMC

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

12 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

TopMC : Performance Counter Monitor Tool

Overview of the TopMC

TopMC is a performance counter monitor tool which can easily support newest CPU architectures under linux. Currently, most of performance counter monitor tools cannot fully support newest Intel architecture, such as Nehalem. The performance counter events in Nehalem are splitted into core events and uncore events. And tools such as PAPI, Oprofile, pfmon, perfctr only support core events, excluding uncore events such as L3 misses, memory operations.

TopMC contains a kernel module to interact with linux kernel which eliminates the supports by native linux kerneli(Oprofile) or kernel patch(pfmon).

TopMC is a lightweight performance counter monitor and trace collector:

  • Easy to use: TopMC is consisted of a kernel module and some scripts. It is convenient to install, easy to understand, and efficienct to modify. Linux kernel is no longer to be recompiled, and scripts are free to modify.

  • Lightweight: Most performance counter monitor tools make use of system call to set the performance counter events in linux kernel. However, TopMC achieves this by procfs file system which occurs less overhead.

  • Per-thread counting: TopMC's kernel trace mode can detect thread switching which distinguishs performance counters of different threads, no matter whether thread migration happens.

  • User level RDPMC instruction: TopMC provides API using RDPMC instruction to monitor code fragements of applications. RDPMC instruction costs only tens of cycles.Thus, applications can obtain more accurate performance counters.

  • Support uncore events in Nehalem,ivybridge_e,broadwell,skylake_sp: The Intel architecture splits the performance counter events into two parts: core events and uncore events. Each part has its own performance counter registers. Most performance counter monitor tools can only support core events, but not uncore events related with L3 cache and memory controller.

Features of the TopMC

Besides providing low overhead performance counters at specific time, TopMC can collect performance counter traces which are helpful to analysis periodical behaviors of applications.

Installation

Operating environment

linux kernel 3.10.X or 4.18

Compile and Install

  • Change to 'topmc_mod' direcotry, and compile the module:
     make -C /lib/modules/$(shell uname -r) M=`pwd` modules 
  • Install the kernel module:
     insmod topmc_module.ko 

Interface with Performance Counter Units

  • Change to '/proc/topmc' directory:

    If 'topmc_module' is successfully loaded, all the performance counter registers of each CPU have their corresponding file under the '/proc/topmc' directory.

Usage

TopMC provides two ways to use performance counters. One is performance counter trace of an entire application. The other is performance counter value of a piece of code.

Collect Performance Counter Trace

  • Set the performance counter event to counters:

    cd topmc_script/
    vim set_event_nehelam.sh (intel) or
    vim set_event_amd.sh (amd)           
    After change the value of "incore_counter0_event" to what you are interested.
    
    ./set_event_nehelam.sh (intel) or
    ./set_event_amd.sh (amd)
  • Run your application immediately:

    python display_topmc.py

    results

  • Gather the performance counters:

    python record_result.py
    The interval time of collecting counters can be varied in record_result.py by changing the sleeping time.
    

Monitor Piece of Codes

  • Set the performance counter event to counters:

      cd topmc_script/
      vim set_event_nehelam.sh (intel) or
      vim set_event_amd.sh (amd)     
      
    After change the value of "incore_counter0_event" to what you are interested.
    
    ./set_event_nehelam.sh (intel) or
    ./set_event_amd.sh (amd)
  • Insert corresponding macros to monitored codes and rebuild the application:

    Samples can be found in the source codes.

Differences between TopMC and Perf

Perf is a performance analyzing tool in Linux,TopMC can be competent at anything that TopMC can do, while TopMC supports uncore events better. Perf can't count the performance of the certain function, but TopMC can do it, just insert some code at the beginning and end of the function. such as :

#counter event format:'**--xyz#'
#'**' stands for event
#'--' stands for unit mask
#'x' stands for cmask (only 4-bit value)
#'y' stands for inv 
#'z' stands for edge
#'#' stands for usr/kernel:0-nothing,1-user,2-kernel,3-user and kernel

incore_counter0_event='**--xyz#'
echo "0" >  "/proc/topmc/core0/incore_counter0/enable"  # you can change the core number and counter number
echo incore_counter0_event > "/proc/topmc/core0/incore_counter0/event"  # you can change the core number and counter number
echo "1" >  "/proc/topmc/core0/incore_counter0/enable"  # you can change the core number and counter number

####
# your function
####

# display_topmc.py will record the value of the certain event
python topmc_script/display_topmc.py

Implementation Mechanism

The implementation of performance counter monitor tools focus on the interaction with linux kernel. TopMC registers a procs file system using linux kernel module. The framework also contains how to collect kernel trace:

architecture

Our Website of TopMC

http://asg.ict.ac.cn/projects/topmc/

About

TopMC supports fine-grained continuous acquisition of architectural features during program execution without incurring significant overhead or interfering with program behavior. Fine grained features can mine rich and key features, such as the peak bandwidth requirement of memory.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published