UCS/ARCH: Read from `cpu MHz` for CPU speed on x86 #2793

rzambre · 2018-08-13T16:50:45Z

When the user changes the frequency of the CPU, UCS still reads the base frequency on x86 for its calculations. This patch ensures that the CPU frequency is read from the `cpu MHz` field of `cpuinfo` and not from `model name`. The function that reads from the model name is removed because it is not used anywhere else and, if kept, would throw a build error. Signed-off-by: Rohit Zambre <rzambre@uci.edu>

swx-jenkins1 · 2018-08-13T16:53:04Z

Can one of the admins verify this patch?

hoopoepg · 2018-08-13T16:53:46Z

ok to test

mellanox-github · 2018-08-13T16:55:03Z

Can one of the admins verify this patch?

shamisp · 2018-08-13T17:21:34Z

It is okay for now but in longer term for x86 I would suggest to follow perftest approach
https://github.com/linux-rdma/perftest/blob/master/src/get_clock.c

swx-jenkins1 · 2018-08-13T17:51:52Z

Test PASSed.
See http://bgate.mellanox.com/jenkins/job/gh-ucx-pr/5040/ for details.

mellanox-github · 2018-08-13T17:58:03Z

Test FAILed.
See http://hpc-master.lab.mtl.com:8080/job/hpc-ucx-pr/7448/ for details (Mellanox internal link).

yosefe

this fix is incorrect since the clock reported by "cpu MHz" in procinfo is not always accurate and may change

yosefe · 2018-08-30T12:20:13Z

as discussed on conf call, suggested approach is to have "slow and accurate" and "fast and inaccurate" version of get_time

shamisp · 2018-08-30T13:48:52Z

@rzambre what is the paper that you mentioned the other day ?

rzambre · 2018-08-30T14:18:30Z

This is the paper that I was referring to: https://www.intel.com/content/dam/www/public/us/en/documents/white-papers/ia-32-ia-64-benchmark-code-execution-paper.pdf The paper talks how to measure for architectures that do and do not support rdtscp.

The paper's methodology also seems to be used in courses at universities: http://cseweb.ucsd.edu/classes/wi16/cse221-a/timing.html The infrastructure is basically the code snippet under the Examine the Assembly Code section.

The overhead of this infrastructure is 16ns (similar to the current infrastructure), according to my measurements.

Since UCS has different macros for start (SCOPE_START) and end (SCOPE_END) of timers, I think we can forward the ucs_profile_type_t to ucs_get_time to call the right sequence of assembly instructions. For SAMPLE, a rdtscp followed by cpuid would be the right thing (the end-timer portion of the infrastructure), according to the explanation in the white paper.

yosefe · 2018-12-03T11:59:38Z

@rzambre closing this one, replaced by #3070

yosefe requested changes Aug 15, 2018

View reviewed changes

yosefe added Bugfix Waiting for Author Response labels Aug 30, 2018

yosefe mentioned this pull request Dec 1, 2018

Use gettimeofday() in perftest and if rdtsc is unstable #3070

Merged

yosefe closed this Dec 3, 2018

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

UCS/ARCH: Read from `cpu MHz` for CPU speed on x86 #2793

UCS/ARCH: Read from `cpu MHz` for CPU speed on x86 #2793

rzambre commented Aug 13, 2018

swx-jenkins1 commented Aug 13, 2018

hoopoepg commented Aug 13, 2018

mellanox-github commented Aug 13, 2018

shamisp commented Aug 13, 2018

swx-jenkins1 commented Aug 13, 2018

mellanox-github commented Aug 13, 2018

yosefe left a comment

yosefe commented Aug 30, 2018

shamisp commented Aug 30, 2018

rzambre commented Aug 30, 2018 •

edited

yosefe commented Dec 3, 2018

UCS/ARCH: Read from cpu MHz for CPU speed on x86 #2793

UCS/ARCH: Read from cpu MHz for CPU speed on x86 #2793

Conversation

rzambre commented Aug 13, 2018

swx-jenkins1 commented Aug 13, 2018

hoopoepg commented Aug 13, 2018

mellanox-github commented Aug 13, 2018

shamisp commented Aug 13, 2018

swx-jenkins1 commented Aug 13, 2018

mellanox-github commented Aug 13, 2018

yosefe left a comment

Choose a reason for hiding this comment

yosefe commented Aug 30, 2018

shamisp commented Aug 30, 2018

rzambre commented Aug 30, 2018 • edited

yosefe commented Dec 3, 2018

UCS/ARCH: Read from `cpu MHz` for CPU speed on x86 #2793

UCS/ARCH: Read from `cpu MHz` for CPU speed on x86 #2793

rzambre commented Aug 30, 2018 •

edited