thinktime_spin option can cause fio worker to sleep for too long #1588

kelleymh · 2023-06-30T20:59:24Z

Please acknowledge the following before creating a ticket

[X] I have read the GitHub issues section of REPORTING-BUGS.

Description of the bug:
When the thinktime_spin option specifies a value that is within a few milliseconds of the thinktime value, the duration of usec_spin() in handle_thinktime() can exceed the thinktime value in a VM environment. While usec_spin() is running, the vCPU could get de-scheduled or the hypervisor could steal CPU time from the vCPU. When the guest vCPU runs again after being rescheduled, it may read the clock and find that more time has elapsed than intended. In that case, the code in handle_thinktime() calculates a negative value for 'left'. 'left' is then cast to unsigned long long for comparison with 'runtime_left', so the negative value compares as a very large number and 'left' is set to 'runtime_left'. Finally, usec_sleep() is called for 'left' amount of time, which lasts until the end of the job, when the worker should not have slept at all.

The solution is to use code like this after the call to usec_spin():

    if (total < td->o.thinktime)
            left = td->o.thinktime - total;
    else
            left = 0;

I've tested this fix and it solves the problem I observe.
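
To make the arithmetic concrete, here is a minimal standalone sketch. It is not fio's actual code: the helper names `left_unclamped`, `left_clamped`, and `thinktime_left_selfcheck` are hypothetical, and the logic only models the behavior described above (a signed 'left', a cast to unsigned long long, and a cap at 'runtime_left'), assuming all values are in microseconds.

```c
#include <assert.h>
#include <stdint.h>

/* Hypothetical model of the behavior described in this report:
 * 'left' is signed, so it goes negative when the spin overshoots. */
static int64_t left_unclamped(uint64_t thinktime, uint64_t total,
                              uint64_t runtime_left)
{
    int64_t left = (int64_t)thinktime - (int64_t)total;

    /* A negative 'left' converts to a huge unsigned value, so this
     * comparison is true and 'left' becomes the remaining runtime. */
    if (runtime_left < (unsigned long long)left)
        left = (int64_t)runtime_left;
    return left;
}

/* Same computation with the proposed check: never go below zero. */
static uint64_t left_clamped(uint64_t thinktime, uint64_t total,
                             uint64_t runtime_left)
{
    uint64_t left = (total < thinktime) ? thinktime - total : 0;

    if (runtime_left < left)
        left = runtime_left;
    return left;
}

/* Exercises both helpers with illustrative (made-up) numbers. */
void thinktime_left_selfcheck(void)
{
    /* Spin overshoots a 10 ms thinktime by 500 us; 60 s of runtime left. */
    assert(left_unclamped(10000, 10500, 60000000) == 60000000);
    assert(left_clamped(10000, 10500, 60000000) == 0);

    /* Normal case: spin took 8 ms, so 2 ms of sleep remains either way. */
    assert(left_unclamped(10000, 8000, 60000000) == 2000);
    assert(left_clamped(10000, 8000, 60000000) == 2000);
}
```

Calling `thinktime_left_selfcheck()` from a small `main` runs both cases: with the overshoot, the unclamped version asks for a sleep equal to the remaining runtime, while the clamped version asks for no sleep at all.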

Environment: Ubuntu 20.04 running a 5.15 kernel as a guest VM in the Azure cloud. But the problem could happen in any VM environment where vCPUs are subject to getting de-scheduled or are sharing cycles with the hypervisor.

fio version: 3.35. The same problem happens with earlier versions such as 3.7 and 3.16.

Reproduction steps
See above.


ankit-sam · 2023-07-14T10:55:58Z

Hi @kelleymh, the changes look good to me. Can you please send a patch to the fio mailing list?

axboe · 2023-07-14T14:34:18Z

Or a PR in here is fine too. I'm also fine making the edit myself; let us know what you prefer, @kelleymh.

kelleymh · 2023-07-14T16:53:12Z

I have a patch ready. I'll post it to the fio mailing list.

vincentkfu · 2023-07-14T18:06:31Z

Thanks for reviewing @ankit-sam

axboe closed this as completed in 14adf6e Jul 14, 2023
