osd: Make dmclock's anticipation timeout be configurable #18827

TaewoongKim · 2017-11-09T00:41:31Z

This adds a configuration option that can control anticipation timeout for dmclock

This helps more accurate QoS or priority based scheduling when dmclock is used with this.

By setting with an appropriate value, a client or an operation type could take their unused resource that could be forfeited by other aggressive clients or operation types.

Signed-off-by: Taewoong Kim taewoong.kim@sk.com

myoungwon · 2017-11-09T01:02:14Z

src/osd/mClockClientQueue.cc

+  mClockClientQueue::mClockClientQueue(CephContext *cct,
+				       double anticipation_timeout) :
+    queue(std::bind(&mClockClientQueue::op_class_client_info_f, this, _1),
+	  anticipation_timeout),


How is anticipation_timeout used ?

My bad. I missed something.
Thank you for pointing that. I fixed it.

myoungwon · 2017-11-23T11:22:00Z

@TaewoongKim This PR seems that only the anticipation timeout is set. Could you explain how this value is used ?

TaewoongKim · 2017-11-27T14:17:46Z

@myoungwon Yes, it just set dmclock's parameter
I requested PR of dmclock project about anticipation timeout.
It was merged a few weeks ago. (ceph/dmclock#43, ceph/dmclock#34)
Now, this PR is for enabling the dmclock anticipation timeout PR on Ceph.

Dmclock scheduler supports IOPS reservation.
However, an aggressive worker can take light woker's reserved shares.
Assume that worker A is a light worker whose IOPS reservation is 100 IOPS and worker B is an aggressive worker whose IOPS reservation is 1 IOPS.
Also assume that Woker A generates just 10 IOs in one second but every its IO does not come in exactly every 10 ms(arrived with a very small variation)
In this case, Worker A couldn't get serviced 10 IOPS, because worker B takes worker A's share.
If Worker A's IO is a little late(even 1ms) Worker B's IOs will be processed rather than Woker A's and Worker A's IO will be delayed.
(You can see an example in ceph/dmclock#34)

This is because dmclock reset the time tag of worker A's IO.
In a normal case, dmclock set time tag for IO based on previous IO's tag.
However, if an IO arrived more than (1/reserved IOPS) ms later since the previous IO that belongs to the same worker arrived,
the time tag of newly arrived IO is reset by the current time.

Setting anticipation timeout can prevent this situation.
Reset will be deferred by anticipation timeout and time tag will be set based on previous IO's tag.

myoungwon · 2017-12-11T05:31:33Z

@TaewoongKim need rebase

Signed-off-by: Taewoong Kim <taewoong.kim@sk.com>

TaewoongKim · 2017-12-19T15:29:51Z

@myoungwon Rebased

myoungwon · 2018-01-04T01:34:33Z

@tchaikov @liewegas @ivancich Could you take a look? dmclock code related to this PR has already been merged.

yuriw · 2018-01-06T17:33:24Z

wip-yuri4-testing-2018-01-06-1732

myoungwon reviewed Nov 9, 2017

View reviewed changes

TaewoongKim force-pushed the anticipation_timeout branch from a571add to 7f46c18 Compare November 9, 2017 01:30

liewegas added the core label Nov 16, 2017

myoungwon requested a review from ivancich December 11, 2017 05:31

TaewoongKim force-pushed the anticipation_timeout branch from 7f46c18 to 496c610 Compare December 19, 2017 08:38

osd: Make dmclock's anticipation timeout be configurable

d2ea0b5

Signed-off-by: Taewoong Kim <taewoong.kim@sk.com>

TaewoongKim force-pushed the anticipation_timeout branch from 496c610 to d2ea0b5 Compare December 19, 2017 08:58

myoungwon approved these changes Jan 4, 2018

View reviewed changes

liewegas approved these changes Jan 4, 2018

View reviewed changes

liewegas added the needs-qa label Jan 4, 2018

yuriw added the wip-yuri4-testing label Jan 6, 2018

ivancich merged commit 158f317 into ceph:master Jan 7, 2018

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

osd: Make dmclock's anticipation timeout be configurable #18827

osd: Make dmclock's anticipation timeout be configurable #18827

TaewoongKim commented Nov 9, 2017 •

edited by myoungwon

myoungwon Nov 9, 2017

TaewoongKim Nov 9, 2017

myoungwon commented Nov 23, 2017

TaewoongKim commented Nov 27, 2017 •

edited

myoungwon commented Dec 11, 2017

TaewoongKim commented Dec 19, 2017

myoungwon commented Jan 4, 2018

yuriw commented Jan 6, 2018

osd: Make dmclock's anticipation timeout be configurable #18827

osd: Make dmclock's anticipation timeout be configurable #18827

Conversation

TaewoongKim commented Nov 9, 2017 • edited by myoungwon

myoungwon Nov 9, 2017

Choose a reason for hiding this comment

TaewoongKim Nov 9, 2017

Choose a reason for hiding this comment

myoungwon commented Nov 23, 2017

TaewoongKim commented Nov 27, 2017 • edited

myoungwon commented Dec 11, 2017

TaewoongKim commented Dec 19, 2017

myoungwon commented Jan 4, 2018

yuriw commented Jan 6, 2018

TaewoongKim commented Nov 9, 2017 •

edited by myoungwon

TaewoongKim commented Nov 27, 2017 •

edited