Xanmod 5.12 CacULE-r2 #147

hamadmarri · 2021-05-14T05:51:19Z

No description provided.

…ive task. This enhances responsiveness, since the interactive task doesn't compete with another interactive task on the same CPU. We pick the CPU that has the least interactivity for a new/wakeup interactive task. This patch only enhances interactivity when autogroup is disabled; if autogroup is enabled, the patch has no effect.

I also introduced a new sysctl, sched_interactivity_threshold = 20480. The threshold is set to sched_interactivity_factor/1.6. You can set the threshold to 0, which disables the effects of this patch. The maximum threshold is 2 x sched_interactivity_factor.

Note: the threshold is used only in select_task_rq_fair to determine whether a task is interactive. If it is interactive, a CPU with the least interactive score is selected. If not, the normal cacule/cfs path is used.
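The selection rule described in this commit message can be sketched in plain userspace C. This is only an illustration under stated assumptions, not the patch's kernel code: `struct cpu_summary`, `interactive_load`, `task_is_interactive`, `find_least_interactive_cpu`, and `select_cpu` are hypothetical names, the factor value 32768 is derived from the quoted 20480 * 1.6, and the sketch assumes a lower score means a more interactive task (which is consistent with a threshold of 0 disabling the feature).

```c
#include <stddef.h>

/* Threshold quoted in the commit message; the factor is derived (20480 * 1.6). */
#define SCHED_INTERACTIVITY_THRESHOLD 20480u
#define SCHED_INTERACTIVITY_FACTOR    32768u

/* Hypothetical per-CPU summary; the real patch inspects kernel run queues. */
struct cpu_summary {
    unsigned int interactive_load; /* assumed metric: lower = less interactive work queued */
};

/*
 * Assumption: a lower score means a more interactive task, so an
 * unsigned score can never be below a threshold of 0 (feature off).
 */
static int task_is_interactive(unsigned int score, unsigned int threshold)
{
    return score < threshold;
}

/* Return the index of the CPU with the smallest interactive_load (ties: lowest index). */
static int find_least_interactive_cpu(const struct cpu_summary *cpus, size_t ncpus)
{
    size_t best = 0;

    for (size_t i = 1; i < ncpus; i++) {
        if (cpus[i].interactive_load < cpus[best].interactive_load)
            best = i;
    }
    return (int)best;
}

/*
 * fallback_cpu stands in for whatever the normal cacule/cfs path would pick.
 * Only interactive tasks are redirected to the least interactive CPU.
 */
static int select_cpu(unsigned int task_score, unsigned int threshold,
                      const struct cpu_summary *cpus, size_t ncpus,
                      int fallback_cpu)
{
    if (ncpus == 0 || !task_is_interactive(task_score, threshold))
        return fallback_cpu;
    return find_least_interactive_cpu(cpus, ncpus);
}
```

With three CPUs whose `interactive_load` values are 5, 1, and 3, a task scoring 100 would be placed on CPU 1, while a task scoring 30000 (above the threshold) would keep the fallback CPU. Setting the threshold to 0 makes every task take the fallback path.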

…find_and_get_node_by_id()

[ Upstream commit 4207b55 ]

The BPF helper bpf_cgroup_from_id() calls kernfs_find_and_get_node_by_id() which acquires kernfs_idr_lock, which is an non-raw non-IRQ-safe lock. This can lead to deadlocks as bpf_cgroup_from_id() can be called from any BPF programs including e.g. the ones that attach to functions which are holding the scheduler rq lock.

Consider the following BPF program:

    SEC("fentry/__set_cpus_allowed_ptr_locked")
    int BPF_PROG(__set_cpus_allowed_ptr_locked, struct task_struct *p,
                 struct affinity_context *affn_ctx, struct rq *rq, struct rq_flags *rf)
    {
            struct cgroup *cgrp = bpf_cgroup_from_id(p->cgroups->dfl_cgrp->kn->id);

            if (cgrp) {
                    bpf_printk("%d[%s] in %s", p->pid, p->comm, cgrp->kn->name);
                    bpf_cgroup_release(cgrp);
            }
            return 0;
    }

__set_cpus_allowed_ptr_locked() is called with rq lock held and the above BPF program calls bpf_cgroup_from_id() within leading to the following lockdep warning:

    =====================================================
    WARNING: HARDIRQ-safe -> HARDIRQ-unsafe lock order detected
    6.7.0-rc3-work-00053-g07124366a1d7-dirty #147 Not tainted
    -----------------------------------------------------
    repro/1620 [HC0[0]:SC0[0]:HE0:SE1] is trying to acquire:
    ffffffff833b3688 (kernfs_idr_lock){+.+.}-{2:2}, at: kernfs_find_and_get_node_by_id+0x1e/0x70

    and this task is already holding:
    ffff888237ced698 (&rq->__lock){-.-.}-{2:2}, at: task_rq_lock+0x4e/0xf0
    which would create a new lock dependency:
     (&rq->__lock){-.-.}-{2:2} -> (kernfs_idr_lock){+.+.}-{2:2}
    ...
    Possible interrupt unsafe locking scenario:

           CPU0                    CPU1
           ----                    ----
      lock(kernfs_idr_lock);
                                   local_irq_disable();
                                   lock(&rq->__lock);
                                   lock(kernfs_idr_lock);
      <Interrupt>
        lock(&rq->__lock);

     *** DEADLOCK ***
    ...
    Call Trace:
     dump_stack_lvl+0x55/0x70
     dump_stack+0x10/0x20
     __lock_acquire+0x781/0x2a40
     lock_acquire+0xbf/0x1f0
     _raw_spin_lock+0x2f/0x40
     kernfs_find_and_get_node_by_id+0x1e/0x70
     cgroup_get_from_id+0x21/0x240
     bpf_cgroup_from_id+0xe/0x20
     bpf_prog_98652316e9337a5a___set_cpus_allowed_ptr_locked+0x96/0x11a
     bpf_trampoline_6442545632+0x4f/0x1000
     __set_cpus_allowed_ptr_locked+0x5/0x5a0
     sched_setaffinity+0x1b3/0x290
     __x64_sys_sched_setaffinity+0x4f/0x60
     do_syscall_64+0x40/0xe0
     entry_SYSCALL_64_after_hwframe+0x46/0x4e

Let's fix it by protecting kernfs_node and kernfs_root with RCU and making kernfs_find_and_get_node_by_id() acquire rcu_read_lock() instead of kernfs_idr_lock.

This adds an rcu_head to kernfs_node making it larger by 16 bytes on 64bit. Combined with the preceding rearrange patch, the net increase is 8 bytes.

Signed-off-by: Tejun Heo <tj@kernel.org>
Cc: Andrea Righi <andrea.righi@canonical.com>
Cc: Geert Uytterhoeven <geert@linux-m68k.org>
Link: https://lore.kernel.org/r/20240109214828.252092-4-tj@kernel.org
Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>
Signed-off-by: Sasha Levin <sashal@kernel.org>

hamadmarri added 20 commits May 14, 2021 07:31

remove RDB patch from CacULE. RDB will be a separate patch.

5912e68

Remove harsh_mode

8efc10d

rename reset_lifetime to normalize_lifetime

230a803

cleanup

d1f543b

set cacule_max_lifetime to 1.6s, which normalizes to 800ms (i.e. the lifetime of tasks normalizes every 800ms)

20a0c21

set cacule_max_lifetime to 22s, which normalizes to 11s (i.e. the lifetime of tasks normalizes every 11s)

e85372a

Responsive wakeup with autogroup shows smoother window animations. The benchmark (Python responsive script) shows bad results because autogroup assignment tries to group the interactive task with others. However, running the interactive task in a separate terminal shows good results.

7c43055

decouple se.vruntime from cn.vruntime

19ed61d

increase the precision of lifetime normalize from x8 to x1024

ccf43ae

select prev_cpu if interactive wakeup and number of tasks in prev_cpu == 0

1fb2139

cleanup

a469e4d

add missing #ifdef CACULE in debug.c

dcf2886

interactivity_threshold to 1000.

88060d0

cleanup

ae55d78

cleanup

21990a6

use sched_clock() directly without now declaration.

5caeb4d

__pick_last_entity returns cfs_rq->tail

13aacfb

cleanup

4f8e2e7

fix: pick last in debug check if null

3dcd32c

xanmod merged commit 88b4ecc into xanmod:5.12-cacule May 14, 2021

hamadmarri deleted the xanmod-5.12-cacule branch June 1, 2021 13:45
