Skip to content

Commit 9ec2353

Browse files
David HildenbrandIngo Molnar
authored andcommitted
sched/preempt, mm/fault: Trigger might_sleep() in might_fault() with disabled pagefaults
Commit 662bbcb ("mm, sched: Allow uaccess in atomic with pagefault_disable()") removed might_sleep() checks for all user access code (that uses might_fault()). The reason was to disable wrong "sleep in atomic" warnings in the following scenario: pagefault_disable() rc = copy_to_user(...) pagefault_enable() Which is valid, as pagefault_disable() increments the preempt counter and therefore disables the pagefault handler. copy_to_user() will not sleep and return an error code if a page is not available. However, as all might_sleep() checks are removed, CONFIG_DEBUG_ATOMIC_SLEEP would no longer detect the following scenario: spin_lock(&lock); rc = copy_to_user(...) spin_unlock(&lock) If the kernel is compiled with preemption turned on, preempt_disable() will make in_atomic() detect disabled preemption. The fault handler would correctly never sleep on user access. However, with preemption turned off, preempt_disable() is usually a NOP (with !CONFIG_PREEMPT_COUNT), therefore in_atomic() will not be able to detect disabled preemption nor disabled pagefaults. The fault handler could sleep. We really want to enable CONFIG_DEBUG_ATOMIC_SLEEP checks for user access functions again, otherwise we can end up with horrible deadlocks. Root of all evil is that pagefault_disable() acts almost as preempt_disable(), depending on preemption being turned on/off. As we now have pagefault_disabled(), we can use it to distinguish whether user acces functions might sleep. Convert might_fault() into a makro that calls __might_fault(), to allow proper file + line messages in case of a might_sleep() warning. Reviewed-and-tested-by: Thomas Gleixner <tglx@linutronix.de> Signed-off-by: David Hildenbrand <dahi@linux.vnet.ibm.com> Signed-off-by: Peter Zijlstra (Intel) <peterz@infradead.org> Cc: David.Laight@ACULAB.COM Cc: Linus Torvalds <torvalds@linux-foundation.org> Cc: Peter Zijlstra <peterz@infradead.org> Cc: Thomas Gleixner <tglx@linutronix.de> Cc: airlied@linux.ie Cc: akpm@linux-foundation.org Cc: benh@kernel.crashing.org Cc: bigeasy@linutronix.de Cc: borntraeger@de.ibm.com Cc: daniel.vetter@intel.com Cc: heiko.carstens@de.ibm.com Cc: herbert@gondor.apana.org.au Cc: hocko@suse.cz Cc: hughd@google.com Cc: mst@redhat.com Cc: paulus@samba.org Cc: ralf@linux-mips.org Cc: schwidefsky@de.ibm.com Cc: yang.shi@windriver.com Link: http://lkml.kernel.org/r/1431359540-32227-3-git-send-email-dahi@linux.vnet.ibm.com Signed-off-by: Ingo Molnar <mingo@kernel.org>
1 parent 8bcbde5 commit 9ec2353

File tree

2 files changed

+8
-13
lines changed

2 files changed

+8
-13
lines changed

include/linux/kernel.h

Lines changed: 2 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -244,7 +244,8 @@ static inline u32 reciprocal_scale(u32 val, u32 ep_ro)
244244

245245
#if defined(CONFIG_MMU) && \
246246
(defined(CONFIG_PROVE_LOCKING) || defined(CONFIG_DEBUG_ATOMIC_SLEEP))
247-
void might_fault(void);
247+
#define might_fault() __might_fault(__FILE__, __LINE__)
248+
void __might_fault(const char *file, int line);
248249
#else
249250
static inline void might_fault(void) { }
250251
#endif

mm/memory.c

Lines changed: 6 additions & 12 deletions
Original file line numberDiff line numberDiff line change
@@ -3737,7 +3737,7 @@ void print_vma_addr(char *prefix, unsigned long ip)
37373737
}
37383738

37393739
#if defined(CONFIG_PROVE_LOCKING) || defined(CONFIG_DEBUG_ATOMIC_SLEEP)
3740-
void might_fault(void)
3740+
void __might_fault(const char *file, int line)
37413741
{
37423742
/*
37433743
* Some code (nfs/sunrpc) uses socket ops on kernel memory while
@@ -3747,21 +3747,15 @@ void might_fault(void)
37473747
*/
37483748
if (segment_eq(get_fs(), KERNEL_DS))
37493749
return;
3750-
3751-
/*
3752-
* it would be nicer only to annotate paths which are not under
3753-
* pagefault_disable, however that requires a larger audit and
3754-
* providing helpers like get_user_atomic.
3755-
*/
3756-
if (in_atomic())
3750+
if (pagefault_disabled())
37573751
return;
3758-
3759-
__might_sleep(__FILE__, __LINE__, 0);
3760-
3752+
__might_sleep(file, line, 0);
3753+
#if defined(CONFIG_DEBUG_ATOMIC_SLEEP)
37613754
if (current->mm)
37623755
might_lock_read(&current->mm->mmap_sem);
3756+
#endif
37633757
}
3764-
EXPORT_SYMBOL(might_fault);
3758+
EXPORT_SYMBOL(__might_fault);
37653759
#endif
37663760

37673761
#if defined(CONFIG_TRANSPARENT_HUGEPAGE) || defined(CONFIG_HUGETLBFS)

0 commit comments

Comments
 (0)