locking/rtmutex: Prevent spurious EDEADLK return caused by ww_mutexes

rtmutex based ww_mutexes can legitimately create a cycle in the lock graph
which can be observed by a blocker which didn't cause the problem:

    P1: A, ww_A, ww_B
    P2: ww_B, ww_A
    P3: A

P3 might therefore be trapped in the ww_mutex induced cycle and run into
the lock depth limitation of rt_mutex_adjust_prio_chain() which returns
-EDEADLK to the caller.

Disable the deadlock detection walk when the chain walk observes a
ww_mutex to prevent this looping.

[ tglx: Split it apart and added changelog ]

Reported-by: Sebastian Siewior <bigeasy@linutronix.de>
Fixes: add461325ec5 ("locking/rtmutex: Extend the rtmutex core to support ww_mutex")
Signed-off-by: Peter Zijlstra (Intel) <peterz@infradead.org>
Signed-off-by: Thomas Gleixner <tglx@linutronix.de>
Link: https://lore.kernel.org/r/YSeWjCHoK4v5OcOt@hirez.programming.kicks-ass.net
author: Peter Zijlstra <peterz@infradead.org> 2021-08-26 09:36:53 +0200
committer: Thomas Gleixner <tglx@linutronix.de> 2021-08-27 14:28:49 +0200
commit: 6467822b8cc96e5feda98c7bf5c6329c6a896c91 (patch)
tree: e48f49c0c44f69ee2ea93b98e18d17bc5ea1c17d /kernel/locking
parent: 37e8abff2bebbf9947d6b784f5c75ed48a717089 (diff)
download: linux-6467822b8cc96e5feda98c7bf5c6329c6a896c91.tar.gz
1 files changed, 25 insertions, 0 deletions
diff --git a/kernel/locking/rtmutex.c b/kernel/locking/rtmutex.c
index c8fe74ef8db9..3c1ba7b9a326 100644
--- a/kernel/locking/rtmutex.c
+++ b/kernel/locking/rtmutex.c
@@ -657,6 +657,31 @@ static int __sched rt_mutex_adjust_prio_chain(struct task_struct *task,
 		goto out_unlock_pi;
 
 	/*
+	 * There could be 'spurious' loops in the lock graph due to ww_mutex,
+	 * consider:
+	 *
+	 *   P1: A, ww_A, ww_B
+	 *   P2: ww_B, ww_A
+	 *   P3: A
+	 *
+	 * P3 should not return -EDEADLK because it gets trapped in the cycle
+	 * created by P1 and P2 (which will resolve -- and runs into
+	 * max_lock_depth above). Therefore disable detect_deadlock such that
+	 * the below termination condition can trigger once all relevant tasks
+	 * are boosted.
+	 *
+	 * Even when we start with ww_mutex we can disable deadlock detection,
+	 * since we would suppress a ww_mutex induced deadlock at [6] anyway.
+	 * Suppressing it here however is not sufficient since we might still
+	 * hit [6] due to adjustment driven iteration.
+	 *
+	 * NOTE: if someone were to create a deadlock between 2 ww_classes we'd
+	 * utterly fail to report it; lockdep should.
+	 */
+	if (IS_ENABLED(CONFIG_PREEMPT_RT) && waiter->ww_ctx && detect_deadlock)
+		detect_deadlock = false;
+
+	/*
 	 * Drop out, when the task has no waiters. Note,
 	 * top_waiter can be NULL, when we are in the deboosting
 	 * mode!
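
The P1/P2/P3 scenario from the changelog can be reproduced with a small
userspace toy model. This is only a sketch under loud assumptions, not kernel
code: struct toy_task, struct toy_lock, toy_chain_walk(), toy_scenario() and
the "larger prio value wins" convention are all hypothetical names and
simplifications introduced for illustration. The model keeps only three ideas
from rt_mutex_adjust_prio_chain(): the depth limit that yields -EDEADLK, the
early exit that is taken only when deadlock detection is off and no further
boosting is needed, and the new check that clears detect_deadlock once a
waiter with a ww acquire context is seen.

```c
#include <assert.h>
#include <errno.h>
#include <stdbool.h>
#include <stddef.h>

#define TOY_MAX_LOCK_DEPTH 1024	/* stand-in for the kernel's max_lock_depth */

struct toy_lock;

/* Hypothetical, heavily simplified task: not the kernel's task_struct. */
struct toy_task {
	int prio;			/* toy convention: larger value = more important */
	struct toy_lock *blocked_on;	/* lock this task waits for, or NULL */
	bool waits_with_ww_ctx;		/* models a non-NULL waiter->ww_ctx */
};

struct toy_lock {
	struct toy_task *owner;
};

/*
 * Walk task -> blocked_on -> owner -> ... and propagate the starting
 * task's priority. Returns 0 when the walk terminates, -EDEADLK when
 * the depth limit is exceeded.
 */
static int toy_chain_walk(struct toy_task *task, bool detect_deadlock,
			  bool apply_fix)
{
	int depth = 0;
	int prio;

	assert(task != NULL);
	prio = task->prio;

	while (task->blocked_on) {
		struct toy_task *owner;

		if (++depth > TOY_MAX_LOCK_DEPTH)
			return -EDEADLK;

		/* Models the check added by this patch. */
		if (apply_fix && task->waits_with_ww_ctx)
			detect_deadlock = false;

		owner = task->blocked_on->owner;
		if (!owner)
			return 0;

		if (owner->prio >= prio) {
			/* Nothing left to boost: stop unless a full walk is requested. */
			if (!detect_deadlock)
				return 0;
		} else {
			owner->prio = prio;
		}
		task = owner;
	}
	return 0;
}

/* Builds the changelog scenario and lets P3 block on A. */
static int toy_scenario(bool apply_fix)
{
	struct toy_task p1 = { .prio = 1, .waits_with_ww_ctx = true };
	struct toy_task p2 = { .prio = 1, .waits_with_ww_ctx = true };
	struct toy_task p3 = { .prio = 5 };
	struct toy_lock a = { .owner = &p1 };
	struct toy_lock ww_a = { .owner = &p1 };
	struct toy_lock ww_b = { .owner = &p2 };

	p1.blocked_on = &ww_b;	/* P1 holds A and ww_A, waits for ww_B */
	p2.blocked_on = &ww_a;	/* P2 holds ww_B, waits for ww_A */
	p3.blocked_on = &a;	/* P3 only wants A */

	return toy_chain_walk(&p3, true, apply_fix);
}
```

In this model toy_scenario(false) keeps circling between P1 and P2 until the
depth limit trips and P3 gets -EDEADLK, while toy_scenario(true) stops as soon
as both tasks in the cycle carry P3's priority and returns 0. The real function
additionally handles requeueing, deboosting and the owner/orig_lock deadlock
check at [6], none of which is modelled here.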