Skip to content

Commit 47d8ac0

Browse files
mmhalPaolo Abeni
authored and
Paolo Abeni
committed
af_unix: Fix garbage collector racing against connect()
Garbage collector does not take into account the risk of embryo getting enqueued during the garbage collection. If such embryo has a peer that carries SCM_RIGHTS, two consecutive passes of scan_children() may see a different set of children. Leading to an incorrectly elevated inflight count, and then a dangling pointer within the gc_inflight_list. sockets are AF_UNIX/SOCK_STREAM S is an unconnected socket L is a listening in-flight socket bound to addr, not in fdtable V's fd will be passed via sendmsg(), gets inflight count bumped connect(S, addr) sendmsg(S, [V]); close(V) __unix_gc() ---------------- ------------------------- ----------- NS = unix_create1() skb1 = sock_wmalloc(NS) L = unix_find_other(addr) unix_state_lock(L) unix_peer(S) = NS // V count=1 inflight=0 NS = unix_peer(S) skb2 = sock_alloc() skb_queue_tail(NS, skb2[V]) // V became in-flight // V count=2 inflight=1 close(V) // V count=1 inflight=1 // GC candidate condition met for u in gc_inflight_list: if (total_refs == inflight_refs) add u to gc_candidates // gc_candidates={L, V} for u in gc_candidates: scan_children(u, dec_inflight) // embryo (skb1) was not // reachable from L yet, so V's // inflight remains unchanged __skb_queue_tail(L, skb1) unix_state_unlock(L) for u in gc_candidates: if (u.inflight) scan_children(u, inc_inflight_move_tail) // V count=1 inflight=2 (!) If there is a GC-candidate listening socket, lock/unlock its state. This makes GC wait until the end of any ongoing connect() to that socket. After flipping the lock, a possibly SCM-laden embryo is already enqueued. And if there is another embryo coming, it can not possibly carry SCM_RIGHTS. At this point, unix_inflight() can not happen because unix_gc_lock is already taken. Inflight graph remains unaffected. Fixes: 1fd05ba ("[AF_UNIX]: Rewrite garbage collector, fixes race.") Signed-off-by: Michal Luczaj <[email protected]> Reviewed-by: Kuniyuki Iwashima <[email protected]> Link: https://lore.kernel.org/r/[email protected] Signed-off-by: Paolo Abeni <[email protected]>
1 parent 17c5601 commit 47d8ac0

File tree

1 file changed

+17
-1
lines changed

1 file changed

+17
-1
lines changed

net/unix/garbage.c

Lines changed: 17 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -274,18 +274,34 @@ static void __unix_gc(struct work_struct *work)
274274
* receive queues. Other, non candidate sockets _can_ be
275275
* added to queue, so we must make sure only to touch
276276
* candidates.
277+
*
278+
* Embryos, though never candidates themselves, affect which
279+
* candidates are reachable by the garbage collector. Before
280+
* being added to a listener's queue, an embryo may already
281+
* receive data carrying SCM_RIGHTS, potentially making the
282+
* passed socket a candidate that is not yet reachable by the
283+
* collector. It becomes reachable once the embryo is
284+
* enqueued. Therefore, we must ensure that no SCM-laden
285+
* embryo appears in a (candidate) listener's queue between
286+
* consecutive scan_children() calls.
277287
*/
278288
list_for_each_entry_safe(u, next, &gc_inflight_list, link) {
289+
struct sock *sk = &u->sk;
279290
long total_refs;
280291

281-
total_refs = file_count(u->sk.sk_socket->file);
292+
total_refs = file_count(sk->sk_socket->file);
282293

283294
WARN_ON_ONCE(!u->inflight);
284295
WARN_ON_ONCE(total_refs < u->inflight);
285296
if (total_refs == u->inflight) {
286297
list_move_tail(&u->link, &gc_candidates);
287298
__set_bit(UNIX_GC_CANDIDATE, &u->gc_flags);
288299
__set_bit(UNIX_GC_MAYBE_CYCLE, &u->gc_flags);
300+
301+
if (sk->sk_state == TCP_LISTEN) {
302+
unix_state_lock(sk);
303+
unix_state_unlock(sk);
304+
}
289305
}
290306
}
291307

0 commit comments

Comments
 (0)