syscall: swap use of read-write locks for ForkLock #54162

Closed
junchuan-tzh opened this issue Aug 1, 2022 · 28 comments
Labels
NeedsInvestigation Someone must examine and confirm this is a valid issue and not a duplicate of an existing one.
Milestone

Comments

@junchuan-tzh

The ForkLock prevents forking while a new fd is being created and marked close-on-exec. Fd operations hold the read lock, and fork holds the write lock.

On kernels newer than 2.6, the two fd operations can be done in one atomic step, so the ForkLock is not needed there. Meanwhile, the ForkLock makes concurrent forks a bottleneck, since every fork must take the write lock.

Therefore, we propose swapping the read and write locks: fd operations hold the write lock, and fork holds the read lock. This is reasonable because fd operations "modify" process state, while fork only "reads" it. With this change, newer kernels no longer serialize concurrent forks, and older kernels still keep fd close-on-exec safe.
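
For concreteness, a minimal Linux-only sketch of the two patterns (illustrative only; socketTwoStep and socketOneStep are invented names, not the actual syscall package code):

package main

import (
    "fmt"
    "syscall"
)

// Old-kernel pattern: create the fd, then mark it close-on-exec in a
// second step. ForkLock.RLock keeps a fork from landing between the two
// steps and leaking a not-yet-CLOEXEC fd into the child.
func socketTwoStep() (int, error) {
    syscall.ForkLock.RLock()
    defer syscall.ForkLock.RUnlock()
    fd, err := syscall.Socket(syscall.AF_INET, syscall.SOCK_STREAM, 0)
    if err != nil {
        return -1, err
    }
    syscall.CloseOnExec(fd)
    return fd, nil
}

// Newer-kernel pattern (Linux 2.6.27+): SOCK_CLOEXEC sets the flag
// atomically at creation time, so no lock is needed for correctness.
func socketOneStep() (int, error) {
    return syscall.Socket(syscall.AF_INET, syscall.SOCK_STREAM|syscall.SOCK_CLOEXEC, 0)
}

func main() {
    if fd, err := socketOneStep(); err == nil {
        fmt.Println("opened fd", fd)
        syscall.Close(fd)
    }
}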

@gopherbot gopherbot added this to the Proposal milestone Aug 1, 2022
@ianlancetaylor
Contributor

This is not an API change so it doesn't have to go through the proposal process.

@ianlancetaylor ianlancetaylor changed the title proposal: syscall: exchange read-write locks for ForkLock syscall: use read-write locks for ForkLock Aug 2, 2022
@gopherbot gopherbot added the compiler/runtime Issues related to the Go compiler and/or runtime. label Aug 2, 2022
@ianlancetaylor ianlancetaylor added NeedsInvestigation Someone must examine and confirm this is a valid issue and not a duplicate of an existing one. and removed compiler/runtime Issues related to the Go compiler and/or runtime. Proposal labels Aug 2, 2022
@ianlancetaylor ianlancetaylor modified the milestones: Proposal, Backlog Aug 2, 2022
@ianlancetaylor ianlancetaylor changed the title syscall: use read-write locks for ForkLock syscall: swap use of read-write locks for ForkLock Aug 2, 2022
@rittneje

rittneje commented Aug 2, 2022

This is a breaking change. Consumers that have to set close-on-exec on fds themselves may already be using syscall.ForkLock, and you cannot force everyone to replace Lock with RLock and vice versa.

@junchuan-tzh
Author

syscall.ForkLock is only used in a few places (Pipe, Socket, Accept, Open, Dup), and swapping Lock and RLock in just those places is enough. Higher-level APIs and consumers would not notice the change.

@rittneje

rittneje commented Aug 2, 2022

No, syscall.ForkLock is a public property that anyone can use. And this is required when you are working with lower-level APIs (e.g., syscall.Socketpair) and have to set close-on-exec yourself.

Also, your statement that kernels newer than 2.6 can do this atomically (via O_CLOEXEC, etc.) only applies to Linux. For example, macOS does not support that flag in all cases. In other words, this change would cause performance issues for macOS applications that create new fds more often than they fork, which I suspect is the majority case.

@junchuan-tzh
Author

junchuan-tzh commented Aug 5, 2022

Would using an either-or lock be OK? See #23558.

Like this (only the first fork holds the write lock):

close-on-exec:

    ForkLock.RLock()
    set_fd_close_on_exec()
    ForkLock.RUnlock()

fork (with a refcnt counter):

    Mutex.Lock()
    if refcnt == 0:
        // get write lock for the first fork (if a non-fork op holds the write lock, it is also ok)
        ForkLock.Lock()
    refcnt += 1
    Mutex.Unlock()

    do_forkexec()

    Mutex.Lock()
    refcnt -= 1
    if refcnt == 0:
        // release write lock for the last fork
        ForkLock.UnLock()
    Mutex.Unlock()

@rittneje

rittneje commented Aug 5, 2022

I'm assuming you meant to write ForkLock.Unlock() for the second block. I think that will work.

@junchuan-tzh
Author

Yes, that was a typo.

There is a potential problem: the latency to get ForkLock.RLock() can be very long if there are too many forks.
Is there a better way to implement an either-or lock?

@rittneje

rittneje commented Aug 5, 2022

Unfortunately, I think you'd have to be able to ask the ForkLock whether there are pending readers in order to avoid that, and currently that information is not exposed.

@ianlancetaylor
Contributor

Perhaps we should close this issue as a dup of #23558, as they are about the same problem.

@ianlancetaylor
Contributor

Let me add that both this and #23558 have the same problem: we unfortunately exported syscall.ForkLock, and it is at least possible that people are using it, and that makes it difficult to change.

@ianlancetaylor
Contributor

OK, I think I see a way: https://go.dev/cl/421441. Anybody see a way to improve that code?

@gopherbot

Change https://go.dev/cl/421441 mentions this issue: syscall: avoid serializing forks on ForkLock

@rittneje

rittneje commented Aug 6, 2022

@ianlancetaylor This is the approach that @junchuan-tzh mentioned above. As discussed, it has an issue: if forkExec is being invoked continually, syscall.ForkLock.RLock() may never unblock.

@ianlancetaylor
Contributor

@rittneje My apologies for not reading more carefully.

I've updated https://go.dev/cl/421441 to avoid that problem. But it may be too complicated now. What do you think of that approach? Thanks.

@rittneje

rittneje commented Aug 6, 2022

Indeed, it is fairly complex. The choice of 10 as the concurrency limit (sort of) also seems very arbitrary.

Perhaps sync.Cond would be a better choice than adding a channel? Just to reduce the number of players.

var (
    forkingCond = sync.NewCond(new(sync.Mutex))
    forking     int // number of forks in flight, guarded by forkingCond.L
)

...

forkingCond.L.Lock()
if forking > 10 {
    // Roughly cap the size of the forking group so the write lock is
    // eventually released and blocked readers get a chance to run.
    forkingCond.Wait()
}
if forking == 0 {
    // First forker takes the write lock on behalf of the whole group.
    syscall.ForkLock.Lock()
}
forking++
forkingCond.L.Unlock()

defer func() {
    forkingCond.L.Lock()
    forking--
    if forking == 0 {
        // Last forker out releases the write lock and wakes any waiters.
        syscall.ForkLock.Unlock()
        forkingCond.Broadcast()
    }
    forkingCond.L.Unlock()
}()

@ianlancetaylor
Contributor

My personal feeling is that sync.Cond never makes things clearer. See also #21165.
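
For comparison, a rough sketch of the same throttling idea with a buffered channel used as a semaphore instead of sync.Cond (illustrative only, not the code in the CL; all names here are invented):

package forksketch

import (
    "sync"
    "syscall"
)

var (
    forkSlots = make(chan struct{}, 10) // admits at most 10 forks at a time
    forkingMu sync.Mutex
    forking   int // forks in flight, guarded by forkingMu
)

func beginFork() {
    forkSlots <- struct{}{} // blocks while 10 forks are already in flight
    forkingMu.Lock()
    if forking == 0 {
        // First forker takes the write lock on behalf of the whole group.
        syscall.ForkLock.Lock()
    }
    forking++
    forkingMu.Unlock()
}

func endFork() {
    forkingMu.Lock()
    forking--
    if forking == 0 {
        // Last forker out releases the write lock.
        syscall.ForkLock.Unlock()
    }
    forkingMu.Unlock()
    <-forkSlots
}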

@ianlancetaylor
Contributor

I've updated https://go.dev/cl/421441 to use a different approach, using internal secret knowledge about sync.RWMutex. Does anybody see a problem with this approach? Thanks.

@rittneje

@ianlancetaylor I am confused by the call to runtime.Gosched(). In order for the preceding call to ForkLock.RLock() to unblock, whatever was holding the write lock must have unlocked it. That means (I assume) that whatever calls ForkLock.Lock() next would have to wait for those readers to finish anyway.

@ianlancetaylor
Contributor

@rittneje Thanks, you're right, that's not needed. I was thinking that the RLock callers would be woken up but would still have to acquire the read lock, but that's not how it works. The read lock is already held when they are woken up. I will remove the Gosched call.

@rittneje

@ianlancetaylor I think there may still be a starvation potential hidden here. There is no guarantee that goroutines will acquire the (logical) fork lock in order. For example:

  1. Goroutine A calls ForkLock.Lock().
  2. Goroutine B calls ForkLock.RLock(), which blocks.
  3. Goroutine C tries to acquire fork lock. It sees there is a pending reader (B), so it waits.
  4. Goroutine A releases the write lock.
  5. Goroutine B acquires then releases the read lock.
  6. Goroutine D calls ForkLock.Lock().
  7. Goroutine E calls ForkLock.RLock(), which blocks.
  8. Goroutine C is finally scheduled, and tries to acquire fork lock again. Once again, it is thwarted by pending reader (E), so it waits.

In short, there is a possibility that a goroutine that is intending to fork keeps losing out to other goroutines indefinitely.

@ianlancetaylor
Contributor

@rittneje I don't think that can happen with the current RWMutex implementation. When goroutine C blocks trying to acquire the read lock, it increments readerCount before it blocks. The subsequent call to Lock in D will block until readerCount drops back to zero.

@ianlancetaylor
Contributor

To put it another way, I think that if starvation is possible with this CL, then it is possible with any use of RWMutex, and in particular it's possible today without this CL.

@rittneje

rittneje commented May 23, 2023

When goroutine C blocks trying to acquire the read lock, it increments readerCount before it blocks. The subsequent call to Lock in D will block until readerCount drops back to zero.

But that doesn't matter. Even if D blocks, when C gets scheduled it will immediately release the read lock, which unblocks D. Then, when C goes back to the top of the for loop, it will observe that ForkLock is already taken and will once again fall through to the pending-readers check.

I think the correct fix is to remove the for loop. Instead, it should unconditionally acquire the logical fork lock after waiting for pending readers once.
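
To illustrate that shape, a rough sketch (not the actual CL code: hasPendingReaders is a made-up stand-in for the internal sync.RWMutex knowledge the CL relies on, and forkingLock/forking are local to this sketch):

package forklock

import (
    "sync"
    "syscall"
)

var (
    forkingLock sync.Mutex
    forking     int // forks in flight, guarded by forkingLock
)

// hasPendingReaders stands in for the internal check the CL uses; there is
// no exported sync API that reports waiting readers.
func hasPendingReaders(rw *sync.RWMutex) bool { return false }

func acquireForkLock() {
    forkingLock.Lock()
    defer forkingLock.Unlock()

    if forking == 0 {
        // No fork in flight: take the real write lock.
        syscall.ForkLock.Lock()
        forking++
        return
    }

    // ForkLock is already held for writing on behalf of another fork.
    if hasPendingReaders(&syscall.ForkLock) {
        // Wait for the pending readers once: briefly give up the logical
        // write lock by taking and releasing a read lock ourselves.
        forkingLock.Unlock()
        syscall.ForkLock.RLock()
        syscall.ForkLock.RUnlock()
        forkingLock.Lock()
    }

    // Then acquire the logical fork lock unconditionally (no retry loop).
    if forking == 0 {
        syscall.ForkLock.Lock()
    }
    forking++
}

func releaseForkLock() {
    forkingLock.Lock()
    defer forkingLock.Unlock()
    forking--
    if forking == 0 {
        syscall.ForkLock.Unlock()
    }
}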

@ianlancetaylor
Contributor

Oh, sorry, I did misunderstand. I see what you mean. Done.

@rittneje

Thanks, I think it looks good now. One question I do have is whether it would be worth checking for pending writers in addition to pending readers. Otherwise, if there is code outside syscall doing ForkLock.Lock() (which is pretty unlikely but not impossible), it could get starved now.

@zhuangqh

Thanks, I think it looks good now. One question I do have is whether it would be worth checking for pending writers in addition to pending readers. Otherwise, if there is code outside syscall doing ForkLock.Lock() (which is pretty unlikely but not impossible), it could get starved now.

I think that is to be expected if the user misuses ForkLock.

@ianlancetaylor
Contributor

I'm OK with assuming that no program calls syscall.ForkLock.Lock(). Even if there are such programs, they will continue to work unless they also fork new processes continuously. So I'm willing to wait to see if anybody reports a bug.

@gopherbot

Change https://go.dev/cl/507355 mentions this issue: syscall: serialize locks on ForkLock on platforms where forkExecPipe is not atomic

gopherbot pushed a commit that referenced this issue Jul 10, 2023
syscall: serialize locks on ForkLock on platforms where forkExecPipe is not atomic

In CL 421441, we changed syscall to allow concurrent calls to
forkExec.

On platforms that support the pipe2 syscall that is the right
behavior, because pipe2 atomically opens the pipe with CLOEXEC already
set.

However, on platforms that do not support pipe2 (currently aix and
darwin), syscall.forkExecPipe is not atomic, and the pipes do not
initially have CLOEXEC set. If two calls to forkExec proceed
concurrently, a pipe intended for one child process can be
accidentally inherited by the other. If the process is long-lived, the
pipe can be held open unexpectedly and prevent the parent process from
reaching EOF reading the child's status from the pipe.

Fixes #61080.
Updates #23558.
Updates #54162.

Change-Id: I83edcc80674ff267a39d06260c5697c654ff5a4b
Reviewed-on: https://go-review.googlesource.com/c/go/+/507355
TryBot-Result: Gopher Robot <gobot@golang.org>
Reviewed-by: Ian Lance Taylor <iant@google.com>
Run-TryBot: Bryan Mills <bcmills@google.com>
Auto-Submit: Bryan Mills <bcmills@google.com>
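
For reference, a rough, Linux-flavored sketch of the difference the commit message describes (illustrative only; pipeAtomic and pipeTwoStep are invented names, not the actual forkExecPipe code):

package main

import "syscall"

// With pipe2, the pipe fds are created with CLOEXEC already set, so there
// is no window in which a concurrently forked child can inherit them.
func pipeAtomic(p []int) error {
    return syscall.Pipe2(p, syscall.O_CLOEXEC)
}

// Without pipe2 (e.g. aix, darwin), there is a window between Pipe and
// CloseOnExec in which a concurrently forked child can inherit the fds,
// so these two steps must be serialized against fork.
func pipeTwoStep(p []int) error {
    if err := syscall.Pipe(p); err != nil {
        return err
    }
    syscall.CloseOnExec(p[0])
    syscall.CloseOnExec(p[1])
    return nil
}

func main() {
    p := make([]int, 2)
    if err := pipeAtomic(p); err == nil {
        syscall.Close(p[0])
        syscall.Close(p[1])
    }
    _ = pipeTwoStep // two-step fallback shown for comparison only
}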

bradfitz pushed a commit to tailscale/go that referenced this issue Jul 15, 2023
syscall: serialize locks on ForkLock on platforms where forkExecPipe is not atomic
zhuangqh pushed a commit to zhuangqh/go that referenced this issue Aug 2, 2023
Fixes golang#23558
Fixes golang#54162

Change-Id: I3cf6efe466080cdb17e171218e9385ccb272c301
Reviewed-on: https://go-review.googlesource.com/c/go/+/421441
Run-TryBot: Ian Lance Taylor <iant@golang.org>
Auto-Submit: Ian Lance Taylor <iant@golang.org>
Reviewed-by: Brad Fitzpatrick <bradfitz@golang.org>
TryBot-Result: Gopher Robot <gobot@golang.org>
Reviewed-by: David Chase <drchase@google.com>
Reviewed-by: Bryan Mills <bcmills@google.com>
Reviewed-by: Michael Knyszek <mknyszek@google.com>
zhuangqh pushed a commit to zhuangqh/go that referenced this issue Oct 7, 2023