proposal: sync/v2: prohibit unlocking mutex in a different goroutine #9201

dvyukov · 2014-12-04T08:50:11Z

sync.Mutex allows lock/unlock in different goroutines:

http://golang.org/pkg/sync/#Mutex.Unlock
"A locked Mutex is not associated with a particular goroutine. It is allowed for
one goroutine to lock a Mutex and then arrange for another goroutine to unlock it."

And the same for RWMutex:
http://golang.org/pkg/sync/#RWMutex.Unlock
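
For illustration, here is a minimal sketch of what the quoted documentation permits today: one goroutine locks the mutex and a different goroutine unlocks it. The function name `handOff` and the variable `shared` are made up for this example and do not come from the issue.

```go
package main

import (
	"fmt"
	"sync"
)

// handOff locks mu in the calling goroutine and lets a second goroutine
// unlock it. It returns the value the second goroutine wrote while the
// mutex was held.
func handOff() int {
	var mu sync.Mutex
	shared := 0

	mu.Lock() // locked by the calling goroutine
	go func() {
		shared = 42
		mu.Unlock() // unlocked by a different goroutine: legal with sync.Mutex today
	}()

	mu.Lock() // blocks until the other goroutine has unlocked
	defer mu.Unlock()
	return shared
}

func main() {
	fmt.Println(handOff())
}
```

Under the proposed rule, the `Unlock` inside the spawned goroutine would no longer be allowed.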

The possibility to unlock the mutex in a different goroutine is very rarely used in real
code. And if you really need something more complex than lock/unlock in a single
goroutine, you can always use channels.

But it creates several problems:

1. Deadlock detection becomes impossible, as there is no notion of "critical
sections".
2. Similarly, static lock annotations become impossible for the same reason.
3. Optimizations like hardware lock elision (see e.g. Intel HLE) become impossible.
4. Another potential optimization that becomes impossible is priority boosting inside critical sections. Namely, if a goroutine is preempted inside a critical section, the scheduler may give it another tiny time slot to finish the critical section (or perhaps it can then voluntarily switch in Unlock). Solaris did this for a long time.

We should prohibit unlocking a mutex in a different goroutine in Go 2.

davecheney · 2014-12-04T09:05:38Z

Comment 1:

For Go 1.x, could detection of unlocking from another goroutine be added as an
experimental runtime feature, similar to the memory fence code that was added a while back?

dvyukov · 2014-12-04T10:07:13Z

Comment 2:

Detect and then what?

davecheney · 2014-12-04T10:13:07Z

Comment 3:

Panic, I guess. This would only be an experimental mode, like the memory fence checker.

dvyukov · 2014-12-04T10:28:05Z

Comment 4:

What's the profit here?

davecheney · 2014-12-04T10:31:58Z

Comment 5:

It sounds like this is already a programming error; I would like an opportunity to
build a version of Go that detects this.

cznic · 2014-12-04T10:38:13Z

Comment 6:

Dmitry, I think the idea is too close to the situation in some other
languages, which prohibit calling certain code from anything other than a specific/privileged thread.
IMHO, goroutines having no implicit user-facing identity is an important property, and I
suggest we keep it.

dvyukov · 2014-12-04T10:39:09Z

Comment 7:

It's not a programming error today. The docs say:
http://golang.org/pkg/sync/#Mutex.Unlock
"A locked Mutex is not associated with a particular goroutine. It is allowed for one
goroutine to lock a Mutex and then arrange for another goroutine to unlock it."
Also it can be in third-party code. So even if you detect it, you may not be able to
change it.

dvyukov · 2014-12-04T10:45:33Z

Comment 8:

Re #6: what you say makes sense, but I think that the benefits do not outweigh the drawbacks in
this case. It does not look exactly the same as restricting all GUI calls to the main
thread. And we do not take this possibility away; more complex patterns are still
possible with channels. Mutex is for:
mu.Lock()
defer mu.Unlock()
// something

ianlancetaylor · 2014-12-04T17:07:18Z

Comment 9:

It should be easy enough for the compiler and/or static analysis tools to see, for most
uses of mutexes, that the mutex is locked and unlocked in the same goroutine.  So I
believe that HLE is entirely implementable in the common case, which is the only case in
which it can plausibly be used anyhow.
It should also be straightforward for the compiler to annotate lock/unlock in the same
goroutine for the benefit of dynamic analysis, if that seems useful.
Not that I'm necessarily opposed to this idea, but I think you need stronger arguments.

Labels changed: added repo-main, release-none.

dvyukov · 2014-12-04T17:17:15Z

Comment 10:

Is it really easy to do?
I see at least 3 problems:
1. If you do obj.subobj.mu.Lock/obj.subobj.mu.Unlock, the compiler must prove that it's the
same mutex in both cases, that is, that the obj.subobj pointer has not changed. Since these
objects are shared (they contain a mutex), it may not be that simple.
2. If the compiler sees:
mu.Lock()
...
mu.Unlock()
and proves mu refers to the same object, it can be the case that the situation is
actually:
mu.Lock()
...
// pass responsibility to unlock to a different goroutine
...
// receive responsibility to unlock from yet another goroutine
...
mu.Unlock()
3. Non-trivial control flow like:
func foo() *T {
  t := ...
  ...
  if ... {
    t.mu.Lock()
  }
  ...
  return t
}
t := foo()
...
if ... {
  t.mu.Unlock()
}
...

ianlancetaylor · 2014-12-04T17:34:55Z

Comment 11:

If you can't resolve those issues anyhow, I don't understand how you can use HLE.  That
is, I don't see how the restriction that the mutex is manipulated from a single
goroutine is sufficient to use HLE in cases where you can't resolve the issues you
describe.

dvyukov · 2014-12-04T17:52:43Z

Comment 12:

There is nothing to resolve. With the new rules, every lock must have a matching unlock. Even
if the identity of obj.subobj.mu has changed in the function, there still must be a matching
unlock for the original value of obj.subobj.mu and a matching lock for the new value of
obj.subobj.mu somewhere else. For HLE and dynamic analysis you don't need to find the pairs
statically; they will just manifest themselves at runtime.

ianlancetaylor · 2014-12-04T18:10:18Z

Comment 13:

I guess I don't grasp how that would work.  You lock a mutex presumably with an XACQUIRE
instruction.  You make a system call.  You block.  The scheduler runs.  You unblock. 
You come back in a different thread.  You unlock the mutex with an XRELEASE instruction.
 The unlock fails.  Execution resumes back at the XACQUIRE lock.  But you're running on
a different core.  Can that really work?
My point is that you need to know what happens between the XACQUIRE and the XRELEASE. 
Or so it seems to me.

dvyukov · 2014-12-04T18:17:03Z

Comment 14:

XACQUIRE/XRELEASE transparently fall back on real locking if transactions fail, using some
heuristics.
If you use RTM (Restricted Transactional Memory), then you fall back to locking manually,
using explicit heuristics, if you see something that you can't handle (e.g. a syscall).
But this happens at runtime; no prior static knowledge is required.
Intel prototyped libpthread support for HLE where you could switch any C program that
uses pthread_mutex to HLE. And it worked.

ianlancetaylor · 2014-12-04T18:37:32Z

Comment 15:

OK, if that is how it works, then why can't we use the same heuristics to fall back if
the mutex unlock happens in a different goroutine?  By definition we must be on the same
core/thread, or we would have already done a fallback when we entered the scheduler.
(To be clear, I'm not saying you're wrong, I'm just saying that I don't understand.)

dvyukov · 2014-12-04T18:53:47Z

Comment 16:

Yes, we can fall back on normal locking with the current mutex semantics. It will work.
But good heuristics are critical for efficient HLE. With the proposed new semantics we
permanently fall back to locking iff we encounter something that we can't handle within
a transaction (e.g. a syscall). But an unlock in a different thread will look like normal
contention (it will be detected earlier, when the goroutine passes responsibility to
unlock the mutex to a different goroutine), so the condition for permanent fallback to locking
becomes moot; and if we don't fall back (always try to execute transactionally several
times first), we will waste lots of work.
Static analysis doesn't have dynamic information, so it won't work for static analysis.
Dynamic analysis may or may not work; I don't know yet how to implement it.

ghasemloo · 2017-07-12T00:37:33Z

@dvyukov, we had a case for passing locked objects around in C++: we didn't want the object to be unlocked while it was being passed around. So I think there are cases where we might want to lock in one goroutine and unlock in another. Feel free to email me for details.

dvyukov · 2017-07-12T05:30:04Z

@ghasemloo I believe there are cases like this, but it does not mean you have to use Mutex for them. chan will do just fine.
Note: it is absolutely illegal to lock/unlock almost all C++ mutexes in different threads (e.g. pthread_mutex_lock, EnterCriticalSection, std::mutex, etc). And if you build your own semaphore and call it a mutex, you can do the same in Go as well.
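
One way to read "chan will do just fine" is a buffered channel used as a binary semaphore. The following is only a sketch under that assumption; the type `chanLock`, the `account` struct and the `deposit` function are hypothetical names, not anything proposed in this thread.

```go
package main

import "fmt"

// chanLock is a hypothetical binary semaphore built on a buffered channel.
// Because it is not a sync.Mutex, releasing it from another goroutine would
// stay legal even if the proposal were adopted.
type chanLock chan struct{}

func newChanLock() chanLock { return make(chanLock, 1) }

// Acquire blocks until the single slot in the channel is free.
func (l chanLock) Acquire() { l <- struct{}{} }

// Release frees the slot; any goroutine may call it.
func (l chanLock) Release() { <-l }

type account struct {
	lock    chanLock
	balance int
}

// deposit acquires the lock in the caller, passes the locked account to a
// worker goroutine, and lets the worker release the lock when it is done.
func deposit(a *account, n int) int {
	a.lock.Acquire()
	done := make(chan struct{})
	go func() {
		defer close(done)
		a.balance += n
		a.lock.Release() // released in a different goroutine than Acquire
	}()
	<-done

	a.lock.Acquire()
	defer a.lock.Release()
	return a.balance
}

func main() {
	a := &account{lock: newChanLock()}
	fmt.Println(deposit(a, 10))
}
```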

pciet · 2017-12-20T20:42:45Z

Here's a solution to a deadlock case I had recently that requires unlocking in a different goroutine:

	// if the lock can't be acquired we have to try a read
	// here because the notifier could be holding it waiting to send
	//
	// this won't work serially: lock blocks us from 
	// trying to read and trying to read first gives 
	// the notifier a chance to lock before we do
	acq := make(chan struct{})
	go func() {
		gameMonitorsLock.Lock()
		acq <- struct{}{}
	}()
OUTER2:
	for {
		select {
		case <-channels.move:
		case <-channels.done:
		case <-acq:
			break OUTER2
		}
	}
	delete(gameMonitors, gameid)
	gameMonitorsLock.Unlock()

One of the senders called in an independent HTTP POST handling function:

go func() {
	gameMonitorsLock.RLock()
	c, has := gameMonitors[g.ID]
	if has {
		c.move <- time.Now()
	}
	gameMonitorsLock.RUnlock()
}()

Normally a send on c.move causes some reloads and checks of a changed database row, but in this case the race is where a move happens while a timeout detection (triggered by a timer instead of an HTTP request) is causing a teardown of the goroutine.

I don't know the quality of my design, but here's the one case I have where lock/unlock has to happen in separate goroutines.

dvyukov · 2017-12-21T08:56:29Z

@pciet no, it doesn't require unlocking a Mutex in a different goroutine. See the previous comment:
#9201 (comment)

ianlancetaylor · 2018-01-03T22:33:44Z

@griesemer suggests that perhaps instead of changing the existing sync.Mutex type, we should introduce a different kind of construct. For example, we could add a new method to sync.Mutex, Critical(section func()), which evaluates the function with the mutex held. That would provide the guarantee you are looking for, and in some cases might be easier and safer for people to use than separate Lock and Unlock calls.
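
To make the suggestion concrete, here is a rough sketch of what such a method could look like. `sync.Mutex` has no `Critical` method, so the sketch uses a hypothetical wrapper type, `CriticalMutex`; the helper `count` exists only to exercise it.

```go
package main

import (
	"fmt"
	"sync"
)

// CriticalMutex is a hypothetical wrapper type; it is not part of the
// sync package.
type CriticalMutex struct {
	mu sync.Mutex
}

// Critical runs section with the mutex held. Lock and Unlock are always
// called by the same goroutine, the one that calls Critical.
func (m *CriticalMutex) Critical(section func()) {
	m.mu.Lock()
	defer m.mu.Unlock()
	section()
}

// count increments a shared counter from several goroutines, each inside
// a Critical call, and returns the final value.
func count(workers int) int {
	var m CriticalMutex
	var wg sync.WaitGroup
	n := 0
	for i := 0; i < workers; i++ {
		wg.Add(1)
		go func() {
			defer wg.Done()
			m.Critical(func() { n++ })
		}()
	}
	wg.Wait()
	return n
}

func main() {
	fmt.Println(count(100))
}
```

Because the wrapper does not export Lock and Unlock, callers cannot split a critical section across goroutines, which is the guarantee the proposal is after.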

dvyukov · 2018-01-04T08:33:02Z

Unless switching to Go 2 includes rewriting all Go code from scratch (as far as I understand, it doesn't), this won't give any significant benefit. If Lock/Unlock stay and provide the current semantics, we may as well not do this at all. The point is that 99% of Mutex uses already comply.

dvyukov · 2018-01-04T08:36:11Z

Another potential optimization that becomes possible is priority boosting inside critical sections. Namely, if a goroutine is preempted inside a critical section, the scheduler may give it another tiny time slot to finish the critical section (or perhaps it can then voluntarily switch in Unlock). Solaris did this for a long time.

ianlancetaylor · 2018-01-04T20:30:35Z

If people convert over time to a Critical method, then it seems to me that all of your suggested improvements apply to uses of that method.

On the other hand, if we change the meaning of Lock and Unlock, then we break the 1% of programs that do not follow the new guidelines. That's only going to be OK if we can reliably statically detect the programs that will break, which as far as I can see we can't. So I don't see how we can implement this proposal as written. We can introduce a Critical method. We can introduce a new kind of Mutex that only permits locking and unlocking in the same goroutine. But I don't think we can change sync.Mutex. Or, to put it a different way, the cost of changing sync.Mutex is high; we need a correspondingly high benefit, and I don't see it here.

nathanjsweet · 2018-10-08T13:38:31Z

You could create your own trylock pretty simply:
https://play.golang.org/p/DcITzWTNlrD
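
The playground code is not reproduced in this thread, so the following is not that code. It is a separate minimal sketch of one common way to get try-lock behaviour, using a buffered channel and a non-blocking send; the type name `tryLock` is made up for this example. Since Go 1.18, `sync.Mutex` also has a `TryLock` method.

```go
package main

import "fmt"

// tryLock is a hypothetical lock built on a buffered channel of capacity 1.
type tryLock chan struct{}

func newTryLock() tryLock { return make(tryLock, 1) }

// Lock blocks until the lock is free.
func (l tryLock) Lock() { l <- struct{}{} }

// Unlock releases the lock.
func (l tryLock) Unlock() { <-l }

// TryLock acquires the lock if it is free and reports whether it did so.
// It never blocks.
func (l tryLock) TryLock() bool {
	select {
	case l <- struct{}{}:
		return true
	default:
		return false
	}
}

func main() {
	l := newTryLock()
	fmt.Println(l.TryLock()) // lock is free
	fmt.Println(l.TryLock()) // lock is already held
	l.Unlock()
}
```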

dvyukov added new v2 labels Dec 4, 2014

bradfitz removed the new label Dec 18, 2014

rsc added this to the Unplanned milestone Apr 10, 2015

rsc removed release-none labels Apr 10, 2015

rsc changed the title ~~sync: prohibit unlocking mutex in a different goroutine~~ proposal: sync: prohibit unlocking mutex in a different goroutine Jun 17, 2017

ianlancetaylor added the NeedsInvestigation label Jan 3, 2018

gfr10598 mentioned this issue Jun 15, 2018

Ndt sandbox async flush cleanup m-lab/etl#523

Merged

bcmills mentioned this issue Mar 7, 2019

Read-locking shouldn't hang if thread has already a write-lock? #30657

Closed

navytux mentioned this issue Jun 3, 2020

proposal: sync: Add UnlockToRLock() to RWMutex #38891

Closed

bcmills mentioned this issue Sep 22, 2020

sync: deemphasize goroutines in RWMutex documentation #41555

Closed

zwass mentioned this issue Jun 7, 2023

Add context and lock functionality to client interface osquery/osquery-go#108

Merged

mafredri mentioned this issue Nov 28, 2023

fix: avoid data race in session signal channel register/deregister coder/ssh#5

Merged

ianlancetaylor changed the title ~~proposal: sync: prohibit unlocking mutex in a different goroutine~~ proposal: sync/v2: prohibit unlocking mutex in a different goroutine Aug 6, 2024

ianlancetaylor added Proposal and removed NeedsInvestigation labels Aug 6, 2024

ianlancetaylor modified the milestones: Unplanned, Proposal Aug 6, 2024
