net: Buffers makes multiple Write calls on Writers that don't implement buffersWriter #21676

zombiezen · 2017-08-29T02:06:05Z

The writev syscall is supposed to act like a single write. The WriteTo method of net.Buffers will make a single write syscall on writers that have the unexported writeBuffers method. However, for writers that do not have such a method, it will call Write multiple times. This becomes significant if you are wrapping a *net.TCPConn without embedding, for instance, since it has different performance characteristics with respect to Nagle's algorithm. Frustratingly, since the writeBuffers method is unexported, there's no way for the application to know the behavior of Buffers.WriteTo in order to work around the issue.

Repro case: https://play.golang.org/p/rF0JRZs8z8

The text was updated successfully, but these errors were encountered:

odeke-em · 2017-08-29T06:00:53Z

/cc @mikioh @ianlancetaylor @rsc

ianlancetaylor · 2017-08-29T21:45:34Z

This is an interesting one. I think it's somewhat clear that calling the Write method should turn into a single write system call for low-level types. But this is the WriteTo method, and I don't think there has ever been such a guarantee for WriteTo. In general WriteTo writes all available data to the Writer argument, which can imply fetching more data. For example, the more-or-less canonical implementation of WriteTo, bufio.(*Reader).WriteTo, makes multiple Write calls. Similarly, the original implementation of WriteTo, in https://golang.org/cl/166041, used multiple Write calls.

But clearly for a low-level type it is desirable to minimize write calls, and when using net.Buffers it's inconvenient to not know whether you will get one Write call or several. So there does seem to be an argument that net.(*Buffers).WriteTo should copy the bytes and call Write once. But there is also a counter-argument that the whole point of net.Buffers is to avoid copying bytes, and so doing a copy anyhow seems like a bit of a trick.

gobwas · 2017-09-05T11:51:01Z

@ianlancetaylor hi! I think the problem is not in the WriteTo method, but in the writeBuffers. The point is that when some public function wants to cast its interface argument to some type that provides more efficient method to do same thing, that type and its method must be exported. I thing the good example of this is io.Copy function, that makes type assertion to io.WriterTo and io.ReaderFrom. net.Buffers.WriteTo does the similar thing, but with non-exported interface. This could bring some problems – I've tried to show them in duplicate of this issue. TL;DR: it is all about wrappers and method overloading.

ianlancetaylor · 2017-09-08T13:37:02Z

@gobwas You are describing a general problem that Go code can run into, but as far as I can see this particular problem will not be helped by exporting the writeBuffers method. That will let the calling code see whether it gets a single write or not, but it doesn't solve the basic question of whether we should always be using a single write. Unless, I suppose, we want to restrict this problem to make it possible to wrap a net.TCPConn while still getting a single write.

zombiezen · 2017-09-08T18:47:12Z

I understand the concern. I think that users of the API want to make the choice: not allocating might be important or not issuing more than one write might be important, depending on context. Exporting the interface would be a means of allowing the caller to use a type-assertion to determine capabilities of the io.Writer, much like other I/O capabilities.

Could we perhaps introduce a general I/O interface (consider this a sketch, nothing more):

// Writever wraps the Writev method.
//
// Writev behaves identically to a Write where all of the byte slices in p are concatenated together.
type Writever interface {
  Writev(p [][]byte) (n int, err error)
}

Then in usage:

func singleWritev(w io.Writer, p [][]byte) (n int, err error) {
  if wv, ok := w.(Writever); ok {
    return wv.Writev(p)
  }
  pp := bytes.Join(p, nil)
  return w.Write(pp)
}

func noallocWritev(w io.Writer, p [][]byte) (n int, err error) {
  if wv, ok := w.(Writever); ok {
    return wv.Writev(p)
  }
  for _, pp := range p {
    nn, err := w.Write(pp)
    n += nn
    if err != nil {
      return n, err
    }
  }
  return n, nil
}

I could see this also being done as an alternate method on net.Buffers, but I still don't see why net.Buffers restricts to particular types, instead of allowing any type that implements the Writev semantics to benefit.

rsc · 2017-10-23T21:04:17Z

Wait, does anyone use Nagle's algorithm anymore? I thought we turned that off on all our file descriptors.

zombiezen · 2017-10-24T16:03:59Z

@rsc It's off by default, but could be enabled by calling *TCPConn.SetNoDelay. It's not the only case where minimizing the number of Write calls is useful though.

rsc · 2017-10-30T20:21:33Z

I think it's probably too late for this as an API change in Go 1.10. Let's leave this for Go 1.11 and be able to discuss with @bradfitz. I think maybe a more compelling motivation than a custom TCP wrapper would be letting os.File implementations get the writev optimization too.

odeke-em · 2018-04-24T23:02:58Z

How's it going @bradfitz? I am just pinging you here as per @rsc's last comment :)

ianlancetaylor · 2018-05-08T14:06:06Z

See also #21756.

egorse · 2018-05-11T21:39:34Z

Step into this problem. net.Pipe impossible to use for tests with net.Buffers where there are multiple writers and single reader. Wire data became interleaved :(

rsc · 2018-11-14T19:04:35Z

For Go 1.13 we should look at this earlier in the cycle. The part about os.File maybe implementing this suggests that if we do expose an API it should not refer to types in package net. Perhaps just [][]byte directly.

noblehng · 2018-12-13T08:07:31Z

I think the bigger problem here is for datagram sockets.

The underlying system call guarantee a writev call will generate a single datagram, so (*net.Buffers).WriteTo will have different behavior when using bare *net.UDPConn vs wrapped, that is single datagram vs multiple.

Edit:
Even for stream sockets, it is not atomic as the underlying system call does. If there are concurrent writers, the result will be interleaved.

As for os.File usage, I think the Buffers type and related interfaces should be in the io package, ~~os.File should also implement syscall.Conn interface~~.

The current net.Buffers is also a little hard for reuse, because it modify the slice directly, caller have to keep a own copy for reuse. Use bare [][]byte and let caller do the consume tracking, or use a ring buffer could be easier and less copying.

Edit2:
Forget about the implement syscall.Conn interface part. I am thinking about wiring to the runtime poller, but it doesn't register to the runtime poller to begin with, except for those sockets already in net.conn.

Merovius · 2019-12-18T13:56:21Z

As another voice: I'm particularly interested in what @rsc mentioned - having Writev for *os.File. My use-case is a write-ahead log, which adds a header and footer to some user-provided data. File is opened with O_APPEND, so there is a semantic difference between one and multiple writes. Currently I allocate a buffer and copy everything, but I'd like to avoid that.

Personally I like the interface @zombiezen wrote down above and I would put it in io. This would be akin to io.StringWriter/WriterTo/WriterAt/… where the io package defines multiple interfaces to make an io.Writer more efficient in some circumstances and then also offers functions like io.WriteString with the natural fallbacks.

seebs · 2019-12-18T15:23:28Z

I would love to have access to readv and writev from Go, but I would point out that there's no way to guarantee those semantics generically across arbitrary operating systems. But the ability to get those atomic writes from multiple buffers is a HUGE performance win for a lot of applications.

ianlancetaylor · 2019-12-18T19:23:15Z

Although this issue only has 15 comments they all seem to head in different directions. I think that if the request is a Writev method for *os.File, perhaps only available on systems that support writev, then we should open a new proposal issue for that.

tv42 · 2019-12-19T18:04:37Z

I'm not convinced there's any scenario in which concurrent writers without holding a mutex is safe. The write(2) syscall can make short writes on e.g. interrupts, and the Go Write implementation will need a for loop around that -> concurrent Writes can interleave anyway.

Writev matters for datagrams and performance.

gobwas · 2020-03-03T11:11:37Z

It didn't shipped with Go 1.14, right? Any plans to include that Writever interface by @zombiezen ?

stokito · 2022-01-08T13:32:11Z

@tv42 "I'm not convinced there's any scenario in which concurrent writers without holding a mutex is safe. The write(2) syscall can make short writes on e.g. interrupts"
I'm not an expert but it looks like the writev is atomic by design
https://en.wikipedia.org/wiki/Vectored_I/O

Merovius · 2022-01-08T23:54:14Z

@stokito ISTM that the meaning of "atomic" is under-specified there. Both the text in the wikipedia and the information from the manpage seem to suggest that it doesn't mean "either all writes succeed or none of them", but just that writes by different processes are not interleaved. i.e. it seems it refers to the isolation of ACID, not the atomicity. Also, from what I can tell, the actual POSIX standard does not guarantee even that, unless writes are less than PIPE_BUF in size ([1] [2]).

In my experience, interpreting what guarantees the POSIX standard really makes is very subtle. And what is actually implemented even more so. Personally, I got convinced by the argument that short writes can happen, at least for my usecase.

ianlancetaylor added this to the Go1.10 milestone Aug 29, 2017

odeke-em mentioned this issue Sep 4, 2017

net: export writeBuffers() and buffersWriter #21756

Closed

ianlancetaylor added the NeedsDecision Feedback is required from experts, contributors, and/or the community before a change can be made. label Sep 8, 2017

Zeymo mentioned this issue Sep 26, 2017

Use net.Buffers.WriteTo to reduce Syscall and memcopy grpc/grpc-go#1540

Closed

gobwas mentioned this issue Sep 28, 2017

ws.Dial() looses frames from server gobwas/ws#19

Closed

rsc modified the milestones: Go1.10, Go1.11 Oct 30, 2017

ianlancetaylor modified the milestones: Go1.11, Go1.12 Jun 27, 2018

rsc modified the milestones: Go1.12, Go1.13 Nov 14, 2018

rsc added the early-in-cycle A change that should be done early in the 3 month dev cycle. label Nov 14, 2018

ploxiln mentioned this issue Jan 9, 2019

nsq_to_file: check for rotate size specifically in rev-incr loop nsqio/nsq#1123

Merged

andybons modified the milestones: Go1.13, Go1.14 Jul 8, 2019

rsc modified the milestones: Go1.14, Backlog Oct 9, 2019

Merovius mentioned this issue Dec 18, 2019

x/sys/unix: Add wrapper for readv/writev #36201

Closed

cbandy mentioned this issue Feb 22, 2022

net: Buffers.WriteTo is prone to memory leaks #45163

Closed

MattBrittan mentioned this issue Mar 7, 2022

found that the message is not safe to write eclipse-paho/paho.golang#81

Closed

pascaldekloe mentioned this issue Feb 12, 2023

Utilize writev(2) in SimpleWriter pascaldekloe/seeq#1

Closed

Jorropo mentioned this issue Jul 29, 2024

proposal: io,net: add WriteMany interface #68625

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

net: Buffers makes multiple Write calls on Writers that don't implement buffersWriter #21676

net: Buffers makes multiple Write calls on Writers that don't implement buffersWriter #21676

zombiezen commented Aug 29, 2017

odeke-em commented Aug 29, 2017

ianlancetaylor commented Aug 29, 2017

gobwas commented Sep 5, 2017

ianlancetaylor commented Sep 8, 2017

zombiezen commented Sep 8, 2017 •

edited

Loading

rsc commented Oct 23, 2017

zombiezen commented Oct 24, 2017

rsc commented Oct 30, 2017

odeke-em commented Apr 24, 2018

ianlancetaylor commented May 8, 2018

egorse commented May 11, 2018

rsc commented Nov 14, 2018

noblehng commented Dec 13, 2018 •

edited

Loading

Merovius commented Dec 18, 2019

seebs commented Dec 18, 2019

ianlancetaylor commented Dec 18, 2019

tv42 commented Dec 19, 2019 •

edited

Loading

gobwas commented Mar 3, 2020

stokito commented Jan 8, 2022

Merovius commented Jan 8, 2022

net: Buffers makes multiple Write calls on Writers that don't implement buffersWriter #21676

net: Buffers makes multiple Write calls on Writers that don't implement buffersWriter #21676

Comments

zombiezen commented Aug 29, 2017

odeke-em commented Aug 29, 2017

ianlancetaylor commented Aug 29, 2017

gobwas commented Sep 5, 2017

ianlancetaylor commented Sep 8, 2017

zombiezen commented Sep 8, 2017 • edited Loading

rsc commented Oct 23, 2017

zombiezen commented Oct 24, 2017

rsc commented Oct 30, 2017

odeke-em commented Apr 24, 2018

ianlancetaylor commented May 8, 2018

egorse commented May 11, 2018

rsc commented Nov 14, 2018

noblehng commented Dec 13, 2018 • edited Loading

Merovius commented Dec 18, 2019

seebs commented Dec 18, 2019

ianlancetaylor commented Dec 18, 2019

tv42 commented Dec 19, 2019 • edited Loading

gobwas commented Mar 3, 2020

stokito commented Jan 8, 2022

Merovius commented Jan 8, 2022

zombiezen commented Sep 8, 2017 •

edited

Loading

noblehng commented Dec 13, 2018 •

edited

Loading

tv42 commented Dec 19, 2019 •

edited

Loading