encoding/base32: decoder silently ignores unpadded trailing characters, depending on the chunking of the underlying Reader #25296

AxbB36 · 2018-05-08T20:37:47Z

The Reader returned by base32.NewDecoder sometimes fails to signal an error when its input is improperly padded (length not a multiple of 8 bytes). Whether an error is signaled depends on how the underlying Reader segments its output. If its final read is aligned to an 8-byte boundary and contains only the final unpadded block, then the package does the (IMO) right thing and returns io.ErrUnexpectedEOF. Otherwise, the package silently ignores the extraneous characters and returns io.EOF (i.e., successful completion of decoding).

The example program demonstrates this with the 18-byte input "NBSWY3DPO5XXE3DEZZ". When the final read is of "ZZ", it returns io.ErrUnexpectedEOF. But when the final read is "EZZ", "DEZZ", or anything else, it returns io.EOF. I would expect it to return io.ErrUnexpectedEOF in all cases. (A case could be made for returning base32.CorruptInputError, like base32.StdEncoding.DecodeString does, but at any rate the error should be something other than io.EOF.)
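For reference, here is a minimal sketch of the block arithmetic behind "improperly padded", assuming the padded standard alphabet (base32.StdEncoding). The helper name `leftover` is hypothetical and is not part of encoding/base32; it only counts the characters that fall outside the last complete 8-character block.

```go
package main

import (
	"encoding/base32"
	"errors"
	"fmt"
)

// leftover is a hypothetical helper (not part of encoding/base32).
// It reports how many characters of a padded base32 string fall outside
// the last complete 8-character block. Zero means the length is well formed.
func leftover(encoded string) int {
	return len(encoded) % 8
}

func main() {
	const input = "NBSWY3DPO5XXE3DEZZ"

	// Two full blocks (16 characters) plus the stray "ZZ".
	fmt.Println("length:", len(input), "leftover:", leftover(input))

	// The non-streaming API rejects the same input outright.
	_, err := base32.StdEncoding.DecodeString(input)
	var corrupt base32.CorruptInputError
	fmt.Println("DecodeString reports CorruptInputError:", errors.As(err, &corrupt))
}
```

The stray characters are the ones the streaming decoder either reports or drops, depending on how they arrive.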

This bug was originally reported on golang-nuts, in 2014 for go1.3.2. There was some discussion but it never got fixed. The thread mentions that base64 may be similarly affected, but I didn't check.

What version of Go are you using (`go version`)?

go version go1.10.1 linux/amd64

Does this issue reproduce with the latest release?

Yes, with go1.10.2 on play.golang.org.

What operating system and processor architecture are you using (`go env`)?

GOARCH="amd64"
GOHOSTARCH="amd64"
GOHOSTOS="linux"
GOOS="linux"

What did you do?

https://play.golang.org/p/Bya6YSpMJrB

```go
package main

import (
	"encoding/base32"
	"fmt"
	"io"
	"io/ioutil"
)

func test(chunks []string) {
	fmt.Printf("\n")
	fmt.Printf("%q\n", chunks)

	pr, pw := io.Pipe()

	// Write the encoded chunks into the pipe.
	go func() {
		for _, chunk := range chunks {
			pw.Write([]byte(chunk))
		}
		pw.Close()
	}()

	// Decode base32 from the read end of the pipe.
	decoder := base32.NewDecoder(base32.StdEncoding, pr)
	data, err := ioutil.ReadAll(decoder)
	if err == nil {
		// Restore the EOF that ReadAll elides.
		err = io.EOF
	}
	fmt.Printf("%q %v\n", data, err)
}

func main() {
	// DecodeString returns base32.CorruptInputError.
	s, err := base32.StdEncoding.DecodeString("NBSWY3DPO5XXE3DEZZ")
	fmt.Printf("DecodeString(%q)\n", "NBSWY3DPO5XXE3DEZZ")
	fmt.Printf("%q %v\n", s, err)

	// These chunkings return io.ErrUnexpectedEOF.
	test([]string{"NBSW", "Y3DP", "O5XX", "E3DE", "ZZ"})
	test([]string{"NBSWY3DPO5XXE3DE", "ZZ"})

	// These chunkings return io.EOF (the bug).
	test([]string{"NBSWY3DPO5XXE3DEZZ"})
	test([]string{"NBS", "WY3", "DPO", "5XX", "E3D", "EZZ"})
	test([]string{"NBSWY3DPO5XXE3", "DEZZ"})
}
```

What did you expect to see?

```
DecodeString("NBSWY3DPO5XXE3DEZZ")
"helloworld" illegal base32 data at input byte 16

["NBSW" "Y3DP" "O5XX" "E3DE" "ZZ"]
"helloworld" unexpected EOF

["NBSWY3DPO5XXE3DE" "ZZ"]
"helloworld" unexpected EOF

["NBSWY3DPO5XXE3DEZZ"]
"helloworld" unexpected EOF

["NBS" "WY3" "DPO" "5XX" "E3D" "EZZ"]
"helloworld" unexpected EOF

["NBSWY3DPO5XXE3" "DEZZ"]
"helloworld" unexpected EOF
```

What did you see instead?

```
DecodeString("NBSWY3DPO5XXE3DEZZ")
"helloworld" illegal base32 data at input byte 16

["NBSW" "Y3DP" "O5XX" "E3DE" "ZZ"]
"helloworld" unexpected EOF

["NBSWY3DPO5XXE3DE" "ZZ"]
"helloworld" unexpected EOF

["NBSWY3DPO5XXE3DEZZ"]
"helloworld" EOF

["NBS" "WY3" "DPO" "5XX" "E3D" "EZZ"]
"helloworld" EOF

["NBSWY3DPO5XXE3" "DEZZ"]
"helloworld" EOF
```

josharian · 2018-05-08T21:40:08Z

cc @zegl

This changes decoder.Read to always return io.ErrUnexpectedEOF if the input contains surplus padding or unexpected content. Previously the error could be io.EOF or io.ErrUnexpectedEOF depending on how the input was chunked. Fixes golang#25296

gopherbot · 2018-05-09T21:02:30Z

Change https://golang.org/cl/112516 mentions this issue: encoding/base32: handle surplus padding consistently

zegl · 2018-05-09T21:08:26Z

Changing the behavior of decoder.Read is a bit risky, as some inputs will now cause ioutil.ReadAll to return a non-nil error where it previously returned nil. I definitely think that the CL should be merged anyway, as the current behavior is very surprising.

Thanks for two very nice bug reports @AxbB36! 👏

josharian changed the title ~~base32 decoder silently ignores unpadded trailing characters, depending on the chunking of the underlying Reader~~ encoding/base32: decoder silently ignores unpadded trailing characters, depending on the chunking of the underlying Reader May 8, 2018

josharian added this to the Go1.11 milestone May 8, 2018

zegl mentioned this issue May 9, 2018

encoding/base32: handle surplus padding consistently #25319

Closed

ianlancetaylor added the NeedsFix The path to resolution is known, but the work has not been done. label May 9, 2018

gopherbot closed this as completed in 0f2d4d0 May 9, 2018

AxbB36 mentioned this issue May 10, 2018

encoding/base32: buffered decoder expects padding even when NoPadding is set #25332

Closed

AxbB36 mentioned this issue Apr 23, 2019

encoding/base64: decoder output depends on chunking of underlying reader #31626

Open

golang locked and limited conversation to collaborators May 9, 2019

gopherbot added the FrozenDueToAge label May 9, 2019
