encoding/csv: add the true number of lines to ErrFieldCount #6770

btracey · 2013-11-15T18:37:40Z

In Reader.Read(), if the true record length is not equal to the FieldsPerRecord field,
an ErrFieldCount error is returned. It would be nice if this error also included the
expected and found number of lines. So, for example, instead of the error string being
"wrong number of fields in line", it would be instead "wrong number of
fields in line. 15 read, 10 expected". The specific problem I encountered was that
my csv file had an extra delimiter at the end of the line, so one more record was read
than I expected. If the error message had also listed that there was one more record
than expected it would have been much easier to debug.

robpike · 2013-11-16T18:42:35Z

Comment 1:

Perhaps. I would like to resist using error strings as API, though. They exist to tell
you what's wrong, but you're asking them to tell you what's right. I think that's a bad
precedent.

Labels changed: added priority-someday, removed priority-triage.

Status changed to Thinking.

btracey · 2013-11-18T22:28:17Z

Comment 2:

Why is a bad precedent? I can see that we don't want to encourage blind changing of
numbers to pass error checks. However, in this particular case, I don't think this error
message change would tell you what is right, it would just be more specific about what
is wrong. I understand the criticism with the word "expected", but the intent of the
issue is to help the user understand what went wrong.

btracey · 2013-11-18T22:40:10Z

Comment 3:

As an addendum, I don't see the difference between this case (returning the number of
fields set and the number of fields read) and, for example, returning the filename that
couldn't be opened in os.Open. os.Open could just return "no such file or directory",
but the filename is added so it's easier to tell if a file is unexpectedly not there, or
if the filename passed to os.Open was incorrect.

rsc · 2013-11-27T18:44:28Z

Comment 4:

Labels changed: added go1.3maybe.

rsc · 2013-12-04T01:31:31Z

Comment 5:

Labels changed: added release-none, removed go1.3maybe.

rsc · 2013-12-04T01:50:32Z

Comment 6:

Labels changed: added repo-main.

nussjustin · 2017-04-28T11:07:17Z

I think the easiest fix here is to add a new field "Field" to ParseError, which would contain the number of the field where the error occured (e.g. a,b"", c would report Field == 2), which for the ErrFieldCount case would contain the number of fields.

/cc @bradfitz

bradfitz · 2017-04-28T18:08:45Z

So Field would be 1-based and ParseError.Field == 0 would mean no information?

How does that solve the original problem when r.FieldsPerRecord is 10 but 15 rows were read. You would report Field == 11? or 15?

nussjustin · 2017-04-28T18:25:01Z

Both 11 and 15 could make sense, although I think 11 makes more sense especially since in the ErrFieldCount case Read also returns the record along with the error, so the user can get the field count using len(record) even now.

15 would make sense as the code always reads the whole record before checking the field count. Although this could be seen as an implementation detail it is exposed as the returned slice.

I'm personally fine with both.

Field could also be 0-based. This basically only matters for the ErrFieldCount case as it's the only case where the user has an error and the records read. It would make accessing the fields more simpler (no -1 required), but I think that's all it does.

bradfitz · 2017-04-28T18:28:24Z

The problem with 0-based (even if it makes more sense) is then you either have to always use it, or document on on the new ParseError.Field docs when it's defined and when it's not. Saying "when it's non-zero" is much easier than enumerating a list of reasons.

nussjustin · 2017-04-28T18:36:29Z

I agree with you that the 1-based variant is easier.

I'm still unsure about the value of Field for ErrFieldCount, but am leaning towards FieldsPerRecord+1. What's your opinion on this?

bradfitz · 2017-04-28T18:44:49Z

especially since in the ErrFieldCount case Read also returns the record along with the error, so the user can get the field count using len(record) even now.

If the user can already get 15 and they set 10, I think this whole bug is kinda useless. Who cares if we report 11, 12, 13, 14, or 15? The user set 10 and we returned len() of 15 and ErrFieldCount. Can't they draw their own conclusions?

nussjustin · 2017-04-28T19:08:21Z

Yes. It would probably make more sense for the non-ErrFieldCount cases where the field could be more important and the user has just the column (rune index) of the error, although I can't currently think of a real use case where just the column isn't enough. I only thought about the possibility of using len(record) when writing my second comment.

So yeah, this whole bug is really kinda useless in retrospective.

btracey added Thinking priority-someday labels Dec 4, 2013

rsc added this to the Unplanned milestone Apr 10, 2015

rsc removed priority-someday labels Apr 10, 2015

bradfitz closed this as completed Apr 28, 2017

golang locked and limited conversation to collaborators Apr 28, 2018

gopherbot added the FrozenDueToAge label Apr 28, 2018

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

encoding/csv: add the true number of lines to ErrFieldCount #6770

encoding/csv: add the true number of lines to ErrFieldCount #6770

btracey commented Nov 15, 2013

robpike commented Nov 16, 2013

btracey commented Nov 18, 2013

btracey commented Nov 18, 2013

rsc commented Nov 27, 2013

rsc commented Dec 4, 2013

rsc commented Dec 4, 2013

nussjustin commented Apr 28, 2017

bradfitz commented Apr 28, 2017

nussjustin commented Apr 28, 2017

bradfitz commented Apr 28, 2017

nussjustin commented Apr 28, 2017

bradfitz commented Apr 28, 2017

nussjustin commented Apr 28, 2017

encoding/csv: add the true number of lines to ErrFieldCount #6770

encoding/csv: add the true number of lines to ErrFieldCount #6770

Comments

btracey commented Nov 15, 2013

robpike commented Nov 16, 2013

btracey commented Nov 18, 2013

btracey commented Nov 18, 2013

rsc commented Nov 27, 2013

rsc commented Dec 4, 2013

rsc commented Dec 4, 2013

nussjustin commented Apr 28, 2017

bradfitz commented Apr 28, 2017

nussjustin commented Apr 28, 2017

bradfitz commented Apr 28, 2017

nussjustin commented Apr 28, 2017

bradfitz commented Apr 28, 2017

nussjustin commented Apr 28, 2017