go/scanner: potential inefficiency #35588

danaugrs · 2019-11-14T15:23:48Z

I noticed a potential scanner inefficiency here:

Lines 791 to 793 in 3f21c23

    
           switch ch := s.ch; { 
        
           case isLetter(ch): 
        
           	lit = s.scanIdentifier()

s.scanIdentifier() calls isLetter(s.ch) with the same char that was just checked in the case condition:

go/src/go/scanner/scanner.go

Lines 350 to 356 in 3f21c23

    
           func (s *Scanner) scanIdentifier() string { 
        
           	offs := s.offset 
        
           	for isLetter(s.ch) || isDigit(s.ch) { 
        
           		s.next() 
        
           	} 
        
           	return string(s.src[offs:s.offset]) 
        
           }

I think that instead of calling s.next() on line 809 we should both record the current char and call s.next() immediately before the entire nested switch statement. That would remove the double call to isLetter and also remove the need for the s.peek() on line 805. What do you think? It's very possible I'm overlooking something. Let me know if I should put together a PR.

The text was updated successfully, but these errors were encountered:

mvdan · 2019-11-14T15:26:28Z

There are benchmarks. Does any change you can come up with actually improve the benchmarks with benchstat? If not, then it's probably not worth optimizing.

danaugrs · 2019-11-14T17:13:47Z

Before modifying and benchmarking something I find it beneficial to discuss it with those familiar with the matter. Is anyone familiar with the scanner logic interested in reasoning about the inefficiency I described and my proposal? @griesemer

rsc · 2019-11-14T17:21:46Z

It is almost always better to write code that is clear and easy to adjust in the future than code that is awkward but saves a few cycles. In this case, the string conversion far outweighs the cost of potentially looking up the first character twice. Even if the string conversion were not here, the current code can't be running slower than if every identifier were just one character longer, which we know doesn't really affect time. (Otherwise everyone would be saying to use short names in your program to make it parse faster.)

There's no change to make here, but thanks for taking the time to file the issue.

danaugrs changed the title ~~Potential scanner inefficiency~~ go/scanner: potential inefficiency Nov 14, 2019

rsc closed this as completed Nov 14, 2019

golang locked and limited conversation to collaborators Nov 13, 2020

gopherbot added the FrozenDueToAge label Nov 13, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

go/scanner: potential inefficiency #35588

go/scanner: potential inefficiency #35588

danaugrs commented Nov 14, 2019

mvdan commented Nov 14, 2019

danaugrs commented Nov 14, 2019

rsc commented Nov 14, 2019

go/scanner: potential inefficiency #35588

go/scanner: potential inefficiency #35588

Comments

danaugrs commented Nov 14, 2019

mvdan commented Nov 14, 2019

danaugrs commented Nov 14, 2019

rsc commented Nov 14, 2019