Golang : Read large file with bufio.Scanner cause token too long error
There are couple of ways to read files with Golang and one of the common way to read in file is with bufio.NewScanner() function. For example :
package main
import (
"bufio"
"fmt"
"os"
)
func main() {
file, err := os.Open("somefile.dat")
if err != nil {
panic(err.Error())
}
defer file.Close()
reader := bufio.NewReader(file)
scanner := bufio.NewScanner(reader)
scanner.Split(bufio.ScanBytes)
for scanner.Scan() {
fmt.Println(scanner.Bytes())
}
if err := scanner.Err(); err != nil {
fmt.Println("error while reading :", err)
}
}
Most of the time, things should be fine and dandy. However, there will be cases when the program attempts to read in a very large file and it will throw out this error message :
"bufio.Scanner: token too long"
This error message was caused by the input token size that exceeded the bufio.MaxScanTokenSize
buffer size. The allocated size is MaxScanTokenSize = 64 * 1024
To read in large files, it is recommended to use bufio.Reader.ReadLine instead.
References :
See also : Golang : Scanf function weird error in Windows
By Adam Ng
IF you gain some knowledge or the information here solved your programming problem. Please consider donating to the less fortunate or some charities that you like. Apart from donation, planting trees, volunteering or reducing your carbon footprint will be great too.
Advertisement
Tutorials
+23.3k Golang : Check if element exist in map
+5.4k Golang : Configure crontab to poll every two minutes 8am to 6pm Monday to Friday
+7.2k Golang : Gorrila set route name and get the current route name
+10.3k Golang : Resolve domain name to IP4 and IP6 addresses.
+5.9k Golang : Extract XML attribute data with attr field tag example
+32k Golang : Copy directory - including sub-directories and files
+12.4k Golang : Pass database connection to function called from another package and HTTP Handler
+22.4k Golang : Strings to lowercase and uppercase example
+17.3k Golang : Upload/Receive file progress indicator
+15.2k Golang : Intercept Ctrl-C interrupt or kill signal and determine the signal type
+21.7k Fix "Failed to start php5-fpm.service: Unit php5-fpm.service is masked."
+4.9k Golang : Customize scanner.Scanner to treat dash as part of identifier