fread unable to read .csv files with first column empty

拟墨画扇 提交于 2019-12-18 14:54:44

问题


Say I have the first test.csv that looks like this

,a,b,c,d,e

If I try to read it using read.csv, it works fine.

read.csv("test.csv",header=FALSE)
#  V1 V2 V3 V4 V5 V6
#1 NA  a  b  c  d  e
#Warning message:
#In read.table(file = file, header = header, sep = sep, quote = quote,  :
#  incomplete final line found by readTableHeader on 'test.csv'

However, if I attempt to read this file using fread, i get an error instead.

require(data.table)
fread("test.csv",header=FALSE)
#Error in fread("test.csv", header = FALSE) : 
#  Not positioned correctly after testing format of header row. ch=','

Why does this happen and what can I do to correct this?


回答1:


As for me, my problem was only that the first ? rows of my file had a missing ID value.

So I was able to solve the problem by specifying autostart to be sufficiently far into the file that a nonmissing value popped up:

fread("test.csv", autostart = 100L, skip = "A")

This guarantees that when fread attempts to automatically identify sep and sep2, it does so at a well-formatted place in the file.

Specifying skip also makes sure fread finds the correct row in which to base the names of the columns.

If indeed there are no nonmissing values for the first field, you're better off just deleting that field from the .csv with Richard Scriven's approach or a find-and-replace in your favorite text editor.




回答2:


I think you could use skip/select/drop attributes of the fread function for this purpose.

fread("myfile.csv",sep=",",header=FALSE,skip="A")#to just skip the 1st column
fread("myfile.csv",sep=",",header=FALSE,select=c(2,3,4,5)) # to read other columns except 1
fread("myfile.csv",sep=",",header=FALSE,drop="A") #to drop first column



回答3:


I've tried making that csv file and running the code. It seems to work now - same for other people? I thought it might be an issue with not having a new line at the end (hence the warning from read.csv), but fread copes fine whether there's an new line at the end or not.



来源:https://stackoverflow.com/questions/22344161/fread-unable-to-read-csv-files-with-first-column-empty

标签
易学教程内所有资源均来自网络或用户发布的内容,如有违反法律规定的内容欢迎反馈
该文章没有解决你所遇到的问题?点击提问,说说你的问题,让更多的人一起探讨吧!