I am using read.table to read a data file, and I get the following error:
Error in scan(file, what, nmax, sep, dec, quote, skip, nlines, na.strings,  :
  scan() expected 'a real', got 'true'
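I understand this means a column I declared as numeric contains the string 'true' somewhere in my data file. The problem is locating it: the error message does not say which line is at fault, and the file is too large to inspect by hand. How can I find the offending lines, or alternatively, how can I skip them?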
Here is my R code:
data <- read.csv("/home/jianfezhang/prod/conversion_yaap/data/part-r-00000",
                 sep = "\t",
                 col.names = c("site",
                               "treatment",
                               "mode",
                               "segment",
                               "source",
                               "itemId",
                               "leaf_categ_id",
                               "condition_id",
                               "auct_type_code",
                               "start_price_lstg_curncy",
                               "bin_price_lstg_curncy",
                               "start_price_variance",
                               "start_price_mean",
                               "start_price_media",
                               "bin_price_variance",
                               "bin_price_mean",
                               "bin_price_media",
                               "is_sold"),
                 colClasses = c(rep("factor", 5), "numeric", rep("factor", 3),
                                rep("numeric", 8), "factor"))
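
One direction that might work for finding the bad rows, shown as a minimal sketch rather than a confirmed fix: read every column as character so nothing fails during the scan, then check which values in the supposedly numeric columns do not convert. The three-column sample file, the `price` column, and the helper `is_bad` are made up for illustration and stand in for the real file and its column names.

```r
# Hypothetical sample standing in for the real file: tab-separated, no header,
# where the second column should be numeric but line 2 contains "true".
tmp <- tempfile()
writeLines(c("a\t1.5\tx", "b\ttrue\ty", "c\t2.0\tz"), tmp)

# Read everything as character so no type conversion can abort the read.
raw <- read.table(tmp, sep = "\t", colClasses = "character",
                  col.names = c("site", "price", "is_sold"),
                  quote = "", comment.char = "")

# Columns expected to be numeric (illustrative; substitute the real names).
numeric_cols <- c("price")

# A value is "bad" when it is present but cannot be converted to a number.
is_bad <- function(x) {
  !is.na(x) & x != "" & is.na(suppressWarnings(as.numeric(x)))
}

# Flag every row with a bad value in any of the numeric columns.
bad <- Reduce("|", lapply(raw[numeric_cols], is_bad))
bad_rows <- which(bad)

print(bad_rows)        # row indices of the offending records
print(raw[bad, ])      # the offending records themselves

# To skip those rows instead: drop them, then convert the columns.
clean <- raw[!bad, , drop = FALSE]
clean[numeric_cols] <- lapply(clean[numeric_cols], as.numeric)
```

The row indices match line numbers in the file only if the file has no header line and no blank lines, since read.table skips blank lines by default. Note also that read.csv defaults to header = TRUE, unlike read.table, so with read.csv the first line of the file is consumed as a header.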