3

我正在摄取一个 CSV 文件:

"ID","LASTNAME","FIRSTNAME","PERM_ADDR1","PERM_ADDR2","PERM_CITY","PERM_ST","PERM_ZIP","DOB","LIB_TYPE","BARCODE","EMAIL","LOCAL_ADDR1","LOCAL_ADDR2","LOCAL_CITY","LOCAL_ST","LOCAL_ZIP","CAMPUS_ADDR1","CAMPUS_ADDR2","CAMPUS_CITY","CAMPUS_ST","CAMPUS_ZIP","DEPARTMENT","MAJOR"
"123","Lastname","Firstname","123 Home St","","Home City","HS","12345-6789","0101","S","1234567890","last.first@domain.local","123 Local St","","Local City","LS","98765-4321","123 Campus St","","Campus City","CS","54321-6789","IT",""

使用Text::CSV,我试图将其解析为哈希:

my $csv = Text::CSV->new();

chomp(my $line = <READ>);
$csv->column_names(split(/,/, $line));

until (eof(READ)) {
    $line = $csv->getline_hr(*READ);
    my %linein = %$line;
    my %patron;

    $patron{'patronid'} = $linein{'ID'};
    $patron{'last'} = $linein{'LASTNAME'};
    $patron{'first'} = $linein{'FIRSTNAME'};

    print p(%linein)."\n";
    print p(%patron)."\n";
}

使用此代码,最后的打印语句(使用Data::Printer)返回:

{
    "BARCODE"        1234567890,
    "CAMPUS_ADDR1"   "123 Campus St",
    "CAMPUS_ADDR2"   "",
    "CAMPUS_CITY"    "Campus City",
    "CAMPUS_ST"      "CS",
    "CAMPUS_ZIP"     "54321-6789",
    "DEPARTMENT"     "IT",
    "DOB"            0101,
    "EMAIL"          "last.first@domain.local",
    "FIRSTNAME"      "Firstname",
    "ID"             123,
    "LASTNAME"       "Lastname",
    "LIB_TYPE"       "S",
    "LOCAL_ADDR1"    "123 Local St",
    "LOCAL_ADDR2"    "",
    "LOCAL_CITY"     "Local City",
    "LOCAL_ST"       "LS",
    "LOCAL_ZIP"      "98765-4321",
    "MAJOR"          "",
    "PERM_ADDR1"     "123 Home St",
    "PERM_ADDR2"     "",
    "PERM_CITY"      "Home City",
    "PERM_ST"        "HS",
    "PERM_ZIP"       "12345-6789"
}
{
    first      undef,
    last       undef,
    patronid   undef
}

我不明白为什么%patron没有填充来自%linein. 我想知道这是否与 using 有某种关系Text::CSV,因为我正在解析脚本中其他地方的其他文件并且它们工作得很好。然而,这些文件不是 CSV,而是固定宽度,所以我手动解析它们。

4

1 回答 1

6

尝试

 $csv->column_names(map {/"(.*)"/ and $1} split(/,/, $line))

代替

 $csv->column_names(split(/,/, $line));

您的 CSV 键被定义为文字字符串

 '"LASTNAME"' ,  '"FIRSTNAME"'

而不仅仅是

 'LASTNAME' ,  'FIRSTNAME'

Data::Printer在向您展示发生了什么方面做得还不错 - 中的所有键p(%linein)都显示为包含双引号作为字符串的一部分,而不是p(%patron)

于 2013-01-30T20:08:58.537 回答