0

我正在尝试解析从国会图书馆下载的 MARC 文件。我已经使用 PyZ3950 成功下载了记录,但是当我尝试使用 PyMarc 解析文件时,出现以下错误:

Traceback (most recent call last):
  File "test.py", line 13, in <module>
    for record in reader:
  File "build/bdist.macosx-10.9-intel/egg/pymarc/reader.py", line 83, in next
ValueError: invalid literal for int() with base 10: '<PyZ3'

这是我的完整代码:

from PyZ3950 import zoom, zmarc
from pymarc import MARCReader

conn = zoom.Connection('z3950.loc.gov', 7090)
conn.databaseName = 'VOYAGER'
conn.preferredRecordSyntax = 'USMARC'

query = zoom.Query('CCL', 'ti="1066 and all that"')

res = conn.search(query)

reader = MARCReader(str(res))
for record in reader:
        print record.title()

conn.close()
4

1 回答 1

1

您的声明:

res = conn.search(query)

ResultSet根据http://www.panix.com/~asl2/software/PyZ3950/zoom.html返回 a

r中的每条记录resultSet都有数据r.data

r.data因此,您必须将每个或所有连接的 MARCReader 提供给 MARCReader 。

这将起作用:

from PyZ3950 import zoom, zmarc
from pymarc import MARCReader
conn = zoom.Connection('z3950.loc.gov', 7090)
conn.databaseName = 'VOYAGER'
conn.preferredRecordSyntax = 'USMARC'
query = zoom.Query('CCL', 'ti="1066 and all that"')
res = conn.search(query)
marc = ''
for r in res:
    marc = marc + r.data
reader = MARCReader(marc)
for record in reader:
        print record.title()
conn.close()
于 2014-03-18T19:24:11.347 回答