0

如何在 Hadoop 上解析具有如下结构的 xml 文件:

<row Id="2292" PostTypeId="2" ParentId="2284" CreationDate="2008-08-05T13:28:06.700" Score="0" ViewCount="0" Body="&lt;p&gt;The first thing you should do is contact the main people who run the open source project. Ask them if it is ok to contribute to the code and go from there.&lt;/p&gt;&#xD;&#xA;&#xD;&#xA;&lt;p&gt;Simply writing your improved code and then giving it to them may result in your code being rejected.&lt;/p&gt;" OwnerUserId="383" LastActivityDate="2008-08-05T13:28:06.700" />

注意:我已经为它编写了代码,但它不能正常工作。需要一个新的方法...

提前致谢...

4

1 回答 1

1

看看XMLInputFormat,它可能需要修改一下。

于 2013-10-20T11:32:03.853 回答