0

I am quite a newbie with xml. I used XML in R to parse content in xml and put into R objects. I have to deal with nearly 1TB xml data and it took me around 5 hours to parse 2.4 GB data. I know that xmlschema is used to generate xml. I wonder if there is any better method to convert xml to data or another method to use xmlschema to read xml and put values back into raw data other than xmlParse? I now have 5 xmlschema and xml. (I thought it is complex xml)

  • xmlns:nxce="http://tfm.faa.gov/tfms/NasXCoreElements"
  • xmlns:mmd="http://tfm.faa.gov/tfms/MessageMetaData"
  • xmlns:nxcm="http://tfm.faa.gov/tfms/NasXCommonMessages"
  • xmlns:idr="http://tfm.faa.gov/tfms/TFMS_IDRS"
  • xmlns:xis="http://tfm.faa.gov/tfms/TFMS_XIS"
  • xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"
  • xsi:schemaLocation="http://tfm.faa.gov/tfms/TFMS_XIS

sample data: http://www.fly.faa.gov/ASDI/asdidocs/asdi_sample_data.zip I want to extract all flightManagementInfomation data out using SAX

Thanks in advance.

4

1 回答 1

0

模式的使用不会提高 XML 加载的性能——它们会告诉您一些有关已解析 XML 的预期结构的信息,但与解析过程本身无关。

您需要使用不同的解析器 - 如果在 R 中可用(如 Martin 所建议),或者将 XML 数据转换为 R 可以使用其他语言更容易处理的东西

于 2013-08-20T14:59:50.407 回答