0

在这里,我将 mongodb 驱动程序用于 ruby​​。但是在这将完美运行之后,我想在 Ruby on Rails 3 和 Mongoid ODB 中将它作为计划任务运行。

所以现在,我正在试验红宝石。

我注意到在将 XML 文件转换为可以插入 mongodb 的格式时,crack gem 非常方便。当我为 ruby​​ 使用 mongodb 驱动程序时,crack 转换为接近 JSON 的格式(它使用“=>”而不是“:”列),这是我将其插入 mondodb 数据库之前的必要条件,如下所示

我在下面使用破解的方式的问题是导入 XML 文件中的所有内容。请看下文。

示例.xml

<?xml version="1.0" encoding="utf-8"?>
<ShipmentRequest>
  <Envelope>
    <TransmissionDateTime>05/08/2013 23:06:02</TransmissionDateTime>
  </Envelope>
  <Message>
    <Comment />
    <Header>
      <MemberId>A00000001</MemberId>
      <MemberName>Bruce</MemberName>
      <DeliveryId>6377935</DeliveryId>
      <ShipToAddress1>123-4567</ShipToAddress1>
      <OrderDate>05/08/13</OrderDate>
      <Payments>
        <PayType>Credit Card</PayType>
        <Amount>1000</Amount>
      </Payments>
      <Payments>
        <PayType>Points</PayType>
        <Amount>5390</Amount>
      </Payments>
    </Header>
    <Line>
      <LineNumber>3.1</LineNumber>
      <Item>fruit-004</Item>
      <Description>Peach</Description>
      <Quantity>1</Quantity>
      <UnitCost>1610</UnitCost>
      <DeclaredValue>0</DeclaredValue>
      <PointValue>13</PointValue>
    </Line>
    <Line>
      <LineNumber>8.1</LineNumber>
      <Item>fruit-001</Item>
      <Description>Fruit Set</Description>
      <Quantity>1</Quantity>
      <UnitCost>23550</UnitCost>
      <PointValue>105</PointValue>
      <PickLine>
        <PickLineNumber>8.1..1</PickLineNumber>
        <PickItem>fruit-002</PickItem>
        <PickDescription>Apple</PickDescription>
        <PickQuantity>1</PickQuantity>
      </PickLine>
      <PickLine>
        <PickLineNumber>8.1..2</PickLineNumber>
        <PickItem>fruit-003</PickItem>
        <PickDescription>Orange</PickDescription>
        <PickQuantity>2</PickQuantity>
      </PickLine>
    </Line>
  </Message>
</ShipmentRequest>

sample_crack.rb

#!/usr/bin/ruby

require "crack"
require 'mongo'
include Mongo

mongo_client = MongoClient.new("localhost", 27017)
db = mongo_client.db("somedb")
coll = db.collection("somecoll")

myXML  = Crack::XML.parse(File.read("sample.xml"))
coll.insert(myXML)

puts myXML

它在控制台上打印:

{"ShipmentRequest"=>{"Envelope"=>{"TransmissionDateTime"=>"05/08/2013 23:06:02"}, "Message"=>{"Comment"=>nil, "Header"=>{"MemberId"=>"A00000001", "MemberName"=>"Bruce", "DeliveryId"=>"6377935", "ShipToAddress1"=>"123-4567", "OrderDate"=>"05/08/13", "Payments"=>[{"PayType"=>"Credit Card", "Amount"=>"1000"}, {"PayType"=>"Points", "Amount"=>"5390"}]}, "Line"=>[{"LineNumber"=>"3.1", "Item"=>"fruit-004", "Description"=>"Peach", "Quantity"=>"1", "UnitCost"=>"1610", "DeclaredValue"=>"0", "PointValue"=>"13"}, {"LineNumber"=>"8.1", "Item"=>"fruit-001", "Description"=>"Fruit Set", "Quantity"=>"1", "UnitCost"=>"23550", "PointValue"=>"105", "PickLine"=>[{"PickLineNumber"=>"8.1..1", "PickItem"=>"fruit-002", "PickDescription"=>"Apple", "PickQuantity"=>"1"}, {"PickLineNumber"=>"8.1..2", "PickItem"=>"fruit-003", "PickDescription"=>"Orange", "PickQuantity"=>"2"}]}]}}, :_id=>BSON::ObjectId('51ad8d83a3d24b3b9f000001')}

在 mongodb 中,转换后的 XML 文件如下所示:

{
    "_id" : ObjectId("51ad8d83a3d24b3b9f000001"),
    "ShipmentRequest" : {
        "Envelope" : {
            "TransmissionDateTime" : "05/08/2013 23:06:02"
        },
        "Message" : {
            "Comment" : null,
            "Header" : {
                "MemberId" : "A00000001",
                "MemberName" : "Bruce",
                "DeliveryId" : "6377935",
                "ShipToAddress1" : "123-4567",
                "OrderDate" : "05/08/13",
                "Payments" : [
                    {
                        "PayType" : "Credit Card",
                        "Amount" : "1000"
                    },
                    {
                        "PayType" : "Points",
                        "Amount" : "5390"
                    }
                ]
            },
            "Line" : [
                {
                    "LineNumber" : "3.1",
                    "Item" : "fruit-004",
                    "Description" : "Peach",
                    "Quantity" : "1",
                    "UnitCost" : "1610",
                    "DeclaredValue" : "0",
                    "PointValue" : "13"
                },
                {
                    "LineNumber" : "8.1",
                    "Item" : "fruit-001",
                    "Description" : "Fruit Set",
                    "Quantity" : "1",
                    "UnitCost" : "23550",
                    "PointValue" : "105",
                    "PickLine" : [
                        {
                            "PickLineNumber" : "8.1..1",
                            "PickItem" : "fruit-002",
                            "PickDescription" : "Apple",
                            "PickQuantity" : "1"
                        },
                        {
                            "PickLineNumber" : "8.1..2",
                            "PickItem" : "fruit-003",
                            "PickDescription" : "Orange",
                            "PickQuantity" : "2"
                        }
                    ]
                }
            ]
        }
    }
}

但我想导入它来消除不需要的节点并忽略空节点:

{
    "_id" : ObjectId("51ad8d83a3d24b3b9f000001"),
    "MemberId" : "A00000001",
    "MemberName" : "Bruce",
    "DeliveryId" : "6377935",
    "ShipToAddress1" : "123-4567",
    "OrderDate" : "05/08/13",
    "Payments" : [
    {
        "PayType" : "Credit Card",
        "Amount" : "1000"
    },
    {
        "PayType" : "Points",
        "Amount" : "5390"
    }
    ],
    "Line" : [
    {
        "LineNumber" : "3.1",
        "Item" : "fruit-004",
        "Description" : "Peach",
        "Quantity" : "1",
        "UnitCost" : "1610",
        "DeclaredValue" : "0",
        "PointValue" : "13"
    },
    {
        "LineNumber" : "8.1",
        "Item" : "fruit-001",
        "Description" : "Fruit Set",
        "Quantity" : "1",
        "UnitCost" : "23550",
        "PointValue" : "105",
        "PickLine" : [
        {
            "PickLineNumber" : "8.1..1",
            "PickItem" : "fruit-002",
            "PickDescription" : "Apple",
            "PickQuantity" : "1"
        },
        {
            "PickLineNumber" : "8.1..2",
            "PickItem" : "fruit-003",
            "PickDescription" : "Orange",
            "PickQuantity" : "2"
        }
        ]
    }
    ]
}

这可以用裂纹完成吗?或者这可以用 nokogiri 做得更好?


更新

非常感谢@Alex Peachey,我在这里放了更新的代码。

sample_crack/rb(更新):

#!/usr/bin/ruby

require "crack"
require 'mongo'
include Mongo

mongo_client = MongoClient.new("localhost", 27017)
db = mongo_client.db("somedb")
coll = db.collection("somecoll")

myXML  = Crack::XML.parse(File.read("sample.xml"))

myXML.merge!(myXML.delete("ShipmentRequest")) # not needed hash
myXML.merge!(myXML.delete("Message"))         # not needed hash
myXML.merge!(myXML.delete("Header"))          # not needed hash
myXML.delete("Envelope")                      # not needed hash

# planning to put here a code to remove hashes with empty values

coll.insert(myXML)

puts myXML
4

1 回答 1

2

很难说你如何定义“不需要”的节点,但空节点很容易理解。无论哪种方式,Crack 都非常擅长它为您做的事情,即基本上将 XML 转换为 Hash。一旦你有了哈希,在你将它插入 Mongo 之前,根据你所拥有的任何规则随意修剪它。

根据您的评论,我更好地理解了您的要求。我的回答仍然成立,只是操纵哈希。具体来说,您可以这样做:

myXML.merge!(myXML.delete("ShipmentRequest"))
myXML.delete("Envelope")
myXML.merge!(myXML.delete("Message"))
于 2013-06-04T07:14:31.423 回答