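I have some Python experience, but not much. I have never handled XML in Python before, and now I have to. I have a string that contains an XML document and I am trying to parse it in Python. I want to store this XML in a DataFrame, but I cannot get it to parse.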

import base64
import lxml.etree as ET

# rs and lz4ToString are defined elsewhere in my code
lz4UC = rs['trade']['uc']
UC = lz4ToString(base64.b64decode(lz4UC))
parser = ET.XMLParser(recover=True)
tree = ET.parse(UC, parser=parser)  # option 1
#tree2 = ET.fromstring(UC, parser=parser)  # option 2

Error message for option 1: OSError: Error reading file '<?xml version="1.0" encoding="UTF-8" standalone="yes"?>
Error message for option 2: ValueError: Unicode strings with encoding declaration are not supported. Please use bytes input or XML fragments without declaration.

UC looks like:

'<?xml version="1.0" encoding="UTF-8" standalone="yes"?>
<positionEventMessage xmlns="urn:XXXX:uc" xmlns:td="urn:XXXX:uc:trade-id" xmlns:dt="http://www.dtcc.com/ext" xmlns:ip="urn:XXXX:ipt" xmlns:fpml="http://www.fpml.org/FpML-5/recordkeeping" xmlns:dtx="urn:XXXX:dtcc-5-ext" xmlns:w3="http://www.w3.org/2000/09/xmldsig#" xmlns:XXXX="urn:XXXX:fpml-5-ext" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance">
    <header>
        <sourceSystem>RODS</sourceSystem>
        <originatingSystem>MXG2000</originatingSystem>
        <timestamp>2020-07-04T16:23:46Z</timestamp>
    </header>
    <positionEvent>
        <eventType>Position:Update</eventType>
        <businessDate>2020-07-04</businessDate>
        <businessTime>16:23:46.046Z</businessTime>
        <position>
            <primaryAssetClass>Cash</primaryAssetClass>
            <productType productTypeScheme="urn:XXXX:product-type:RODS">ACFACFACF</productType>
            <productType productTypeScheme="urn:XXXX:product-type:RODS:qlDesc">ACF-FXD</productType>
            <owner>
                <partyReference href="Party1"/>
                <accountReference href="Account1"/>
            </owner>
            <aggregationCategory aggregationCategoryScheme="urn:XXXX:aggregation-category:MUREX:instrument">ACF-FXD</aggregationCategory>
            <currencyPair>
                <fpml:currency1>USD</fpml:currency1>
                <fpml:currency2>SAR</fpml:currency2>
            </currencyPair>
            <positionId positionIdScheme="urn:XXXX:position-id:HTI">0000002442892000207911</positionId>
            <positionId positionIdScheme="urn:XXXX:position-id:RODS:regulatory-key">999999999894891</positionId>
            <positionId positionIdScheme="urn:XXXX:position-id:RODS:valuation-id">USDSAR209</positionId>
            <positionId positionIdScheme="urn:XXXX:position-id:RODS:GlobalId">2000207911</positionId>
            <version>20151207000000000</version>
            <fpml:cash>
                <fpml:currency>SAR</fpml:currency>
            </fpml:cash>
            <positionType>Long</positionType>
            <quantity>7426113.8099999996</quantity>
            <internalProductType>
                <ip:productType productName="FX - SIMPLE CASH FLOW"/>
            </internalProductType>
        </position>
    </positionEvent>
    <party id="Party1">
        <fpml:partyId partyIdScheme="urn:XXXX:party-id:PO_ID">PO7</fpml:partyId>
        <fpml:partyId partyIdScheme="urn:XXXX:party-id:PO_GROUP">LOH</fpml:partyId>
        <fpml:partyId partyIdScheme="urn:XXXX:party-id:GROUP_ID">MDBK</fpml:partyId>
        <fpml:partyId partyIdScheme="urn:XXXX:party-id:BRANCH_ID">610</fpml:partyId>
        <fpml:partyId partyIdScheme="urn:XXXX:party-id:GRID_ID">43146</fpml:partyId>
    </party>
    <account id="Account1">
        <fpml:accountId accountIdScheme="urn:XXXX:book-id:RODS">209</fpml:accountId>
        <fpml:accountId accountIdScheme="urn:XXXX:book-id:HMS">FO0025489</fpml:accountId>
        <fpml:accountBeneficiary href="Party1"/>
    </account>
</positionEventMessage>'

1 Answer

Try it this way:

uc = """[your xml above]"""
tree = ET.XML(uc.encode())

See if that works.
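Here is a slightly fuller sketch of the same idea, in case it helps. Encoding the string to bytes is what gets around the ValueError from option 2, since lxml refuses a str that carries an encoding declaration. ET.parse (option 1) expects a file name or file object, which is why it tried to open the XML text as a path. The uc value below is a shortened, made-up stand-in for your real document, and local_name and rows are just illustrative names I picked. The fallback import is only there so the snippet also runs without lxml installed.

```python
try:
    import lxml.etree as ET  # the library used in the question
except ImportError:  # fallback so the sketch still runs without lxml
    import xml.etree.ElementTree as ET

# Shortened, hypothetical stand-in for the decoded UC string.
uc = (
    '<?xml version="1.0" encoding="UTF-8" standalone="yes"?>\n'
    '<positionEventMessage xmlns="urn:XXXX:uc" '
    'xmlns:fpml="http://www.fpml.org/FpML-5/recordkeeping">'
    '<header><sourceSystem>RODS</sourceSystem></header>'
    '<positionEvent><position>'
    '<currencyPair><fpml:currency1>USD</fpml:currency1>'
    '<fpml:currency2>SAR</fpml:currency2></currencyPair>'
    '<quantity>7426113.8099999996</quantity>'
    '</position></positionEvent>'
    '</positionEventMessage>'
)

# Parse from bytes, not from str: the XML declaration names an encoding,
# so the parser wants to do the decoding itself.
root = ET.fromstring(uc.encode("utf-8"))


def local_name(tag):
    """Strip the '{namespace}' prefix that ElementTree puts on tags."""
    return tag.rsplit("}", 1)[-1]


# One row per element that carries text; containers are skipped.
rows = []
for elem in root.iter():
    if not isinstance(elem.tag, str):
        continue  # lxml yields comments and processing instructions too
    text = (elem.text or "").strip()
    if text:
        rows.append({"tag": local_name(elem.tag), "value": text})

for row in rows:
    print(row["tag"], row["value"])
```

If pandas is available, pandas.DataFrame(rows) should turn that list of dicts into a two-column frame. Whether one row per leaf element is the layout you want depends on your data, so treat this as a starting point only. Attributes such as positionIdScheme are not captured here.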

Answered 2020-08-20T16:32:18.340