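How to parse nested XML in SQL Server into a single table, given that each customer's RowGuid is unique

For example:

I want to parse this XML into a single table. The table will be denormalized and will contain the one-to-many relationships. Each nesting level has a business primary key.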

<Customers>
    <Customer>
         <Type xsi:nil="true" />
          <RowGuid>FEFF32BC-1DAB-4F8A-80F0-CFE293C0BEC4</RowGuid>
          <AccountId>0</AccountId>
          <AccountNumber>bdb8eb51-d</AccountNumber>
          <AccountTransactions>
            <AccountTransaction>
                <PaymentDate>2012-09-13 22:19:58</PaymentDate>
                <Balance>500</Balance>
            </AccountTransaction>
            <AccountTransaction>
                <PaymentDate>2012-09-13 22:19:58</PaymentDate>
                <Balance>500</Balance>
            </AccountTransaction>
         </AccountTransactions>
        <Addresses>
             <Address>
                    <city>DELHI</city>
             </Address>
             <Address>
                    <city>MUMBAI</city>
             </Address>
         </Addresses>
      </Customer>
    <Customer>
         <Type xsi:nil="true" />
          <RowGuid>C3D4772E-1DAB-4F8A-80F0-CFE293C0BEC4</RowGuid>
          <AccountId>0</AccountId>
          <AccountNumber>bdb8eb51-d</AccountNumber>
          <AccountTransactions>
            <AccountTransaction>
                <PaymentDate>2012-09-13 22:19:58</PaymentDate>
                <Balance>500</Balance>
            </AccountTransaction>
            <AccountTransaction>
                <PaymentDate>2012-09-13 22:19:58</PaymentDate>
                <Balance>500</Balance>
            </AccountTransaction>
         </AccountTransactions>
      </Customer>
</Customers>

1 Answer

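If the table does not need to be normalized, you can use a LEFT JOIN. I also added a namespace declaration to the Customers element, because xsi:nil="true" uses the xsi prefix, which has to be declared before the XML will parse. Try this: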

DECLARE @xml XML =
'<Customers xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance">
    <Customer>
         <Type xsi:nil="true" />
          <RowGuid>FEFF32BC-1DAB-4F8A-80F0-CFE293C0BEC4</RowGuid>
          <AccountId>0</AccountId>
          <AccountNumber>bdb8eb51-d</AccountNumber>
          <AccountTransactions>
            <AccountTransaction>
                <PaymentDate>2012-09-13 22:19:58</PaymentDate>
                <Balance>500</Balance>
            </AccountTransaction>
            <AccountTransaction>
                <PaymentDate>2012-09-13 22:19:58</PaymentDate>
                <Balance>500</Balance>
            </AccountTransaction>
         </AccountTransactions>
        <Addresses>
             <Address>
                    <city>DELHI</city>
             </Address>
             <Address>
                    <city>MUMBAI</city>
             </Address>
         </Addresses>
      </Customer>
    <Customer>
         <Type xsi:nil="true" />
          <RowGuid>C3D4772E-1DAB-4F8A-80F0-CFE293C0BEC4</RowGuid>
          <AccountId>0</AccountId>
          <AccountNumber>bdb8eb51-d</AccountNumber>
          <AccountTransactions>
            <AccountTransaction>
                <PaymentDate>2012-09-13 22:19:58</PaymentDate>
                <Balance>500</Balance>
            </AccountTransaction>
            <AccountTransaction>
                <PaymentDate>2012-09-13 22:19:58</PaymentDate>
                <Balance>500</Balance>
            </AccountTransaction>
         </AccountTransactions>
      </Customer>
</Customers>'

-- One derived table per nesting level, joined back on the customer's RowGuid.
SELECT  a.[Type],
        a.RowGuid,
        a.AccountId,
        a.AccountNumber,
        b.PaymentDate,
        b.Balance,
        c.[Address]
FROM
(
    -- Customer-level scalar values: one row per <Customer>
    SELECT  
            Customer.value('Type[1]', 'VARCHAR(500)') [Type],
            Customer.value('RowGuid[1]', 'UNIQUEIDENTIFIER') RowGuid,
            Customer.value('AccountId[1]', 'INT') AccountId,
            Customer.value('AccountNumber[1]', 'VARCHAR(500)') AccountNumber
    FROM    @xml.nodes('/Customers/Customer') tbl(Customer)
) a
LEFT JOIN
(
    -- One row per <AccountTransaction>; ../../RowGuid walks up to the owning <Customer>
    SELECT  
            AccountTransaction.value('PaymentDate[1]', 'DATETIME') PaymentDate,
            AccountTransaction.value('Balance[1]', 'DECIMAL(20, 2)') Balance,
            AccountTransaction.value('../../RowGuid[1]', 'UNIQUEIDENTIFIER') RowGuid
    FROM    @xml.nodes('/Customers/Customer/AccountTransactions/AccountTransaction') tbl(AccountTransaction)
)   b ON
    a.RowGuid = b.RowGuid
LEFT JOIN
(
    -- One row per <Address>, again keyed by the owning customer's RowGuid
    SELECT  
            Address.value('city[1]', 'VARCHAR(500)') [Address],
            Address.value('../../RowGuid[1]', 'UNIQUEIDENTIFIER') RowGuid
    FROM    @xml.nodes('/Customers/Customer/Addresses/Address') tbl(Address)
)   c ON
    a.RowGuid = c.RowGuid
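
Transactions and addresses are joined to the customer independently, so each customer produces one row per transaction/address combination, and a customer with no addresses still appears with a NULL address. The same shape can be sketched without the parent-axis lookups (`../../RowGuid`) by shredding each customer's children with OUTER APPLY. This is only an alternative sketch, not the approach used in the answer above: the trimmed sample document, the aliases `cust`, `tx`, and `addr`, and the use of `text()` in the paths are my own assumptions, and whether it is actually cheaper has not been measured here.

```sql
-- Trimmed copy of the sample document (one customer, one transaction, two addresses).
DECLARE @xml XML =
N'<Customers xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance">
    <Customer>
        <Type xsi:nil="true" />
        <RowGuid>FEFF32BC-1DAB-4F8A-80F0-CFE293C0BEC4</RowGuid>
        <AccountId>0</AccountId>
        <AccountNumber>bdb8eb51-d</AccountNumber>
        <AccountTransactions>
            <AccountTransaction>
                <PaymentDate>2012-09-13 22:19:58</PaymentDate>
                <Balance>500</Balance>
            </AccountTransaction>
        </AccountTransactions>
        <Addresses>
            <Address><city>DELHI</city></Address>
            <Address><city>MUMBAI</city></Address>
        </Addresses>
    </Customer>
</Customers>';

SELECT  -- (Type/text())[1] yields NULL for the empty <Type/> element,
        -- whereas Type[1] would yield an empty string.
        cust.c.value('(Type/text())[1]', 'VARCHAR(500)')            AS [Type],
        cust.c.value('(RowGuid/text())[1]', 'UNIQUEIDENTIFIER')     AS RowGuid,
        cust.c.value('(AccountId/text())[1]', 'INT')                AS AccountId,
        cust.c.value('(AccountNumber/text())[1]', 'VARCHAR(500)')   AS AccountNumber,
        tx.t.value('(PaymentDate/text())[1]', 'DATETIME')           AS PaymentDate,
        tx.t.value('(Balance/text())[1]', 'DECIMAL(20, 2)')         AS Balance,
        addr.a.value('(city/text())[1]', 'VARCHAR(500)')            AS [Address]
FROM    @xml.nodes('/Customers/Customer') AS cust(c)
-- OUTER APPLY keeps the customer row even when a child collection is missing.
OUTER APPLY cust.c.nodes('AccountTransactions/AccountTransaction') AS tx(t)
OUTER APPLY cust.c.nodes('Addresses/Address') AS addr(a);
```

Because the child nodes are reached from the customer node itself, no join on RowGuid is needed in this variant; RowGuid is only selected as an output column.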

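Update:

Because the first version of this query (the one using the XML data type methods) has a high query cost, I created another version that uses OPENXML instead of the nodes() and value() methods. The cost difference is large, in favor of the OPENXML approach: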

-- @xml is the variable declared in the first query above.
DECLARE @handle INT

CREATE TABLE #Customer (
    Type VARCHAR(500),
    RowGuid UNIQUEIDENTIFIER,
    AccountId INT,
    AccountNumber VARCHAR(500)
)

CREATE TABLE #AccountTransaction (
    PaymentDate DATETIME,
    Balance DECIMAL(20, 2),
    RowGuid UNIQUEIDENTIFIER
)

CREATE TABLE #Address (
    City VARCHAR(500),
    RowGuid UNIQUEIDENTIFIER
)

EXEC sp_xml_preparedocument @handle OUTPUT, @xml

INSERT  #Customer
SELECT  *
FROM    OPENXML(@handle, '/Customers/Customer', 2)
WITH    (
        Type VARCHAR(500),
        RowGuid UNIQUEIDENTIFIER,
        AccountId INT,
        AccountNumber VARCHAR(500)
)

INSERT  #AccountTransaction
SELECT  *
FROM    OPENXML(@handle, '/Customers/Customer/AccountTransactions/AccountTransaction', 2)
WITH    (
        PaymentDate DATETIME,
        Balance DECIMAL(20, 2),
        RowGuid UNIQUEIDENTIFIER '../../RowGuid[1]'
)

INSERT  #Address
SELECT  *
FROM    OPENXML(@handle, '/Customers/Customer/Addresses/Address', 2)
WITH    (
        city VARCHAR(500),
        RowGuid UNIQUEIDENTIFIER '../../RowGuid[1]'
)

SELECT  a.*,
        b.PaymentDate,
        b.Balance,
        c.City
FROM    #Customer a
LEFT    JOIN #AccountTransaction b ON
        b.RowGuid = a.RowGuid
LEFT    JOIN #Address c ON
        c.RowGuid = a.RowGuid

-- Release the parsed document; it stays in server memory until this is called.
EXEC sp_xml_removedocument @handle
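
The cost comparison above can be checked on your own data rather than taken from plan estimates alone. The following is a minimal sketch, assuming a small stand-in document and comparing only the transaction-level shredding; the sample XML and the choice of SET STATISTICS TIME as the measurement are my assumptions, not part of the original answer, and no timings are claimed here.

```sql
-- Small stand-in document; substitute a realistically sized one for a meaningful comparison.
DECLARE @xml XML =
N'<Customers>
    <Customer>
        <RowGuid>FEFF32BC-1DAB-4F8A-80F0-CFE293C0BEC4</RowGuid>
        <AccountTransactions>
            <AccountTransaction>
                <PaymentDate>2012-09-13 22:19:58</PaymentDate>
                <Balance>500</Balance>
            </AccountTransaction>
        </AccountTransactions>
    </Customer>
</Customers>';
DECLARE @handle INT;

-- Report CPU and elapsed time for each statement in the Messages output.
SET STATISTICS TIME ON;

-- Variant 1: XML data type methods (nodes() and value()).
SELECT  t.value('PaymentDate[1]', 'DATETIME')               AS PaymentDate,
        t.value('Balance[1]', 'DECIMAL(20, 2)')             AS Balance,
        t.value('../../RowGuid[1]', 'UNIQUEIDENTIFIER')     AS RowGuid
FROM    @xml.nodes('/Customers/Customer/AccountTransactions/AccountTransaction') AS tbl(t);

-- Variant 2: OPENXML. The time spent in sp_xml_preparedocument is part of its real cost.
EXEC sp_xml_preparedocument @handle OUTPUT, @xml;

SELECT  PaymentDate, Balance, RowGuid
FROM    OPENXML(@handle, '/Customers/Customer/AccountTransactions/AccountTransaction', 2)
WITH    (
        PaymentDate DATETIME,
        Balance     DECIMAL(20, 2),
        RowGuid     UNIQUEIDENTIFIER '../../RowGuid[1]'
);

EXEC sp_xml_removedocument @handle;

SET STATISTICS TIME OFF;
```

Running both variants in the same batch keeps the input identical, so the reported times reflect the shredding method rather than differences in the data.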
answered 2012-09-14T06:29:31.260