0

我从 <div class="article-content"> 获取所有文本时遇到问题,示例:

// sample html code: <div class="article-content"> 
//<p>text I want text I want text I want text I want <strong> TEXT I WANT TOO </strong></p><p>text I want text I want text I want text I want <strong> TEXT I WANT TOO </strong></p><div>

foreach (HtmlNode link in htmlDocument.DocumentNode.SelectNodes("//div[@class='article-content']/p/text()"))
        {
            if((link.InnerText.Count() == 1) || (link.InnerText.Count() < 250 && link.InnerText[0] == ','))
                test.Text = test.Text + link.InnerText.Trim();              
            else
                test.Text = test.Text + "\n" + link.InnerText.Trim();                              
        }

我希望 textBlock 中的所有文本都以与现场相同的顺序排列,但我真的不知道该怎么做。

4

1 回答 1

0

尝试将您的 XPath 更改为"//div[@class='article-content']/p".

编辑:链接到工作示例

于 2014-05-19T13:01:22.693 回答