c# - 我如何从嵌套中获取文本
使用 html 敏捷包在外部 html 中标记？

Question

我正在尝试从外部站点获取一些文本。我要获取的文本嵌套在段落标记中。div 有一个类值

html代码片段：

<div class="discription"><p>this is the text I want to grab</p></div>

当前的 C# 代码：

public String getDiscription(string url)
{
    var web = new HtmlWeb();
    var doc = web.Load(url);


    var nodes = doc.DocumentNode.SelectNodes("//div[@class='discription']");

    if (nodes != null)
    {
        foreach (var node in nodes)
        {
            string Description = node.InnerHtml;
            return Description;
        }
    } else
      {
       string error = "could not find text";
       return error;
      }
}

我不明白的是 xpath 的语法//div[@class='discription']我知道它是错误的 xpath 应该是什么？

score 0 · Accepted Answer

使用//div[@class='discription']/p.

分解：

//div                    - All div elements
[@class='discription']   - With a class attribute whose value is discription
/p                       - Select the child p elements

c# - 我如何从嵌套中获取文本使用 html 敏捷包在外部 html 中标记？

1 回答 1

Related

Reference

c# - 我如何从嵌套中获取文本
使用 html 敏捷包在外部 html 中标记？