html - 无法从以下 html 中提取链接

问问题 2015-10-13T04:43:48.237

146 次

我需要从某个链接中获取链接以执行爬网，但是无论我重写 xpath 多少次，我都无法从 html 中提取链接。因此，我无法找到从中提取链接的方法。请给我一些建议来解决这个问题。

这是我要从中提取链接的链接的 html 代码：

<div class="" id="subject1" datacallname="主题_同类主题" params="{'catid':'12','sid':'336'}" isload="1" style="">
  <ul class="rail-list">
    <li>
      <cite class="start0" style="height:16px;">
      </cite>
      <a href="http://www.gorate.com.my/item-386.html">the Library&nbsp;@&nbsp;Leisure Ma
      </a>
    </li>

我如何提取链接"//*[@id="subject1"]/ul/li[1]/a/@href"和网站

我要刮掉链接表格：http ://www.gorate.com.my/item-336.html#.Vhx55BOqqkr

html - 无法从以下 html 中提取链接

0 回答 0

Related

Reference