运行这个
hxs.select('//*[@id="column_one"]/h2/following-sibling::div[1]').extract()
这是示例输出
<div class="OneLinkNoTx">
<strong>Location:</strong>
Abu Dhabi, United Arab Emirates
</div>
<div class="OneLinkNoTx">
<strong>Travel Percentage:</strong>
None
</div>
<div align="justify">
Salary: 100k
</div>
我希望输出看起来像这样
<div>
<strong>Location:</strong>
Abu Dhabi, United Arab Emirates
</div>
<div>
<strong>Travel Percentage:</strong>
None
</div>
<div>
Salary: 100k
</div>
我只想拥有没有任何 html 属性的 html 元素。scrapy/xpath 可以吗?