Find centralized, trusted content and collaborate around the technologies you use most.
Teams
Q&A for work
Connect and share knowledge within a single location that is structured and easy to search.
我想关注文本中有 Next 的那些链接,例如
<div id="pagination" <a href="#" > 1 </a> <a href="#" > 2 </a> <a href="#" > 3 </a> <a href="#" > Next </a> </div>
我怎么能做到这一点,我scrapy。它也是最后的选择
创建一个扩展BaseSgmlLinkExtractor并提供process_value可调用的类,如文档中所示
BaseSgmlLinkExtractor
process_value