python - 如何获得 在特定之后出现的
?

这是网页的布局：

`<h2>Featured Ads</h2> <a href=""></a> <h2>Ads</h2> <a href=""></a>`

常规广告中没有任何东西`class`可以用来区分它们。什么是只返回

Question

这是网页的布局：

<h2>Featured Ads</h2>
<a href=""></a>

<h2>Ads</h2>
<a href=""></a>

常规广告中没有任何东西class可以用来区分它们。什么是只返回<a href>之后出现的 's的有效方法<h2>Ads</h2>？

更新：

这是最终代码

h2 = soup.find("h2", text="Ads")
articles = h2.find_next_siblings("article")

for article in articles:
    for div in article.find_all('div', {'class': 'address'}):
        for link in div.find_all('a', href=True):
            print (link['href'])

更新 2：必须重构...

articles = soup.find("h2", text="Ads").find_next_siblings("article")
for article in articles:
    ad_url = article.find('a', href=True)['href']

score 2 · Accepted Answer

找到h2元素并找到下一个a兄弟：

h2 = soup.find("h2", text="Ads")
a = h2.find_next_sibling("a")

python - 如何获得 在特定之后出现的
?

这是网页的布局：

`<h2>Featured Ads</h2> <a href=""></a> <h2>Ads</h2> <a href=""></a>`

常规广告中没有任何东西`class`可以用来区分它们。什么是只返回

?

wordpress中页面的自定义模板

1 回答 1

python - 如何获得在特定之后出现的? 这是网页的布局： <h2>Featured Ads</h2> <a href=""></a> <h2>Ads</h2> <a href=""></a>

?

wordpress中页面的自定义模板

1 回答 1

Related

Reference

python - 如何获得在特定之后出现的
?

这是网页的布局：

`<h2>Featured Ads</h2> <a href=""></a> <h2>Ads</h2> <a href=""></a>`

常规广告中没有任何东西`class`可以用来区分它们。什么是只返回