我决定做这个小项目来学习如何使用机械化。现在它进入urbandictionary,在搜索表单中填写“skid”一词,然后按提交并打印出HTML。
我想要它做的是找到第一个定义并将其打印出来。我该怎么做呢?
到目前为止,这是我的源代码:
import mechanize
br = mechanize.Browser()
page = br.open("http://www.urbandictionary.com/")
br.select_form(nr=0)
br["term"] = "skid"
br.submit()
print br.response().read()
这是存储定义的位置:
<div class="definition">Canadian definition: Commonly used to refer to someone who stopped evolving, and bathing, during the 80's hair band era. Generally can be found wearing AC/DC muscle shirts, leather jackets, and sporting a <a href="/define.php?term=mullet">mullet</a>. The term "skid" is in part derived from "skid row", which is both a band enjoyed by those the term refers to, as well as their address. See also <a href="/define.php?term=white%20trash">white trash</a> and <a href="/define.php?term=trailer%20park%20trash">trailer park trash</a></div><div class="example">The skid next door got drunk and beat up his old lady.</div>
您可以看到它存储在 div 定义中。我知道如何在源代码中搜索 div,但我不知道如何获取标签之间的所有内容然后显示它。