python - 从 URL 字符串解析站点/域名

Question

可能重复：
从 python 中的 URL 获取站点名称

对于这样的 URL：

http://twitter.com/pypi
http://www.wolframalpha.com/input/?i=python

我想拉出'http://twitter.com'或'http://wolframalpha.com'部分。

以下代码有效，但我正在寻找一种更清洁的方法的建议......

'/'.join(url.split('/',3)[:3])

score 3 · Accepted Answer

您可以使用（在 Python 3 之前urllib.parse命名）模块：urlparse

>>> from urllib.parse import urlparse
>>> urlparse("http://twitter.com")
ParseResult(scheme='http', netloc='twitter.com', path='', params='', query='', fragment='')
>>> r = urlparse("http://twitter.com")
>>> r.scheme + '://' + r.netloc
'http://twitter.com'

score 0 · Accepted Answer

另一种（可读性较差）方法urlparse：

>>> from urlparse import urlparse, urlunparse
>>> urlunparse(urlparse("http://twitter.com/pypi")[:2] + ("",) * 4)
'http://twitter.com'

python - 从 URL 字符串解析站点/域名

2 回答 2

Related

Reference