以 IANA 的这种格式为例:http ://www.iana.org/assignments/language-subtag-registry
%%
Type: language
Subtag: aa
Description: Afar
Added: 2005-10-16
%%
Type: language
Subtag: ab
Description: Abkhazian
Added: 2005-10-16
Suppress-Script: Cyrl
%%
Type: language
Subtag: ae
Description: Avestan
Added: 2005-10-16
%%
假设我打开文件:
import urllib
f = urllib.urlopen("http://www.iana.org/assignments/language-subtag-registry")
all=f.read()
通常你会这样做
lan=all.split("%%")
迭代局域网,split("\n")
然后迭代结果和拆分(“:”),有没有办法在没有迭代的情况下在python中批量处理,输出仍然是这样的:
[[["Type","language"],["Subtag", "ae"],...]...]
?