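OK, I've narrowed down my problem with returning Google search results using Python. It isn't a problem with searching or with using the API. It's a question of how to read the results with Python.
Now, would somebody, anybody, please show me code that simply prints the content of a web page the way a browser sees it (words and text; I don't even care if it includes all the HTML and scripts, as long as it does contain the text and URLs from the page).
So, here is the URL of the results page I'm trying to load into Python: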
https://www.google.com/search?tbs=sbi:AMhZZiv5l6a7Y6F6PDOH62elkfPDoGserB70if-j-hUzcB30VB0IOmADts8UKGKr8dRpfXlU3VVMA-TOTbqYHXl4weH3A9fyYrpubR8zXlxSR_1jvfMbrz3nkeHCxTYgoSR34KQmSGYlV6NTXP1ag7QumbRdKqqlWsg6RwZh-v0ogPxPqEn3a_1dWgX_1SisyEKKRAD6K-Zge6NA-i54t0-sNR0lINQ96d_1YfN9aw7KQlZp2bzcw97h8Ckb1XR0-t9B_1WbdTv1A1WDWttcfUMGI3Y5zIgwHoQPpNM6x9zbhgemlabzqbMJzGUezF3KT1KG6AWCRdnlaMWbMb_1m_1T-A_1EHnqDm0Wt3wILSIAHdi_1sfMjSyDN2wNjeFxJ3LsGAZN-sx24MIlOz7nT4wOmXtEeECRIBvuSS4iH7SbeC5gf3Iac41He1k7w_1NTt1_1-R8TsdryCxs0Pysu2DA7WuBMLqHcrovkgoCZLGdXbuWx_1nY4nu39VqBiBIpQOPMfPkTknT_1M0nVgO_1ppmSgikY3fKbIByzz8FOJgUBb0Nu2p_1Zoo-XRprDgOb-UVubVk-CsGwN2dF_1I81J6vFOUI_14bS5nYjZaEHx6-Lxxij84Za-fL5G4XCnZc-0LPyzAmbvckao55OXqMUBgZbx9e7VRg2jLYaxrzFYtiIUqjG9-IqXoi4ZhWTxZuk_1sUkXFO3NNdeuQ_1Ise9lisoBCLyfpBz1Qnh1dvP_1zb0Cor2Vbp9gTr9VjuOxTHjglWJSntw4OLt6jCfdZOve-iuEdh9V5fSPk41WidEfkWTxMo1dfbPxShrGFGF1oEdd6soYCJnp6YjnXYEYGOEMUgdBUXdB1bFbmBop5WUUizhfgiOyTmGMPEyv7FH3MeiQCTf2pab6v1OdE7lRcKbHqQ3_1jcWi0IUTlk3UGnNSdWjS80h_1N4ljbWJ7CTT7RtJKDoL73rk0bKNgtGz5WeChnZ11AOQJEgV6hb9eyw56qIY37nNom3ehQhTuz5bvWtg1tKjySP1ezRU7TS_1ijJWK6m9ke9LkRXqbUVxZaCy0L6nAzIYh0ODOaZBexsCeLXV4N8iCxetcjmpBU7TvsoD319lRUcU-dfPEn4UfEAQTcP7x3WLHa1k5EOAXsYcXyWxk7N5GqcuPYmrYWvh_1V1hXjlcKCArh699NyyPCfaNpYlyS0IHRCz8VmVSXi07MxH9apW_1bFDK-h-zcy0_1qfH6uJoo_1BXp_1C0C8EM11UvczLZyKgX4LeFA7X0jJ57n6rr4Zysd2kEhrhJp6hJDSmkLcsF5UOsJ2i4MotSxrUB-tc5s9teV13Xr2-3f7wTbEqylWnYgwil1bA_10HKWMclDvhWJqsGQRFLu9_1lB-7gIwV0kS8xEfA
And here is the same page fetched through www.hidemyass.com (a web proxy):
http://5.hidemyass.com/ip-1/encoded/Oi8vd3d3Lmdvb2dsZS5jb20vc2VhcmNoP3Ricz1zYmk6QU1oWlppdGw5UzlLXzF1LUFVdGFlUjNxQ3lpbE5TV2VIdVBzWVJhZkRvbS1UaGYyTi1jcmM0bm40T1N3V3R2ZzEtR3owOFZpMnZhbG52NklfMWFDNmQxRWNVZE1pSVZHN3N5TnlzOEttOElmY0NDVmpyOUtueVNPSmpFY015cG1Fckc4OU1DSHdTY0t5eUloNGNEQUpLbkl6a3ZVOWFvZEU0RXdlcTBNdW9mTWZSWU9uTl8xLWloUzBDRkN3QmREOHo4WjBxeFdudWhheXYya21uMU5OdDJEVzJ4Sno5OF8xNzl2QzlaNFFIZ2JveDZFMm9lZHVzSWhiZll5M2JDMGhvdlFsb3laTlhLWkpQLUZ5MkEyMkRtOWVEWnJkM2FQaV8xTi0zRjdVcTFYSnh4Rks5NV8xb1BPanlZNlFWRW1za043M1VFS3dlV0lJU2RfMW1qcERzS1lSQmQ2aC1uRjRFLXJEV2J2Q1Q3NTJGVlBMamFhZDdoZ1BadkdtRTlwbnRDaHk2LWhGQXhqVnUxQlhBQ1ZNUGFYbjRUODJsUDA0ZUV6OEJ1S2k5QktLck41bTZwelRBR3U4TzRRYkU4NDU1VVplbG0xb1ZTYW9LcUgzZ1lOUkhscEpwMEJzT0lBMlgzLTAzbFBwajUxUjR4M3JERDdaNWFpWG5QZUUtUVJiM1ZwZnlaV3JQYW82WDBzVTQ4LTdtWjRNUmgzVXhKVmQ5TDZwQU0tb3RFS0RHRHVVVUcwNVlnc2wyc3ZCd2xPMUZXRm1oWE9hbXBaTXVqaGoyQnl3YUFNcld4cmF0NmtHTHpxRDl4Wmh2cHBRaS1SNHFnYWxTaTJLZ18xLWVwZngydVBqbktMd1FaVktSakM4XzEyajk2MGtqM0F5ZUc0WE1YUU5xb3RvSHczdVpZOFpLaVAyVGxXMkU4UmNGZW9JUU5HWVVVOWk2UVpsTkhvQlFnTWNqNVZucHNFTVpRNE5VSEw3dUtud3JReVNRZV8xZ21nZU01VS1SXzFfMXROSUdhYlo5dHZUdzI5cWZBMnk0VHUyMnZNRVp4WDFDRTNBWVVuM3NkaVlyandLSkFUZ2Q1WlpDM2JjOHdiVk9YSmJXTXM3WFBNUV8xblVqS20xYW1tSVhHY2RYS1lETnQ5d2tHTjBhQ3hjdjBjV25jVE1qbEdBRndlanRfMUVlXzFZNWZub0xReEViSW51Q0tudy1QR1R1ZUZFWlljRmJRQkxUTDlTYnlZR096ZS1IcHVBUndqdEtBTC0wcWVBczVkMG00ZVo5N0ktemFjbGxraWxmUjRTdW5YZktZR210SDAxWXZRbW95dlBTN1NxVHE1dGZqc08tNmZwcjBSOHBvUmI1SDFMeVVjVF8xOGFOV2hoR29kMkpfMVVIbjRzd3VDY28teTROQl8xUU5uT1VsZEhEOE9CVm5LZGl5NjFsZnVhcEVKcHBCV18xcjBYYTJ2cVB0c2dVYUV6dWVQemJQd09hQlgxeHlwVmZiUXh2eDFRdDlKS1M1UVVCRjQzT3VCZWwtb3hieEVyUVVxdW12UWxMbTRIdEJrVGlXdjJZZHBFQjNkSm1TamdIQzdQUkh0aWY2Z0FCS3hrQlJZaFJhejQ5NkZzd1VoS2s5Qm1FR01YQ1djUk9nQ2kySm94b0tVSW5YNnNtb1hvVTVOWXhHeDZ2YWZNbURDemZfMWlXd3p2XzFTUlQwN1hXUnNTWWZZNU9ROUFSbDNzbUpQUFpybVNvb2ZkRWRnTXBMc3pUTS1aYWdlQVotaS1SUDViNHZQWHk0NlhjU1dvdl8xNFFYSTd2c3hVMmJPOVBGUzVfMWZpUklrNThBbFNMZG9uSTB1R2JqQjZaa2tRJmFtcDt1c2VyLWFnZW50PU1vemlsbGEvNS4wJTIwKFdpbmRvd3Ml
MjBOVCUyMDUuMTslMjBydjo2LjAuMiklMjBHZWNrby8yMDEwMDEwMSUyMEZpcmVmb3gvNi4wLjImYW1wO2Nvbm5lY3Rpb249a2VlcC1hbGl2ZQ%3D%3D&f=norefer
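So it is clearly possible for Python to return the search results for this page; I just can't figure out how.
Now, I'm new, so I'm still learning, but so far the only responses I've gotten are that I should use the Google API, or that I should use non-Python methods (including spynner or selenium, which are not standard Python libraries). Surely someone here knows how to do this.
Here is the code I've tried: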
import urllib2

# link holds the Google search URL shown above
req = urllib2.Request(link)
response = urllib2.urlopen(req)
page = response.read()
print(page)

# or through hidemyass.com
# (note: the second argument to Request is sent as POST data)
req = urllib2.Request('http://hidemyass.com/', link)
response = urllib2.urlopen(req)
page = response.read()
print(page)

# BeautifulSoup:
from BeautifulSoup import BeautifulSoup
soup = BeautifulSoup(page)
print(soup)
None of it works :( I don't get the results page, and I don't get the "Best guess for this image" text. I'm at a loss :(
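For reference, here is a minimal standard-library sketch of the two pieces involved: sending a browser-like User-Agent header with the request, and pulling the visible text and link URLs out of the returned HTML. It assumes Python 3, where urllib2 became urllib.request. The names BROWSER_UA, TextAndLinkParser, extract_text_and_links, build_request and fetch_html are made up for illustration. The User-Agent value is the Firefox string that appears to be encoded in the hidemyass.com URL above. Whether Google actually serves the results page to such a request is not guaranteed, since it may block automated clients or build parts of the page with JavaScript.

```python
from html.parser import HTMLParser
from urllib.request import Request, urlopen

# Assumption: a browser-like User-Agent; urllib's default one is often rejected.
BROWSER_UA = "Mozilla/5.0 (Windows NT 5.1; rv:6.0.2) Gecko/20100101 Firefox/6.0.2"


class TextAndLinkParser(HTMLParser):
    """Collect visible text and <a href="..."> values from an HTML document."""

    SKIPPED_TAGS = {"script", "style"}

    def __init__(self):
        super().__init__()
        self.text_parts = []
        self.links = []
        self._skip_depth = 0  # > 0 while inside <script> or <style>

    def handle_starttag(self, tag, attrs):
        if tag in self.SKIPPED_TAGS:
            self._skip_depth += 1
        elif tag == "a":
            href = dict(attrs).get("href")
            if href:
                self.links.append(href)

    def handle_endtag(self, tag):
        if tag in self.SKIPPED_TAGS and self._skip_depth > 0:
            self._skip_depth -= 1

    def handle_data(self, data):
        # Keep only non-empty text that is outside script/style blocks.
        if self._skip_depth == 0 and data.strip():
            self.text_parts.append(data.strip())


def extract_text_and_links(html):
    """Return (visible_text, list_of_hrefs) for an HTML string."""
    parser = TextAndLinkParser()
    parser.feed(html)
    parser.close()
    return " ".join(parser.text_parts), parser.links


def build_request(url):
    """Build a GET request that carries the browser-like User-Agent."""
    return Request(url, headers={"User-Agent": BROWSER_UA})


def fetch_html(url, timeout=10):
    """Download a page and decode it to text (performs a network request)."""
    with urlopen(build_request(url), timeout=timeout) as response:
        charset = response.headers.get_content_charset() or "utf-8"
        return response.read().decode(charset, errors="replace")


if __name__ == "__main__":
    # Offline demonstration on a hand-written HTML snippet.
    sample = (
        "<html><head><script>var x = 1;</script></head><body>"
        '<p>Best guess for this image: <a href="http://example.com/cat">cat</a></p>'
        "</body></html>"
    )
    text, links = extract_text_and_links(sample)
    print(text)
    print(links)
```

With a real page, the call would be extract_text_and_links(fetch_html(link)), where link is the search URL. If the response is an error page or a consent/CAPTCHA page, the parser will simply return the text of that page instead.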