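I am trying to scrape the current loan status from the url column of the data downloaded from Lending Club, for example https://lendingclub.com/browse/loanDetail.action?loan_id=104046830. The page requires a login before the information can be extracted.
I followed the usual steps for creating a login session, but the login does not seem to succeed: the result does not contain the expected page source. Can anyone help me identify the problem?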
import requests
from lxml import html

USERNAME = "username"
PASSWORD = "password"
LOGIN_URL = "https://www.lendingclub.com/auth/login?"
loan_id = 96490539
URL = "https://lendingclub.com/browse/loanDetail.action?loan_id={}".format(loan_id)

def main():
    # One session object so cookies set at login are reused for later requests
    session_requests = requests.session()

    # Get login csrf token
    result = session_requests.get(LOGIN_URL)
    tree = html.fromstring(result.text)
    authenticity_token = tree.xpath("//meta[@name='csrf-token']/@content")[0]

    # Create payload
    payload = {
        "login_email": USERNAME,
        "login_password": PASSWORD,
        "csrf-token": authenticity_token,
    }

    # Perform login
    result = session_requests.post(LOGIN_URL, data=payload, headers=dict(referer=LOGIN_URL))

    # Scrape url
    result = session_requests.get(URL, headers=dict(referer=URL))
    return result
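
To narrow down where this fails, a small diagnostic like the one below might help. It is only a sketch using the standard library, not a confirmed fix. It lists the hidden inputs on the fetched login page, so the payload keys can be compared with what the form actually submits, and it checks whether the final URL is still the login page. The helper names hidden_fields and bounced_to_login are hypothetical, and the default login path "/auth/login" is assumed from LOGIN_URL above. If the site builds its login form with JavaScript, no hidden inputs will show up at all, which would itself be a useful hint.

```python
from html.parser import HTMLParser
from urllib.parse import urlparse


class HiddenInputCollector(HTMLParser):
    """Collect name/value pairs of <input type="hidden"> elements."""

    def __init__(self):
        super().__init__()
        self.fields = {}

    def handle_starttag(self, tag, attrs):
        if tag != "input":
            return
        attr = dict(attrs)
        # Only keep named inputs whose type is "hidden"
        if (attr.get("type") or "").lower() == "hidden" and attr.get("name"):
            self.fields[attr["name"]] = attr.get("value") or ""


def hidden_fields(page_html):
    """Return a dict of hidden form fields found in the given HTML text."""
    collector = HiddenInputCollector()
    collector.feed(page_html)
    return collector.fields


def bounced_to_login(final_url, login_path="/auth/login"):
    """Return True if the final URL (after redirects) is still the login page.

    login_path is an assumption taken from LOGIN_URL in the question.
    """
    return urlparse(final_url).path.rstrip("/") == login_path
```

With the session code above, hidden_fields(result.text) can be called on the response from the first GET, and bounced_to_login(result.url) on the response from the last GET. In requests, result.url is the URL after redirects and result.history lists the intermediate redirect responses.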