
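I am trying to write Python code that does the following:

1) Log in to OKCupid

2) Go to a user's questions page

3) Answer an unanswered question.

I am using RoboBrowser for this. I can do steps 1) and 2), and I can get the form for the question I want to submit, but once I submit it (using RoboBrowser's submit_form), it does not seem to go through to OKCupid: the question is not registered as answered on my profile.

Here is my code. Note that My_RoboBrowser exists only so that I can pass False as the verify argument when opening a page.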

from robobrowser import RoboBrowser

class My_RoboBrowser(RoboBrowser):
    def __init__(self, auth=None, parser=None, headers=None, user_agent=None, history=True):
        # Forward parser, user_agent and history instead of discarding them.
        RoboBrowser.__init__(self, parser=parser, user_agent=user_agent, history=history)

    def Open(self, vURL, vVerify=True):
        # Like RoboBrowser.open, but exposes the verify flag of requests.
        response = self.session.get(vURL, verify=vVerify)
        self._update_state(response)

browser = My_RoboBrowser()
url = 'https://okcupid.com/login'

browser.open(url)

form = browser.get_form(id='loginbox_form')

form['username'] = 'Username'
form['password'] = 'Password'
browser.submit_form(form)

urlQ = 'https://www.okcupid.com/profile/USER/questions?low=1'
browser.open(urlQ)

Question_Tag = browser.find_all(class_="not_answered")[0]

ID = Question_Tag.get('data-qid')

#Get the form to fill out
Form = browser.get_form(id='answer_'+str(ID))
Form['my_answer'].value = '1'
Form['their_answer'].value = ['1']
Form['importance'].value = '1'
browser.submit_form(Form)

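Also, in case it helps: when I look at the form object Form in an IPython notebook, it shows

<RoboForm my_answer=, their_answer=[], importance=>

before submitting and

<RoboForm my_answer=1, their_answer=['1'], importance=1>

after.

Finally, in case it helps, here is the markup for one of the forms I am trying to answer (obtained via Inspect Element):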

<form id="answer_179268" name="answer_179268" class="answer_area okform initialized"> 
<div class="container my_answer">  
   <input id="my_answer_1_179268" name="my_answer" value="1" false="" type="radio"> 
   <label class="radio" for="my_answer_1_179268">
      <span class="icon"></span>
         Yes
      </label>  
      <input id="my_answer_2_179268" name="my_answer" value="2" false="" type="radio"></input> 
   <label class="radio" for="my_answer_2_179268">
      <span class="icon"></span>
      No
      </label>  
   </div> 
<div class="container acceptable_answers">  
   <div class="title"> 
      <p>Answer(s) you’ll accept</p> 
   </div>   
   <label class="checkbox acceptable_answer" for="their_answer_1_179268">
      <input id="their_answer_1_179268" class="acceptable_answer" name="their_answer" value="1" false="" type="checkbox"></input>
      <span class="icon"></span>
       Yes
       </label>   

<label class="checkbox acceptable_answer" for="their_answer_2_179268">
   <input id="their_answer_2_179268" class="acceptable_answer" name="their_answer" value="2" false="" type="checkbox"></input>
      <span class="icon"></span>
       No
      </label>    
<label class="checkbox irrelevant" for="their_answer_any_179268">
   <input id="their_answer_any_179268" class="irrelevant" name="their_answer" value="irrelevant" type="checkbox"></input>
      <span class="icon"></span>
      Any of the above
      </label> 
   </div> 
<div class="container importance"> 
   <div class="title"> 
      <p>Importance</p> 
   </div> 
<div class="importance_radios">  
   <input id="importance_179268_5" name="importance" value="5" false="" type="radio"></input>
   <label class="importance_5 radio" for="importance_179268_5" data-count="5">
      <span class="icon"></span> 
      <div class="bar"></div> 
      <span class="label"></span> 
   </label>  
      <input id="importance_179268_4" name="importance" value="4" false="" type="radio"></input>
   <label class="importance_4 radio" for="importance_179268_4" data-count="4">
      <span class="icon"></span> 
   <div class="bar"></div> 
      <span class="label">A little</span> 
   </label>  
      <input id="importance_179268_3" name="importance" value="3" false="" type="radio"></input>
   <label class="importance_3 radio" for="importance_179268_3" data-count="3">
      <span class="icon"></span> 
   <div class="bar"></div> 
      <span class="label">Somewhat</span> 
</label>  
   <input id="importance_179268_2" name="importance" value="2" false="" type="radio"> </input>
<label class="importance_2 radio" for="importance_179268_2" data-count="2">
      <span class="icon"></span> 
   <div class="bar"></div> 
      <span class="label"></span> 
</label>  
      <input id="importance_179268_1" name="importance" value="1" false="" type="radio"> </input>
   <label class="importance_1 radio" for="importance_179268_1" data-count="1">
      <span class="icon"></span> 
   <div class="bar"></div> 
      <span class="label">Very</span> 
</label>  
</div> 
   <div class="irrelevant_message"> 
   <span class="irrelevant_text">Irrelevant</span> 
      <span class="message_text">(Because you’ll accept any answer, this question is marked irrelevant)</span> 
   </div> 
</div> 
<div id="explanation_container_179268" class="container explanation"> 
   <div id="answer_179268_explanationContainer" class="inputcontainer textarea noresize empty">
      <textarea id="answer_179268_explanation" class="noresize" placeholder="Explain your answer (optional)" false=""></textarea>
    <span class="message empty" style="height: 0"></span>
   <div class="icon"></div>
</div> 
</div>  
<button id="submit_btn_179268" class="submit_btn flatbutton disabled small">Answer</button>
<button id="cancel_btn_179268" class="cancel_btn flatbutton silver small">Cancel</button> 
   <a class="skip_btn inner" href="javascript:void(0)" draggable="false">Skip question</a>  
   <div id="public_container_179268" class="answer_privately">  
   <label class="checkbox" for="private_179268">
   <input id="private_179268" false="" type="checkbox"></input>
      <span class="icon"></span> 
       <span class="text">Answer privately</span> 
   </label> 
   </div>  
</form>

2 Answers


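I've seen several cases where someone couldn't submit a form because the site uses JavaScript to submit it. In other words, they could log in and fill out the form with RoboBrowser or Mechanize, but the form itself would not submit, because submission depends on JS. That may be your problem. If so, you should try Selenium.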
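As a rough sketch of what the Selenium route could look like: the element IDs below follow the pattern visible in the form markup from the question (`my_answer_<value>_<qid>`, `their_answer_<value>_<qid>`, `importance_<qid>_<value>`, `submit_btn_<qid>`). The helper names `answer_selectors` and `answer_question` are made up for illustration, and clicking the labels rather than the inputs is an assumption, made because the inputs look like they are hidden behind styled icons. `driver` is assumed to be a Selenium WebDriver that is already logged in and showing the questions page.

```python
# String value of selenium's By.CSS_SELECTOR, so this sketch needs no import.
CSS = "css selector"


def answer_selectors(qid, my_answer, accepted, importance):
    """Build the CSS selectors to click, in click order (hypothetical helper)."""
    selectors = [f"label[for='my_answer_{my_answer}_{qid}']"]
    selectors += [f"label[for='their_answer_{a}_{qid}']" for a in accepted]
    selectors.append(f"label[for='importance_{qid}_{importance}']")
    # The submit button is located by its id, as seen in the question's markup.
    selectors.append(f"#submit_btn_{qid}")
    return selectors


def answer_question(driver, qid, my_answer, accepted, importance):
    """Click through one question form with a WebDriver-like object."""
    for selector in answer_selectors(qid, my_answer, accepted, importance):
        driver.find_element(CSS, selector).click()
```

With Selenium installed, `driver` would be something like `webdriver.Firefox()`, and the call would be `answer_question(driver, "179268", "1", ["1"], "1")`. On a real page you would probably also need explicit waits, since the submit button starts out with a `disabled` class.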

You can probably verify whether JS is used for the submission in your browser's inspector: press Ctrl+Shift+I, open the Network tab, clear the network panel before clicking submit, click submit, and then check the type of your POST request.
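A complementary offline check, as a minimal sketch: the form markup in the question has no `action` or `method` attribute, which is consistent with a script sending the data rather than a plain form POST. The snippet below uses only the standard library to report this for a trimmed copy of that markup. `FormInspector` and `likely_js_submitted` are illustrative names, and the heuristic is only a hint, not proof.

```python
from html.parser import HTMLParser


class FormInspector(HTMLParser):
    """Collect each <form>'s id, action, method and its named fields."""

    def __init__(self):
        super().__init__()
        self.forms = []
        self._current = None  # the form we are currently inside, if any

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        if tag == "form":
            self._current = {"id": attrs.get("id"),
                             "action": attrs.get("action"),
                             "method": attrs.get("method"),
                             "fields": []}
            self.forms.append(self._current)
        elif tag in ("input", "textarea", "select") and self._current:
            name = attrs.get("name")
            if name and name not in self._current["fields"]:
                self._current["fields"].append(name)

    def handle_endtag(self, tag):
        if tag == "form":
            self._current = None


def likely_js_submitted(form):
    """Heuristic: no action and no method suggests a script sends the data."""
    return form["action"] is None and form["method"] is None


# Trimmed copy of the form from the question.
SAMPLE = """
<form id="answer_179268" name="answer_179268" class="answer_area okform initialized">
  <input id="my_answer_1_179268" name="my_answer" value="1" type="radio">
  <input id="their_answer_1_179268" name="their_answer" value="1" type="checkbox">
  <input id="importance_179268_1" name="importance" value="1" type="radio">
  <button id="submit_btn_179268" class="submit_btn flatbutton disabled small">Answer</button>
</form>
"""

inspector = FormInspector()
inspector.feed(SAMPLE)
form = inspector.forms[0]
print(form["id"], form["fields"], likely_js_submitted(form))
```

The same check can be run on the page source that RoboBrowser fetched, which also shows whether the form exists at all before any JavaScript runs.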

I think that's how I would verify it, but others here will know better than I do. Good luck!

Answered 2014-12-27T18:54:02.713

You should take a look at https://github.com/IvanMalison/okcupyd. It lets you do this without using a browser.

Answered 2015-04-19T20:33:21.790