2

我想用 java 登录到互联网上的 https 站点,然后阅读一些信息。我已经用萤火虫查看了标题,但是我无法做到...

萤火虫告诉:

https://service.example.net/xxx/unternehmer/login.html?login=Anmelden&loginname=xxx&password=xxx&sessionid=&sprache=de

然后我想浏览这个网站:

https://service.example.net/xxx/unternehmer/ausgabe.html?code=PORTAL;sessionid=03112010150442

我怎么能用java做到这一点?我已经尝试过类似的东西:

import java.net.*;
import java.io.*;
import java.security.*;
import javax.net.ssl.*;

public class HTTPSClient {

  public static void main(String[] args) {
    int port = 443; // default https port
    String host = "service.example.net";
    try {
      SSLSocketFactory factory = (SSLSocketFactory) SSLSocketFactory.getDefault();

      SSLSocket socket = (SSLSocket) factory.createSocket(host, port);

      // enable all the suites
      String[] supported = socket.getSupportedCipherSuites();
      socket.setEnabledCipherSuites(supported);


      Writer out = new OutputStreamWriter(socket.getOutputStream());
      // https requires the full URL in the GET line
      out.write("POST https://" + host + "//xxx/unternehmer/login.html?login=Anmelden&loginname=xxx&password=xxx&sessionid=&sprache=de HTTP/1.1\r\n");
      out.write("Host: " + host + "\r\n");
      out.write("\r\n");
      out.flush();

      // read response
      BufferedReader in = new BufferedReader(
        new InputStreamReader(socket.getInputStream()));

      // read the header
      String s;
      while (!(s = in.readLine()).equals("")) {
          System.out.println(s);
      }
      System.out.println();

      // read the length
      String contentLength = in.readLine();
      int length = Integer.MAX_VALUE;
      try {
        length = Integer.parseInt(contentLength.trim(), 16);
      }
      catch (NumberFormatException ex) {
        // This server doesn't send the content-length
        // in the first line of the response body
      }
      System.out.println(contentLength);

      int c;
      int i = 0;
      while ((c = in.read()) != -1 && i++ < length) {
        System.out.write(c);
      }

      System.out.println("1.part done");

      out.close();
      in.close();
      socket.close();

    }
    catch (IOException ex) {
      System.err.println(ex);
    }

  }

}

不幸的是,这对登录不起作用....而且我也不知道从哪里获得这个 sessionid ...每次都是不同的。我希望你能帮助我。ps:我用xxx替换了一些相关信息

4

2 回答 2

4

问题解决了 :)

首先,我从 apache 添加了库:

  1. httpclient
  2. commons-httpclient
  3. 公共编解码器
  4. 公共记录

然后我结合了几个教程。

我的代码:

import java.io.BufferedWriter;
import java.io.FileWriter;
import org.apache.commons.httpclient.Header;
import org.apache.commons.httpclient.HttpClient;
import org.apache.commons.httpclient.methods.GetMethod;
import org.apache.commons.httpclient.methods.PostMethod;
import org.apache.http.client.params.CookiePolicy;

  public class Test {

     public static final String TARGET_HTTPS_SERVER = "www.example.net"; 
     public static final int    TARGET_HTTPS_PORT   = 443; 

     public static void main(String[] args) throws Exception {

         HttpClient httpClient = new HttpClient();
         httpClient.getParams().setCookiePolicy(CookiePolicy.BROWSER_COMPATIBILITY);

         PostMethod post = new PostMethod("https://www.example.com/login.html");
         post.setRequestHeader(new Header(
                 "User-Agent", "Mozilla/5.0 /Windows; U; Windows NT 4.1; de; rv:1.9.1.5) Gecko/20091102 Firefox/3.0"));

         post.addParameter("login", "true");
         post.addParameter("username", "xxx");
         post.addParameter("password", "xxx");
         post.addParameter("language", "de");
         httpClient.executeMethod(post);


         System.out.println(post.getResponseBodyAsString());
         String body=post.getResponseBodyAsString();
//Get the session id by parsing the code, i know this is not pretty
             String sessionid=body.substring(body.indexOf("session")+10,body.indexOf("session")+10+14);
             System.out.print(sessionid);


             GetMethod get=new GetMethod("https://www.example.com/thesiteyouwannabrowse?sessionid="+sessionid);

         get.setRequestHeader(new Header(
             "User-Agent", "Mozilla/5.0 /Windows; U; Windows NT 4.1; de; rv:1.9.1.5) Gecko/20091102 Firefox/3.0"));
         httpClient.executeMethod(get);

         System.out.println(get.getResponseBodyAsString());
         //write it into a file
         try{
                // Create file 
                FileWriter fstream = new FileWriter("file.html");
                    BufferedWriter out = new BufferedWriter(fstream);
                out.write(get.getResponseBodyAsString());
                //Close the output stream
                out.close();
                }catch (Exception e){//Catch exception if any
                  System.err.println("Error: " + e.getMessage());
             }     
         post.releaseConnection();
     }
  }
于 2010-11-09T08:37:21.817 回答
3

我自己也做过类似的事情。我使用这种“手动”方法让它工作,但这很麻烦,尤其是在 cookie 管理方面。

我建议你看看Apache HttpClient library。(当我意识到使用这个库是多么容易时,我扔掉了我的代码。)

正如 org.life.java 所指出的,这里http://hc.apache.org/httpclient-3.x/sslguide.html是关于如何使用这个库开始使用 SSL 的一个很好的方法。

于 2010-11-03T13:54:50.053 回答