我正在尝试获取此页面(它是中文的,对此感到抱歉):
亚马逊(dot)cn/s?rh=n:663227051
使用以下代码:
import java.io.BufferedReader;
import java.io.FileWriter;
import java.io.IOException;
import java.io.InputStreamReader;
import java.net.URL;
import java.net.URLConnection;
public class Application {
public static void main(String[] args) throws IOException, InterruptedException {
final URL url = new URL("http://www.amazon.cn/s?rh=n:663227051");
final String agentString = "Mozilla/5.0 (Windows; U; Windows NT 6.1; zh-CN; rv:1.9.1.5) Gecko/20091102 Firefox/3.5.5 (.NET CLR 3.5.30729)";
URLConnection urlConnection = url.openConnection();
urlConnection.setRequestProperty("User-Agent", agentString);
InputStreamReader streamReader = new InputStreamReader(urlConnection.getInputStream());
BufferedReader reader = new BufferedReader(streamReader);
final String path = "d:\\desktop\\Test.html";
FileWriter writer = new FileWriter(path);
writer.write("");
String line;
while ((line = reader.readLine()) != null)
writer.append(line).append(System.getProperty("line.separator"));
writer.close();
}
}
但是在多次运行这段代码之后,我发现我随机得到了两个不同的结果(见截图http://www.flickr.com/photos/31629891@N07/4173636464/)
无论我在浏览器中刷新此页面多少次,它都会返回相同的结果。
我想知道为什么会这样?