用Java和Python从URL读取HTML

String path = "https://html1-f.scribdassets.com/913q5pjrsw60h9i4/pages/106-6b1bd15200.jsonp"; URL url = new URL(path); InputStream in = url.openStream(); BufferedReader bw = new BufferedReader(new InputStreamReader(in, "UTF-8"); String line; while ((line = bw.readLine()) != null) { System.out.println(line); }

�ĘY106-6b1bd15200.jsonpmP�r� �Ƨ�!�%m�vD"��Ra*��w�%��ݳ�sβ��MK�d�9+%�m��l^��މ��:�� 8B�Vce�.A*��x$FCo��a�b�<��Xy��m�c�>t�� Z��Gx�o� �J��oKe�0�5�kGYpb�*l��+|�U��-�N3��jBp�R�z5Cۥjh��o�;�~)��~��)~ɮhy��<c,=;tHW��'�c�=~�w��

window.page106_callback(["<div class=\"newpage\" id=\"page106\" style=\"width: 902px; height:1273px\">\n<div class=image_layer style=\"z-index: 1\">\n<div class=ie_fix>\n<img class=\"absimg\" style=\"left:18px;top:27px;width:860px;height:1077px;clip:rect(1px 859px 1076px 1px)\" orig=\"http://html.scribd.com/913q5pjrsw60h9i4/images/106-6b1bd15200.jpg\"/>\n</div>\n</div>\n</div>\n\n"]);

2条回答

网友

1楼 · 编辑于 2024-10-01 07:43:46

@Maurice Perry是对的，我试过下面的代码

String url = "https://html1-f.scribdassets.com/913q5pjrsw60h9i4/pages/106-6b1bd15200.jsonp";

URL obj = new URL(url);
HttpURLConnection con = (HttpURLConnection) obj.openConnection();

BufferedReader in = new BufferedReader(
        new InputStreamReader(new GZIPInputStream(con.getInputStream())));
String inputLine;
StringBuffer response = new StringBuffer();

while ((inputLine = in.readLine()) != null) {
    response.append(inputLine);
}
in.close();

System.out.println(response.toString());

网友

2楼 · 编辑于 2024-10-01 07:43:46

响应是gzip编码的。你可以做：

        InputStream in = new GZIPInputStream(con.getInputStream());

相关问题更多 >

编程相关推荐

热门问题

热门文章