Python3 webscraping 403错误未使用Reques修复 - 问答 - Python中文网

Python3 webscraping 403错误未使用Reques修复

2024-05-08 22:38:42 发布

您现在位置：Python中文网/ 问答频道 /正文

男 | 程序猿一只，喜欢编程写python代码。

我已经通读了以前的问题，关于在尝试刮取具有安全性的页面时出现403错误。大多数人建议使用“请求”来设置用户代理值。我已经在我想要刮取的页面上尝试了这个方法，虽然它消除了错误消息，但脚本现在只是挂起（我已经尝试等待大约10-15秒，等待第二个print语句生成任何结果）

有什么想法吗

from urllib.request import Request, urlopen

from bs4 import BeautifulSoup

print('Example: Search White Pages Smith in Frenchs Forest area:')

req = Request('https://www.whitepages.com.au/residential/results?name=Smith&location=Frenchs%20Forest', headers={'User-Agent': 'Mozilla/5.0'})

html = urlopen(req)

bs = BeautifulSoup(html.read(), 'html.parser')

print(bs.h1)

Tags：用户 from import 代理 bs request html 错误

0条回答

目前没有回答

相关问题更多 >

编程相关推荐

热门问题

热门文章