https://www.realestate.com.au/ 不允许刮网？

2024-09-21 02:34:48 发布

男 | 程序猿一只，喜欢编程写python代码。

我正在尝试从https://www.realestate.com.au/中提取数据首先，我根据要查找的属性类型创建url，然后使用SeleniumWebDriver打开url，但页面是空白的！知道为什么会这样吗？是因为这个网站不提供网页抓取权限吗？有什么办法可以删除这个网站吗

这是我的密码：

from selenium import webdriver
from bs4 import BeautifulSoup
import time

PostCode = "2153"
propertyType = "house"
minBedrooms = "3"
maxBedrooms = "4"
page = "1"

url = "https://www.realestate.com.au/sold/property-{p}-with-{mib}-bedrooms-in-{po}/list-{pa}?maxBeds={mab}&includeSurrounding=false".format(p = propertyType, mib = minBedrooms, po = PostCode, pa = page, mab = maxBedrooms)
print(url)
# url should be "https://www.realestate.com.au/sold/property-house-with-3-bedrooms-in-2153/list-1?maxBeds=4&includeSurrounding=false"

driver = webdriver.Edge("./msedgedriver.exe") # edit the address to where your driver is located
driver.get(url)
time.sleep(3)

src = driver.page_source
soup = BeautifulSoup(src, 'html.parser')
print(soup)

Tags： from https import com url time 网站 www

1条回答

网友

1楼 · 发布于 2024-09-21 02:34:48

您传递的链接不正确，请尝试

driver.get("your link")

api-https://selenium-python.readthedocs.io/api.html?highlight=get#:~:text=ef_driver.get(%22http%3A//www.google.co.in/%22)

https://www.realestate.com.au/ 不允许刮网？

相关问题更多 >

编程相关推荐

热门问题

热门文章

https://www.realestate.com.au/ 不允许刮网？

相关问题 更多 >

编程相关推荐

热门问题

热门文章

相关问题更多 >