How can I extract data from these JavaScript tables using Selenium and Python?

from urllib.request import urlopen
from bs4 import BeautifulSoup
from selenium import webdriver

url = 'https://www.mcmaster.com/cam-lock-fittings/material~aluminum/'

options = webdriver.ChromeOptions()
options.add_experimental_option('excludeSwitches', ['enable-logging'])
driver = webdriver.Chrome(executable_path='C:/Users/Brian Knoll/Desktop/chromedriver.exe', options=options)

driver.get(url)
html = driver.execute_script("return document.documentElement.outerHTML")
driver.close()

filename = "McMaster Text.txt"
fo = open(filename, "w")
fo.write(html)
fo.close()

1 answer

User

#1 · Posted 2024-04-27 09:40:14

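I think you need to wait until the table you are looking for has loaded.
To do this, add the following line, which waits up to 10 seconds for the table container to appear in the DOM before you start scraping: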

fullLoad = WebDriverWait(driver, 10).until(EC.presence_of_element_located((By.XPATH, "//div[contains(@class, 'ItmTblCntnr')]")))
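
To make it clearer what "wait up to 10 seconds" means here, this is a rough, pure-Python sketch of the polling idea behind an explicit wait. It is not Selenium's actual implementation; wait_until, condition and poll_interval are names made up for this illustration. The point is that the wait returns as soon as the condition holds, and only fails once the timeout has passed.

```python
import time


def wait_until(condition, timeout=10.0, poll_interval=0.5):
    """Call condition() repeatedly until it returns something truthy.

    Hypothetical helper for illustration only: it mimics the general
    behaviour of an explicit wait (poll, return early on success,
    raise on timeout), not Selenium's real code.
    """
    deadline = time.monotonic() + timeout
    while True:
        result = condition()
        if result:
            # Success: hand back whatever the condition produced.
            return result
        if time.monotonic() >= deadline:
            raise TimeoutError(f"condition not met within {timeout} seconds")
        time.sleep(poll_interval)
```

With Selenium itself, the condition is the EC.presence_of_element_located(...) object and the value returned is the located element, so fullLoad ends up holding the table container.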

Here is the complete code:

import os  # needed for os.path.abspath below
from urllib.request import urlopen
from bs4 import BeautifulSoup
from selenium import webdriver
from selenium.webdriver.support.ui import WebDriverWait
from selenium.webdriver.support import expected_conditions as EC
from selenium.webdriver.common.by import By

url = 'https://www.mcmaster.com/cam-lock-fittings/material~aluminum/'

options = webdriver.ChromeOptions()
options.add_experimental_option('excludeSwitches', ['enable-logging'])
# Note: executable_path is deprecated in Selenium 4; newer releases expect a Service object instead.
driver = webdriver.Chrome(executable_path=os.path.abspath("chromedriver"), options=options)

driver.get(url)
# Wait up to 10 seconds for the table container to be present in the DOM.
fullLoad = WebDriverWait(driver, 10).until(EC.presence_of_element_located((By.XPATH, "//div[contains(@class, 'ItmTblCntnr')]")))

# Grab the rendered HTML after the JavaScript has built the tables.
html = driver.execute_script("return document.documentElement.outerHTML")
driver.close()

# Write as UTF-8 so non-ASCII characters in the page do not break the write on Windows.
filename = "McMaster Text.txt"
with open(filename, "w", encoding="utf-8") as fo:
    fo.write(html)
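
Once the rendered HTML is saved, you still have to pull the rows out of it. Below is a minimal sketch using only the standard library, assuming the data sits in ordinary table/tr/td markup (I have not checked that this is how the McMaster page is built, and nested tables are not handled). TableRowParser and extract_rows are hypothetical names; BeautifulSoup, which the script already imports, would do the same job.

```python
from html.parser import HTMLParser


class TableRowParser(HTMLParser):
    """Collect the text of every <td>/<th> cell, grouped by <tr> row."""

    def __init__(self):
        super().__init__()
        self.rows = []     # finished rows, each a list of cell strings
        self._row = None   # cells of the row currently being read
        self._cell = None  # text fragments of the cell currently being read

    def handle_starttag(self, tag, attrs):
        if tag == "tr":
            self._row = []
            self._cell = None
        elif tag in ("td", "th") and self._row is not None:
            self._cell = []

    def handle_data(self, data):
        if self._cell is not None:
            self._cell.append(data)

    def handle_endtag(self, tag):
        if tag in ("td", "th") and self._cell is not None:
            # Join the fragments and collapse runs of whitespace.
            self._row.append(" ".join("".join(self._cell).split()))
            self._cell = None
        elif tag == "tr" and self._row is not None:
            if self._row:
                self.rows.append(self._row)
            self._row = None
            self._cell = None


def extract_rows(html):
    """Return all table rows found in html as lists of cell strings."""
    parser = TableRowParser()
    parser.feed(html)
    parser.close()
    return parser.rows
```

You can call extract_rows(html) directly on the string returned by driver.execute_script, or on the contents of the saved text file.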
