试图找到一种方法从好书页中摘录这本书的摘要。曾尝试过靓汤/硒,但不幸无效
链接:https://www.goodreads.com/book/show/67896.Tao_Te_Ching?from_search=true&from_srp=真&;qid=D19IQ7Kwi&;排名=1
代码:
from selenium import webdriver
from selenium.webdriver.common.keys import Keys
from selenium.webdriver.support.ui import WebDriverWait
from selenium.webdriver.common.by import By
from selenium.webdriver.support import expected_conditions as EC
from bs4 import BeautifulSoup
import requests
link='https://www.goodreads.com/book/show/67896.Tao_Te_Ching?from_search=true&from_srp=true&qid=D19iQu7KWI&rank=1'
driver.get(link)
Description=driver.find_element_by_xpath("//div[contains(text(),'TextContainer')]")
#first TextContainer contains the sumary of the book
book_page = requests.get(link)
soup = BeautifulSoup(book_page.text, "html.parser")
print(soup)
Container = soup.find('class', class_='leftContainer')
print(Container)
错误:
container is empty +
NoSuchElementException: no such element: Unable to locate element: {"method":"xpath","selector":"//div[contains(text(),'TextContainer')]"} (Session info: chrome=83.0.4103.116)
你可以这样得到描述
我使用了CSS Selector 来获取包含完整描述的特定隐藏
span
。我还使用了一个explicit wait来给元素加载时间相关问题 更多 >
编程相关推荐