如何使用selenium从A标记中获取文本？

from selenium import webdriver from selenium.webdriver.common.keys import Keys import pandas as pd PATH = "C:\Program Files (x86)\chromedriver.exe" driver = webdriver.Chrome(PATH) driver.get("https://egypt.souq.com") dotd = "/html/body/div[2]/div/main/div[1]/div[1]/div/div[1]/a/img" driver.find_element_by_xpath(dotd).click() def get_deals(): title_xpath = "/html/body/div[1]/div/main/div/div[4]/div[3]/div[2]/div[1]/div[1]/div/div[2]/ul/li[1]/h6/span/a" titles = driver.find_elements_by_xpath(title_xpath) for title in titles: print(title) get_deals() print("successful")

2条回答

网友

1楼 · 编辑于 2024-09-30 06:30:24

来自print()的此输出

<selenium.webdriver.remote.webelement.WebElement (session="48e7924c296324a7a5a843d9ccab36fb", element="b8871651-23af-42c6-a49a-5b93fe932653")>

…除了WebElement本身没有任何错误

看来你很接近。由于您能够提取元素，要提取元素中的文本，您可以使用以下任一Locator Strategies：

使用文本属性：

for title in titles:
    print(title.text)

使用get_attribute()属性：

for title in titles:
    print(title.get_attribute("innerHTML"))

参考文献

您可以在以下内容中找到一些相关讨论：

Difference between text and innerHTML using Selenium

网友

2楼 · 编辑于 2024-09-30 06:30:24

问题是您正在打印包含所有属性的对象，而不是文本属性

因此，唯一需要更改的是，不要使用print(title)，而是使用print(title.text)

参考文献

相关问题更多 >

编程相关推荐

热门问题

热门文章