如何使用seleniumpython逐个点击获取网站数据

driver.find_elements_by_xpath("//div[@class='.l-srp__results.flex__item']") driver.find_element_by_css_selector('a').get_attribute('href') for matches in driver: print('Liking') print (matches) #matches.click() time.sleep(5)

2条回答

网友
1楼 · 编辑于 2024-09-28 01:23:24

我认为您应该收集列表中所有标记名为“a”且“href”属性不为空的元素。
然后遍历列表并逐个单击元素。
创建WebElement类型的列表并存储所有有效链接。
在这里，您可以应用更多的过滤器或条件，即链接包含一些字符或其他一些条件。
要在列表中存储WebElement，可以使用驱动程序.findEelements（）此方法将返回WebElement类型的列表。你知道吗

网友
2楼 · 编辑于 2024-09-28 01:23:24

硒不是一个好主意，网页刮。我建议您使用JMeter，它是免费的、开源的。你知道吗
http://www.testautomationguru.com/jmeter-how-to-do-web-scraping/
如果您想使用selenium，那么您尝试采用的方法并不是一种稳定的方法—单击并获取数据。相反，我建议你遵循这个-类似的东西在这里。这个例子是用java编写的。但你可以理解。你知道吗
driver.get("https://www.yahoo.com"); Map<Integer, List<String>> map = driver.findElements(By.xpath("//*[@href]")) .stream() // find all elements which has href attribute & process one by one .map(ele -> ele.getAttribute("href")) // get the value of href .map(String::trim) // trim the text .distinct() // there could be duplicate links , so find unique .collect(Collectors.groupingBy(LinkUtil::getResponseCode)); // group the links based on the response code
更多信息在这里。你知道吗
http://www.testautomationguru.com/selenium-webdriver-how-to-find-broken-links-on-a-page/

相关问题更多 >

编程相关推荐

热门问题

热门文章