在抓取之前，如何使用selenium从一个url选项卡切换到另一个url选项卡？

#Open Webpage url = "https://www.website.com" driver=webdriver.Chrome(executable_path=r"C:\mypathto\chromedriver.exe") driver.get(url) #Click Necessary Parameters driver.find_element_by_partial_link_text('Output').click() driver.find_element_by_xpath('//*[@id="flexOpt"]/table/tbody/tr/td[2]/input[3]').click() driver.find_element_by_xpath('//*[@id="flexOpt"]/table/tbody/tr/td[2]/input[4]').click() driver.find_element_by_xpath('//*[@id="repOpt"]/table[2]/tbody/tr/td[2]/input[4]').click() time.sleep(2) driver.find_element_by_partial_link_text('Dates').click() driver.find_element_by_xpath('//*[@id="RangeOption"]').click() driver.find_element_by_xpath('//*[@id="Range"]/table/tbody/tr[1]/td[2]/select/option[2]').click() driver.find_element_by_xpath('//*[@id="Range"]/table/tbody/tr[1]/td[3]/select/option[1]').click() driver.find_element_by_xpath('//*[@id="Range"]/table/tbody/tr[1]/td[4]/select/option[1]').click() driver.find_element_by_xpath('//*[@id="Range"]/table/tbody/tr[2]/td[2]/select/option[2]').click() driver.find_element_by_xpath('//*[@id="Range"]/table/tbody/tr[2]/td[3]/select/option[31]').click() driver.find_element_by_xpath('//*[@id="Range"]/table/tbody/tr[2]/td[4]/select/option[1]').click() time.sleep(2) driver.find_element_by_partial_link_text('Groupings').click() driver.find_element_by_xpath('//*[@id="availFld_DATE"]/a/img').click() driver.find_element_by_xpath('//*[@id="availFld_LOCID"]/a/img').click() driver.find_element_by_xpath('//*[@id="availFld_STATE"]/a/img').click() driver.find_element_by_xpath('//*[@id="availFld_DDSO_SA"]/a/img').click() driver.find_element_by_xpath('//*[@id="availFld_CLASS_ID"]/a/img').click() driver.find_element_by_xpath('//*[@id="availFld_REGION"]/a/img').click() time.sleep(2) driver.find_element_by_partial_link_text('Run').click() time.sleep(2) df_url = driver.switch_to_window(driver.window_handles[0]) page = requests.get(df_url).text soup = BeautifulSoup(page, features = 'html5lib') soup.prettify()

2条回答

网友

1楼 · 编辑于 2024-10-01 02:19:10

问题在于：

df_url = driver.switch_to_window(driver.window_handles[0])
page = requests.get(df_url).text

df_url未引用页面的url。要获得该结果，您应该在切换窗口后调用driver.current_url，以获取活动窗口的url

其他一些要点：

通过xpath查找元素的效率相对较低（source）
您可以研究使用explicit waits而不是time.sleep

网友

2楼 · 编辑于 2024-10-01 02:19:10

在驱动变量下面插入url，因为首先执行webdriver，然后执行提供的url

driver=webdriver.Chrome(executable_path=r"C:\mypathto\chromedriver.exe")
url = "https://www.website.com"

相关问题更多 >

编程相关推荐

热门问题

热门文章