如何从python中的动态下拉列表中提取/刮取选项值？

citiesList = [] stationNameList = [] siteIdList = [] for city in cityOptions[1:]: citiesList.append(city.text) stationDropDown = driver.find_element_by_xpath("//select[contains(@id,'stations')]") stationOptions = stationDropDown.find_elements_by_tag_name('option') for ele in citiesList: cityDropdown.send_keys(ele, Keys.RETURN) time.sleep(2) stationDropDown.click() print(stationDropDown.text)

1条回答

网友

1楼 · 发布于 2024-10-01 07:16:59

使用python尝试下面的方法-requests当涉及到请求时，需要简单、直接、可靠、快速和更少的代码。我在检查了谷歌chrome浏览器的网络部分后，从网站本身获取了API URL

下面的脚本到底在做什么：

首先，它将使用API URL和有效负载（执行POST请求非常重要）执行POST请求并获取返回的数据
获取数据后，脚本将使用JSON.loads库解析JSON数据
最后，它将逐个遍历所有站点列表，并打印详细信息，如州名、城市名、站点名和站点Id

Network call tab

Output of below code.

def scrape_aqi_site_id():
URL = 'https://app.cpcbccr.com/aqi_dashboard/aqi_station_all_india' #API URL
payload = 'eyJ0aW1lIjoxNjAzMTA0NTczNDYzLCJ0aW1lWm9uZU9mZnNldCI6LTMzMH0=' #Unique payload fetched from the network request
response = requests.post(URL,data=payload,verify=False) #POST request to get the data using URL and Payload information
result = json.loads(response.text) # parse the JSON object using json library
extracted_states = result['stations'] 
for state in range(len(extracted_states)): # loop over extracted states and its stations data.
    print('=' * 120)
    print('Scraping station data for state : ' + extracted_states[state]['stateID'])
    for station in range(len(extracted_states[state]['stationsInCity'])): # loop over each state station data to get the information of stations
        print('-' * 100)
        print('Scraping data for city and its station : City (' + extracted_states[state]['stationsInCity'][station]['cityID'] + ') & station (' + extracted_states[state]['stationsInCity'][station]['name'] + ')')
        print('City :' + extracted_states[state]['stationsInCity'][station]['cityID'])
        print('Station Name : ' + extracted_states[state]['stationsInCity'][station]['name'])
        print('Station Site Id : ' + extracted_states[state]['stationsInCity'][station]['id'])
        print('-' * 100)        
    print('Scraping of data for state : (' + extracted_states[state]['stateID'] + ') is conmpleted now going for another one...')
    print('=' * 120)

scrape_aqi_site_id()

相关问题更多 >

编程相关推荐

热门问题

热门文章