Extracting specific airport codes from a page with BeautifulSoup

import requests
from bs4 import BeautifulSoup

page = requests.get('https://www.loungebuddy.com/select/locations')
soup = BeautifulSoup(page.text, 'html.parser')
airport_code_html_lines = soup.find_all(attrs={'class': 'aiprt-code'})

2 Answers

User

Answer #1 · edited 2024-10-01 19:20:47

If you want both the airport code and the country name, you can try the following:

import requests
from bs4 import BeautifulSoup

page = requests.get('https://www.loungebuddy.com/select/locations')
soup = BeautifulSoup(page.text, 'html.parser')
# Map each country name to the first airport code listed under it
airport_code = {
    item.select_one("h2").text: item.select_one(".aiprt-code").text
    for item in soup.select(".country")
}
print(airport_code)

Partial output:

{'India': 'BLR', 'Poland': 'KTW', 'Thailand': 'BKK', 'Croatia': 'ZAG', ...}
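
Note that select_one returns only the first match, so the dictionary above keeps one code per country, and it raises AttributeError if a block has no h2 or no code element. A minimal sketch of a variant that collects every code per country is shown here. It runs against a small hypothetical HTML sample (SAMPLE_HTML) that is only assumed to mirror the page's .country / h2 / .aiprt-code structure; the helper name codes_by_country is also made up for illustration.

```python
from bs4 import BeautifulSoup

# Hypothetical sample markup, assumed to mirror the structure of the real page.
SAMPLE_HTML = """
<div class="country">
  <h2>Australia</h2>
  <span class="aiprt-code">BNE</span>
  <span class="aiprt-code">SYD</span>
</div>
<div class="country">
  <h2>Belgium</h2>
  <span class="aiprt-code">BRU</span>
</div>
"""


def codes_by_country(html):
    """Return {country name: [all airport codes found in that block]}."""
    soup = BeautifulSoup(html, "html.parser")
    result = {}
    for block in soup.select(".country"):
        heading = block.select_one("h2")
        if heading is None:
            # Skip blocks without a country heading instead of raising.
            continue
        name = heading.get_text(strip=True)
        result[name] = [
            span.get_text(strip=True) for span in block.select(".aiprt-code")
        ]
    return result


print(codes_by_country(SAMPLE_HTML))
```

To use it on the live page, you would pass page.text instead of SAMPLE_HTML, assuming the real markup matches the selectors used in the answer.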

User

Answer #2 · edited 2024-10-01 19:20:47

Simply replace print(airport_code.prettify()) with print(airport_code.text) to get the desired output.

Try the following code (it is a bit cleaner):

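import requests
from bs4 import BeautifulSoup

page = requests.get('https://www.loungebuddy.com/select/locations')
soup = BeautifulSoup(page.text, 'html.parser')

# Print the text of every <span class="aiprt-code"> element
for code in soup.find_all('span', class_='aiprt-code'):
    print(code.text)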

You can also write soup.find_all('span', {'class': 'aiprt-code'}) instead of soup.find_all('span', class_='aiprt-code'). The two are equivalent.

Output:

BNE
SYD
BGI
BRU
...
...
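
To illustrate that the two find_all spellings mentioned above select the same elements, here is a small sketch that runs on a hypothetical inline HTML string (SAMPLE_HTML, not the real page) so it needs no network access. The CSS-selector form via select is included for comparison.

```python
from bs4 import BeautifulSoup

# Hypothetical markup for illustration; the real page is assumed to use the same class.
SAMPLE_HTML = (
    '<span class="aiprt-code">BNE</span>'
    '<span class="other">ignored</span>'
    '<span class="aiprt-code">SYD</span>'
)

soup = BeautifulSoup(SAMPLE_HTML, "html.parser")

# Keyword form: class_ avoids clashing with the Python keyword "class".
by_keyword = [tag.text for tag in soup.find_all("span", class_="aiprt-code")]

# Dictionary form: the second positional argument of find_all is attrs.
by_dict = [tag.text for tag in soup.find_all("span", {"class": "aiprt-code"})]

# CSS selector form.
by_css = [tag.text for tag in soup.select("span.aiprt-code")]

print(by_keyword, by_dict, by_css)
```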

Alternatively, if you want the airport codes in a list, you can use a list comprehension as shown below. A list makes the data easier to store, reuse, and modify.

# Collect the text of every airport-code span into a list
airport_codes = [x.text for x in soup.find_all('span', class_='aiprt-code')]
print(airport_codes)

Output:

['BNE', 'SYD', 'BGI', 'BRU', 'GIG', 'SOF', 'PNH', 'REP', ... ]
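
As a rough illustration of why a list is convenient here, the sketch below removes duplicates while keeping the original order and then sorts the result. It starts from the codes visible in the output above; the repeated 'SYD' entry is a hypothetical addition made only to show the de-duplication step.

```python
# Codes taken from the partial output above.
codes = ['BNE', 'SYD', 'BGI', 'BRU', 'GIG', 'SOF', 'PNH', 'REP']

# Hypothetical duplicate, added only to demonstrate de-duplication.
codes_with_repeat = codes + ['SYD']

# dict.fromkeys keeps the first occurrence of each key, preserving order.
unique_codes = list(dict.fromkeys(codes_with_repeat))

# Alphabetical order, e.g. for display or lookup.
sorted_codes = sorted(unique_codes)

print(unique_codes)
print(sorted_codes)
```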
