如何在特定的英文维基百科文章中找到所有英文维基百科链接

##### Imports ##### from bs4 import BeautifulSoup from bs4.dammit import EncodingDetector import requests ##### Functions ##### parser = 'html.parser' resp = requests.get("https://en.wikipedia.org/wiki/Influenza") http_encoding = resp.encoding if 'charset' in resp.headers.get('content-type','').lower() else None html_encoding = EncodingDetector.find_declared_encoding(resp.content, is_html=True) encoding = html_encoding or http_encoding soup = BeautifulSoup(resp.content, parser, from_encoding=encoding) for link in soup.find_all('a', href=True): print(link['href'])

1条回答

网友

1楼 · 发布于 2024-06-26 00:16:46

您是否尝试使用WikipediaAPI获取所有链接？。这是获得此类结果的最佳、最准确的方法

在您的情况下，可以使用此API获取Influenza页面内的所有链接

https://en.wikipedia.org/w/api.php?action=query&format=json&prop=linkshere&titles=Influenza&lhlimit=500

只需更改任何维基百科文章的上一个链接中的Influenza，它就可以正常工作

相关问题更多 >

编程相关推荐

热门问题

热门文章