2024-10-04 05:26:46 发布
网友
给定一个类似http://savannah.gnu.org/bugs/?23435的URL,提取信息的最佳方法是什么:
1:项目名称:Gnash-GNU Flash播放器
2:问题标题:Firefox菜单上方呈现的Flash内容
您可以使用BeautifulSoup:
from bs4 import BeautifulSoup import urllib.request import re response = urllib.request.urlopen('http://savannah.gnu.org/bugs/?23435') html = response.read() soup = BeautifulSoup(html) p_title = soup.select('.toptitle')[0].text # 'Gnash - The GNU Flash player - Bugs: bug #23435, Flash content rendered above menus...' p_title = p_title.split(' - Bugs:')[0] i_title = soup.select('.priore')[0].text # 'bug #23435: Flash content rendered above menus in Firefox' i_title = re.findall('bug #[0-9]*: (.+)', i_title)[0] print(p_title) # 'Gnash - The GNU Flash player' print(i_title) # 'Flash content rendered above menus in Firefox'
(Python3)
您可以使用BeautifulSoup:
(Python3)
相关问题 更多 >
编程相关推荐