如何使用python beautifulsoup从html中提取文本

2024-09-30 01:34:05 发布

您现在位置:Python中文网/ 问答频道 /正文

我从网页上看到以下文字:

<dd><p class="Definition"><span class="DefinitionTerm"><df>Example Term</df> 
</span>Here is the meaning of my term and its description; (<span 
class="TermLink" lang="fr">définition</span>)</p></dd>
<dd><p class="Definition"><span class="DefinitionTerm"><df>Example Term 
2</df></span>Here is the meaning of my term 2 and its description; (<span 
class="TermLink" lang="fr">définition</span>)</p></dd>
<dd><p class="Definition"><span class="DefinitionTerm"><df>Example Term 
3</df></span>Here is the meaning of my term 3 and its description; (<span 
class="TermLink" lang="fr">définition</span>)</p></dd>

我正在尝试使用python beautifulsoup库提取定义术语,例如“示例术语”,然后是它的描述

因此,我希望看到: “示例术语”,“以下是我术语的含义及其说明” “示例Term2”,“以下是我的Term2的含义及其说明” “示例Term3”,“以下是我的Term3的含义及其说明”


Tags: ofthe示例dfhereisexampledd
1条回答
网友
1楼 · 发布于 2024-09-30 01:34:05
html = '''<dd><p class="Definition"><span class="DefinitionTerm"><df>Example Term</df> </span>Here is the meaning of my term and its description; (<span class="TermLink" lang="fr">définition</span>)</p></dd><dd><p class="Definition"><span class="DefinitionTerm"><df>Example Term 2</df></span>Here is the meaning of my term 2 and its description; (<span class="TermLink" lang="fr">définition</span>)</p></dd><dd><p class="Definition"><span class="DefinitionTerm"><df>Example Term 3</df></span>Here is the meaning of my term 3 and its description; (<span class="TermLink" lang="fr">définition</span></p></dd>'''

soup = BeautifulSoup(html, 'html.parser')

for each in soup.findAll('p', class_='Definition'):
    print(each.get_text())`

相关问题 更多 >

    热门问题