Python BeautifulSoup在找到的关键字周围添加标记

2条回答

网友

1楼 · 编辑于 2024-10-01 11:23:28

如果你加上文字。。。在

my_tag = node.parent.setString(node.replace(match, "<myspan>"+match+"</myspan>"))

…再把它传给美丽集团

^{pr2}$

它应该被分类为一个BS-tag对象，并可用于解析。在

您可以将这些更改应用于原始大量文本，并将其作为一个整体来运行，以避免重复。在

编辑：

从docs：

# Here is a more complex example that replaces one tag with another: 

from BeautifulSoup import BeautifulSoup, Tag
soup = BeautifulSoup("<b>Argh!<a>Foo</a></b><i>Blah!</i>")
tag = Tag(soup, "newTag", [("id", 1)])
tag.insert(0, "Hooray!")
soup.a.replaceWith(tag)
print soup
# <b>Argh!<newTag id="1">Hooray!</newTag></b><i>Blah!</i>

网友

2楼 · 编辑于 2024-10-01 11:23:28

下面是一个简单的示例，展示了一种方法：

import re
from bs4 import BeautifulSoup as Soup

html = '''
<html><body><p>This is a paragraph</p></body></html>
'''

（1）存储文本并清空标签

^{pr2}$

（2）找出要加粗的单词的起始位置和结束位置（为我的英语道歉）

match = re.search(r'\ba\b', text)
start, end = match.start(), match.end()

（3）拆分文本，增加第一部分

soup.p.append(text[:start])
print soup

（4）创建一个标记，向其添加相关文本，并将其附加到父对象

b = soup.new_tag('b')
b.append(text[start:end])
soup.p.append(b)
print soup

（5）附加正文其余部分

soup.p.append(text[end:])
print soup

下面是上面的输出：

<html><body><p></p></body></html>
<html><body><p>This is </p></body></html>
<html><body><p>This is <b>a</b></p></body></html>
<html><body><p>This is <b>a</b> paragraph</p></body></html>

相关问题更多 >

编程相关推荐

热门问题

热门文章

Python BeautifulSoup在找到的关键字周围添加标记

相关问题 更多 >

编程相关推荐

热门问题

热门文章

相关问题更多 >