基于元素字符串中的特定单词搜索HTML元素

<p> ExampleStringWord#1 needs to “find” this entire element based on the "finding" of the first word </p> <p> Example#2 this element ignored </p> <p> ExampleStringWord#1 needs to find this entire element as well because the first word of this string is what I’m “searching” for, even though the wording after the first word in the string is different <p>

<h1> ExampleStringWord#1 needs to “find” this entire element based on the "finding" of the first word </h1> <p> Example#2 this element ignored </p> <h1> ExampleStringWord#1 needs to find this entire element as well because the first word of this string is what I’m “searching” for, even though the wording after the first word in the string is different <h1>

1条回答

网友

1楼 · 发布于 2024-10-03 00:25:39

如果使用指定的子字符串（注意re.compile()部分）定位p元素，然后将元素名称替换为h1：

import re

from bs4 import BeautifulSoup

data = """
<body>
    <p> ExampleStringWord#1 needs to “find” this entire element based on the "finding" of the first word </p>
    <p> Example#2  this element ignored </p>
    <p> ExampleStringWord#1 needs to find this entire element as well because the first word of this string is what I’m “searching” for, even though the wording after the first word in the string is different </p>
</body>
"""

soup = BeautifulSoup(data, "html.parser")
for p in soup.find_all("p", string=re.compile("ExampleStringWord#1")):
    p.name = 'h1'
print(soup)

印刷品：

^{pr2}$

相关问题更多 >

编程相关推荐

热门问题

热门文章