如何在BS4中搜索包含给定字符串的标记？

def search(self): steam_results = self.soup.find_all('span', class_='title') itr = 1 for tag in steam_results: if self.title in tag.string: # <--- Not working print(str(itr) + ': ' + tag.string + '\n') itr = itr + 1

2条回答

网友

1楼 · 编辑于 2024-05-19 08:11:52

问题是子字符串检查，因为它是case-sensitive。如果使用skyrim进行检查，将得到空结果，因为没有title包含skyrim，而是包含Skyrim。所以，把它和小写的标题比较一下

steam_results = soup.find_all('span', class_='title')
for steam in steam_results:
    if 'skyrim' in steam.getText().lower():
        print(steam.getText())

输出：

The Elder Scrolls V: Skyrim Special Edition
The Elder Scrolls V: Skyrim VR
Skyrim Script Extender (SKSE)
The Elder Scrolls V: Skyrim Special Edition - Creation Club

网友

2楼 · 编辑于 2024-05-19 08:11:52

可以使用soup.find_all(string=re.compile("your_string_here")获取文本，然后使用.parent获取标记。你知道吗

from bs4 import BeautifulSoup
import re
html="""
<p id="1">Hi there</p>
<p id="2">hello<p>
<p id="2">hello there<p>
"""
soup=BeautifulSoup(html,'html.parser')
print([tag.parent for tag in soup.find_all(string=re.compile("there"))])

输出

[<p id="1">Hi there</p>, <p id="2">hello there<p>\n</p></p>]

相关问题更多 >

编程相关推荐

热门问题

热门文章

如何在BS4中搜索包含给定字符串的标记？

相关问题 更多 >

编程相关推荐

热门问题

热门文章

相关问题更多 >