使用beauthoulsoup搜索标记内的文本，并在i之后返回标记中的文本

<h2>Details</h2> <div class="section-inner"> <div class="_UCu"> <h3 class="_mEu">General</h3> <div class="_JDu"> <span class="_IDu">Color</span> <span class="_KDu">Slate, mykonos</span> </div> </div> <div class="_UCu"> <h3 class="_mEu">Carrying Case</h3> <div class="_JDu"> <span class="_IDu">Type</span> <span class="_KDu">Protective cover</span> </div> <div class="_JDu"> <span class="_IDu">Recommended Use</span> <span class="_KDu">For cell phone</span> </div> <div class="_JDu"> <span class="_IDu">Protection</span> <span class="_KDu">Impact protection</span> </div> <div class="_JDu"> <span class="_IDu">Cover Type</span> <span class="_KDu">Back cover</span> </div> <div class="_JDu"> <span class="_IDu">Features</span> <span class="_KDu">Camera lens cutout, hard shell, rubberized, port cut-outs, raised edges</span> </div> </div>

2条回答

网友

1楼 · 编辑于 2024-09-30 22:20:10

试试看。它还可以为您提供相应的值。请确保将html elements括在content=""" """变量内，并用三个引号括起来，看看它是如何工作的。在

from bs4 import BeautifulSoup

soup = BeautifulSoup(content,"lxml")
for elem in soup.select("._JDu"):
    item = elem.select_one("span")
    if "Features" in item.text:  #try to see if it misses the corresponding values
        val = item.find_next("span").text
        print(val)

网友

2楼 · 编辑于 2024-09-30 22:20:10

您可以定义一个函数来返回您输入的键的值：

def get_txt(soup, key):
    key_tag = soup.find('span', text=key).parent
    return key_tag.find_all('span')[1].text

color = get_txt(soup, 'Color')
print('Color: ' + color)
features = get_txt(soup, 'Features')
print('Features: ' + features)

输出：

^{pr2}$

我希望这就是你要找的。在

说明：

soup.find('span', text=key)返回<span>标记，其text=key。在

.parent返回当前<span>标记的父标记。在

示例：

当key='Color'时，soup.find('span', text=key).parent将返回

<div class="_JDu">
    <span class="_IDu">Color</span>
    <span class="_KDu">Slate, mykonos</span>
</div>

现在我们把它存储在key_tag。只剩下第二个<span>的文本，这是key_tag.find_all('span')[1].text行所做的。在

相关问题更多 >

编程相关推荐

热门问题

热门文章