BeautifulSoup：在定位div之后查找其他元素

<div class="ProfileDesc"> Name Tom Ready Born <bxi> 10 Jan 1960</bxi> Death <bxi> 01 Jun 2019</bxi> </div>

3条回答

网友

1楼 · 编辑于 2024-09-19 23:33:39

“1960年1月10日”之后的html代码没有结束p标记

name = soup.find('span',string='Name').parent.text.replace('Name','').strip()
born = soup.find('span',string='Born').parent.text.replace('Born','').strip()
death = soup.find('span',string='Death').parent.text.replace('Death','').strip()
print(f'Name: {name}')
print(f'Born: {born}')
print(f'Death: {death}')

网友

2楼 · 编辑于 2024-09-19 23:33:39

当您非常确定DOM结构时：

mydivs = soup.find("div", {"class": "ProfileDesc"})

for element in mydivs.find_all("p"):
    title = element.find("span")
    content = title.findNext("span")
    print("%s : %s" % (title.text.strip(), content.text.strip()))

输出：

Name : Tom Ready
Born : 10 Jan 1960
Death : 01 Jun 2019

网友

3楼 · 编辑于 2024-09-19 23:33:39

试试这个

keys_ = set() # avoid duplicate keys

for p in mydivs.find_all("p"):
    ss = list(p.stripped_strings)

    for k, v in zip(ss[::2], ss[1::2]):
        if k in keys_:
            continue
            
        keys_.add(k)
        print(k, ":", v)

Name : Tom Ready
Born : 10 Jan 1960
Death : 01 Jun 2019

相关问题更多 >

编程相关推荐

热门问题

热门文章

BeautifulSoup：在定位div之后查找其他元素

相关问题 更多 >

编程相关推荐

热门问题

热门文章

相关问题更多 >