Python BeautifulSoup刮取n种类型的元素

<div class="detail-50"> <div class="detail-panel-wrap"> <h3>Contact details</h3> Website: <a href="http://www.somewebsitefrompage.com">http://www.somewebsitefrompage.com</a><br />Email: <a href="mailto:somemailfrompage.com">somemailfrompage.com</a><br />Tel: 11111111 111 </div> </div>

<div class="detail-50"> <div class="detail-panel-wrap"> <h3>Public address</h3> Mr Martin Austin, Some street, Some city, some ZIP </div> </div>

1条回答

网友

1楼 · 发布于 2024-09-28 23:52:49

如果有多个divdetail panel wrap，则可以使用h3文本来获取所需的div:

contact = soup.find("h3", text="Contact details").parent
address = soup.find("h3", text="Public address").parent

如果我们在一个样本上运行，你可以看到我们得到两个div：

^{pr2}$

可能还有其他方法，但是如果没有看到完整的html结构，就不可能知道。在

对于您的编辑，您只需使用选择器和select-one：

 telephone = soup.select_one("#ContentPlaceHolderDefault_cp_content_ctl00_CharityDetails_4_TabContainer1_tpOverview_plContact.detail-panel div.detail-50:nth-of-type(1) div.detail-panel-wrap")            

address = soup.select_one("#ContentPlaceHolderDefault_cp_content_ctl00_CharityDetails_4_TabContainer1_tpOverview_plContact.detail-panel div.detail-50:nth-of-type(2) div.detail-panel-wrap")


website = soup.select_one("div.detail-50 a:nth-of-type(1)")

email = soup.select_one("div.detail-panel-wrap a:nth-of-type(2)")

但不能保证仅仅因为选择器在chrome工具中工作。。他们会在你找到的源头上工作。在

相关问题更多 >

编程相关推荐

热门问题

热门文章