使用Beautifulsoup从arialabel获取文本

2024-10-04 09:28:56 发布

您现在位置:Python中文网/ 问答频道 /正文

<span class="_1n9k" data-hover="tooltip" tabindex="-1"><a ajaxify="/ufi/reaction/profile/dialog/? ft_ent_identifier=ZmVlZGJhY2s6MzgyODczMjYzMDg5MTQy&amp;reaction_type=1&amp;av=0" aria-label="좋아요 17" class="_1n9l" href="/ufi/reaction/profile/browser/?ft_ent_identifier=ZmVlZGJhY2s6MzgyODczMjYzMDg5MTQy&amp;av=0" rel="dialog" role="button" tabindex="0"><i class="sp_KIvjPBBBAwk sx_eaca68" role="img"></i></a></span>

如何获取文本信息좋아요 17'来自“aria标签”? 我尝试使用get('aria-label'),但它不起作用


Tags: profilelabelclassdialogampspanidentifierft
1条回答
网友
1楼 · 发布于 2024-10-04 09:28:56

'aria-label'span标记内的a标记的attribute。下面是如何提取attribute的值:

from bs4 import BeautifulSoup

html = '<span class="_1n9k" data-hover="tooltip" tabindex="-1"><a ajaxify="/ufi/reaction/profile/dialog/? ft_ent_identifier=ZmVlZGJhY2s6MzgyODczMjYzMDg5MTQy&amp;reaction_type=1&amp;av=0" aria-label="좋아요 17" class="_1n9l" href="/ufi/reaction/profile/browser/?ft_ent_identifier=ZmVlZGJhY2s6MzgyODczMjYzMDg5MTQy&amp;av=0" rel="dialog" role="button" tabindex="0"><i class="sp_KIvjPBBBAwk sx_eaca68" role="img"></i></a></span>'

soup = BeautifulSoup(html,'html5lib')

span = soup.find('span', class_ = "_1n9k")

print(span.a['aria-label'])

输出:

좋아요 17

相关问题 更多 >