Parsing by id with BeautifulSoup

html_doc = """
<html><head><title>The Dormouse's story</title></head>
<body>
The Dormouse's story
Once upon a time there were three little sisters; and their names were
<a href="http://example.com/elsie" class="sister" id="link1">Elsie</a>,
<a href="http://example.com/lacie" class="sister" id="link2">Lacie</a> and
<a href="http://example.com/tillie" class="sister" id="link3">Tillie</a>;
and they lived at the bottom of a well.
...
"""

1 answer

User

#1 · Posted 2024-10-08 18:25:34

If you want to solve this with find_all(), you can pass either a compiled regular expression or a function as the id filter:

import re

soup.find_all("a", id=re.compile(r"^link\d+$"))  # id is exactly 'link' followed by one or more digits
soup.find_all("a", id=lambda value: value and value.startswith("link"))  # id starts with 'link'; the `value and` guard skips tags without an id
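The two filters above can be tried end to end with the sketch below. It assumes `bs4` is installed and uses the standard-library `html.parser` backend. The markup is a trimmed version of the question's `html_doc`; the `other` link and the link without an id are hypothetical additions, included only to show what each filter rejects.

```python
import re

from bs4 import BeautifulSoup

# Trimmed sample markup. The last two <a> tags are hypothetical additions
# (not in the original question) used to show what the filters exclude.
html_doc = """
<p>
<a href="http://example.com/elsie" class="sister" id="link1">Elsie</a>,
<a href="http://example.com/lacie" class="sister" id="link2">Lacie</a> and
<a href="http://example.com/tillie" class="sister" id="link3">Tillie</a>
<a href="http://example.com/other" id="other">Other</a>
<a href="http://example.com/noid">No id</a>
</p>
"""

soup = BeautifulSoup(html_doc, "html.parser")

# Regular expression filter: the id must be 'link' followed only by digits.
by_regex = soup.find_all("a", id=re.compile(r"^link\d+$"))

# Function filter: called with the id value, or None when the tag has no id,
# which is why the `value and ...` guard is needed.
by_function = soup.find_all("a", id=lambda value: value and value.startswith("link"))

regex_ids = [a["id"] for a in by_regex]
function_ids = [a["id"] for a in by_function]

print(regex_ids)
print(function_ids)
```

On this sample both filters select the same three links. They differ on ids such as `linkX`, which the function accepts and the regular expression rejects.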

Alternatively, you can use a CSS selector with select(), such as an attribute selector that matches on the id prefix.
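The exact selector from the answer is not shown here, so the following is only a sketch of one way to do it, assuming the goal is the same as in the find_all() examples: match `<a>` tags whose id begins with `link`. It uses the `[attr^="value"]` prefix form of the CSS attribute selector with `select()`, and `select_one()` with `#id` for a single exact id. The markup is a trimmed version of the question's `html_doc`, and the `other` link is a hypothetical addition.

```python
from bs4 import BeautifulSoup

# Trimmed sample markup. The last <a> tag is a hypothetical addition
# (not in the original question) used to show a non-matching id.
html_doc = """
<p>
<a href="http://example.com/elsie" class="sister" id="link1">Elsie</a>,
<a href="http://example.com/lacie" class="sister" id="link2">Lacie</a> and
<a href="http://example.com/tillie" class="sister" id="link3">Tillie</a>
<a href="http://example.com/other" id="other">Other</a>
</p>
"""

soup = BeautifulSoup(html_doc, "html.parser")

# Attribute prefix selector: <a> tags whose id starts with "link".
prefix_matches = soup.select('a[id^="link"]')
prefix_ids = [a["id"] for a in prefix_matches]

# Id selector: the single element whose id is exactly "link2".
exact_match = soup.select_one("#link2")

print(prefix_ids)
print(exact_match.get_text())
```

Note that the prefix selector behaves like the `startswith("link")` function filter, not like the stricter `^link\d+$` regular expression.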
