Test if a child tag exists in BeautifulSoup

data = open("file1.xml", 'r').read()
xml = BeautifulSoup(data)
hasAttrBs = xml.document.subdoc.has_attr('myID')
hasAttrPy = hasattr(xml.document.subdoc, 'myID')
hasType = type(xml.document.subdoc.myid)

3 Answers

User

Answer 1 · edited 2024-06-26 14:32:00

Here is an example that checks whether an h2 tag exists in the page at an Instagram URL. I hope you find it useful:

import requests
from bs4 import BeautifulSoup

instagram_url = 'https://www.instagram.com/p/BHijrYFgX2v/?taken-by=findingmero'
html_source = requests.get(instagram_url).text
soup = BeautifulSoup(html_source, "lxml")

# find() returns None when the page contains no matching tag
if not soup.find('h2'):
    print("didn't find h2")

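User

Answer 2 · edited 2024-06-26 14:32:00

If you don't know the structure of the XML document, you can use the soup's .find() method, which searches the whole tree. Like this: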

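from bs4 import BeautifulSoup

with open("file1.xml", 'r') as data, open("file2.xml", 'r') as data2:
    # Name the parser explicitly; html.parser lowercases tag names
    xml = BeautifulSoup(data.read(), "html.parser")
    xml2 = BeautifulSoup(data2.read(), "html.parser")

    # find() returns the first matching tag, or None if there is none
    hasAttrBs = xml.find("myid")
    hasAttrBs2 = xml2.find("myid")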
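One thing to watch with .find(): HTML parsers lowercase tag names while parsing, so the name you search for has to be lowercase too. The sketch below illustrates this with an inline string and the built-in "html.parser"; the markup and variable names are made up for the example and are not taken from the question's files.

```python
from bs4 import BeautifulSoup

# Hypothetical markup standing in for file1.xml
markup = "<document><subdoc><myId>1</myId></subdoc></document>"
soup = BeautifulSoup(markup, "html.parser")

# The parser stored the tag as "myid", so the mixed-case lookup misses it
mixed_case_hit = soup.find("myId")
lower_case_hit = soup.find("myid")

print(mixed_case_hit)   # no match
print(lower_case_hit)   # the <myid> tag
```

If the original casing matters, parsing with the "xml" feature is an option, but that requires lxml to be installed.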

If you do know the structure, you can get the element you want by accessing tag names as attributes, as in xml.document.subdoc.myid. So the whole thing would look like this:

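from bs4 import BeautifulSoup

with open("file1.xml", 'r') as data, open("file2.xml", 'r') as data2:
    xml = BeautifulSoup(data.read(), "html.parser")
    xml2 = BeautifulSoup(data2.read(), "html.parser")

    # Attribute-style access returns the first matching descendant, or None
    hasAttrBs = xml.document.subdoc.myid
    hasAttrBs2 = xml2.document.subdoc.myid
    print(hasAttrBs)
    print(hasAttrBs2)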

Prints:

<myid>1</myid>
None
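To tie this back to the question's has_attr() attempt: has_attr() looks at a tag's attributes (as in <subdoc myid="1">), not at its child tags, which would explain why it cannot detect a child element. Here is a minimal sketch of the difference, using made-up inline markup and a hypothetical helper named describe():

```python
from bs4 import BeautifulSoup

# Three hypothetical documents: child tag present, nothing, attribute present
with_child = "<document><subdoc><myid>1</myid></subdoc></document>"
without_child = "<document><subdoc></subdoc></document>"
with_attr = '<document><subdoc myid="1"></subdoc></document>'


def describe(markup):
    """Return (has_child_tag, has_attribute) for the <subdoc> element."""
    subdoc = BeautifulSoup(markup, "html.parser").document.subdoc
    has_child_tag = subdoc.find("myid") is not None   # looks for a child element
    has_attribute = subdoc.has_attr("myid")           # looks at subdoc's own attributes
    return has_child_tag, has_attribute


for label, markup in [("child", with_child),
                      ("empty", without_child),
                      ("attribute", with_attr)]:
    print(label, describe(markup))
```

Comparing the result of find() against None is the check that actually answers "does the child tag exist".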

User

Answer 3 · edited 2024-06-26 14:32:00

# tag is any BeautifulSoup Tag; find() returns None when no match exists
if tag.find('child_tag_name'):
    print("child tag exists")
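Note that find() searches all descendants by default, so the check above also succeeds for grandchildren. If only immediate children should count, passing recursive=False restricts the search. A small sketch, where has_direct_child() and the tag names are hypothetical and chosen only for illustration:

```python
from bs4 import BeautifulSoup


def has_direct_child(tag, name):
    """Return True if `tag` has an immediate child element called `name`."""
    return tag.find(name, recursive=False) is not None


# Made-up markup: <inner> is a grandchild of <outer>, not a direct child
soup = BeautifulSoup("<outer><middle><inner></inner></middle></outer>",
                     "html.parser")
outer = soup.outer

print(has_direct_child(outer, "middle"))   # direct child
print(has_direct_child(outer, "inner"))    # grandchild only
print(outer.find("inner") is not None)     # default search includes descendants
```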
