使用python获取xml节点的所有父节点

<Departments orgID="123" name="xmllist"> <Department> <orgID>124</orgID> <name>A</name> <type>type a</type> <status>Active</status> <Department> <orgID>125</orgID> <name>B</name> <type>type b</type> <status>Active</status> <Department> <orgID>126</orgID> <name>C</name> <type>type c</type> <status>Active</status> </Department> </Department> </Department> <Department> <orgID>109449</orgID> <name>D</name> <type>type d</type> <status>Active</status> </Department> </Departments>

2条回答

网友

1楼 · 编辑于 2024-09-30 01:27:48

使用lxml和XPath：

>>> s = '''
... <Departments orgID="123" name="xmllist">
...     <Department>
...         <orgID>124</orgID>
...         <name>A</name>
...         <type>type a</type>
...         <status>Active</status>
...             <Department>
...                 <orgID>125</orgID>
...                 <name>B</name>
...                 <type>type b</type>
...                 <status>Active</status>
...                 <Department>
...                     <orgID>126</orgID>
...                     <name>C</name>
...                     <type>type c</type>
...                     <status>Active</status>
...                 </Department>
...             </Department>
...     </Department>
...     <Department>
...         <orgID>109449</orgID>
...         <name>D</name>
...         <type>type d</type>
...         <status>Active</status>
...     </Department>
... </Departments>
... '''

使用ancestor-or-self轴，您可以找到节点本身、父节点、祖父母。。。在

^{pr2}$

网友

2楼 · 编辑于 2024-09-30 01:27:48

使用lxml的iterancestors()方法。在

from lxml import etree

doc = etree.fromstring(xml)
rval = {}
for org in doc.xpath('//orgID[text()="126"]'):
    for ancestor in org.iterancestors('Department'):
        id=ancestor.find('./orgID').text
        name=ancestor.find('./name').text
        rval[name]=id

print rval

输出：

^{pr2}$

如果您实际上试图保留元素的顺序，则不能使用dict，因为您无法控制dict中的键顺序。您必须使用OrderedDict或just and数组元组：

doc = etree.fromstring(xml)
a = []
for org in doc.xpath('//orgID[text()="126"]'):
    for ancestor in org.iterancestors():
        if ancestor.find('./orgID') is not None:
            id=ancestor.find('./orgID').text
            name=ancestor.find('./name').text
        elif ancestor.get('orgID'):
            id=ancestor.get('orgID')
            name=ancestor.get('name')
        else:
            continue

        print id,name
        a.append((name,id))

print "In order of discovery:\n    ", a 
print "From root to child\n    ", [x for x in reversed(a)]
print "dict keys are not sorted\n    ", dict(a)

输出：

126 C
125 B
124 A
123 xmllist
In order of discovery:
     [('C', '126'), ('B', '125'), ('A', '124'), ('xmllist', '123')]
From root to child
     [('xmllist', '123'), ('A', '124'), ('B', '125'), ('C', '126')]
dict keys are not sorted
     {'A': '124', 'xmllist': '123', 'C': '126', 'B': '125'}

相关问题更多 >

编程相关推荐

热门问题

热门文章