如何将多个XML元素解析为一个字符串？

<?xml version="1.0" encoding="UTF-8" ?> <Mydoc Time="2017-01-02" Period="2017-01-03"> <mycontent ClassID="kinder"> <bibliography> <Id> <Num>123456</Num> </Id> <Body> this is some crazy text my friend </Body> <myreaders> <names> <Id>john</Id> <value>95</value> </names> </myreaders> <school> <myclass> <Id>12</Id> <name>Laura</name> </myclass> <myclass> <Id>14</Id> <name>Frank</name> </myclass> <myclass> <Id>144</Id> <name>Jonny</name> </myclass> <myclass> <Id>222</Id> <name>Alex</name> </myclass> <myclass> <Id>5443</Id> <name>Johnny Doe</name> </myclass> </school> </bibliography> </mycontent> <mycontent ClassID="preK"> <bibliography> <Id> <Num>123456</Num> </Id> <Body> this is another crazy text my friend </Body> <myreaders> <names> <Id>fritz</Id> <value>133</value> </names> </myreaders> </bibliography> </mycontent> </Mydoc>

3条回答

网友

1楼 · 编辑于 2024-09-28 23:12:52

与其他答案类似，略短一点，适用于新添加的节点：

parsedXML = ET.parse( "sample.xml")
root = parsedXML.getroot()
pairs0 = []
pairs1 = []
for mycontent in root.iter('mycontent'):
    pairs0.append(','.join(['(' + name[0].text + '-' + name[1].text + ')' for name in mycontent.iter('names')]))
    pairs1.append(','.join(['(' + myclass[0].text + '-' + myclass[1].text + ')' for myclass in mycontent.iter('myclass')]))
df = pd.DataFrame(data = {"myreaders": pairs0, "school": pairs1}, columns=['myreaders', 'school'])

编辑：修改以解决多个案例。你知道吗

网友

2楼 · 编辑于 2024-09-28 23:12:52

您可以尝试XmlToDict，并将您的XML解析为字典/列表，这会使您的尝试变得更加容易。然后，您可以循环/浏览myclass字典的列表。希望能有所帮助。你知道吗

网友

3楼 · 编辑于 2024-09-28 23:12:52

它成了一个很好的列表理解装置，但我认为这是你需要的。你知道吗

import xml.etree.ElementTree as ET
import pandas as pd
tree = ET.parse('test.xml')
root = tree.getroot()
dicty = {}
dicty['myreaders'] = [','.join(['(' + x.findall('Id')[0].text + '-' + x.findall('value')[0].text + ')' for x in (root.findall('.//mycontent/bibliography/myreaders/names'))])]
dicty['school'] = [','.join(['(' + x.findall('Id')[0].text + '-' + x.findall('name')[0].text + ')' for x in (root.findall('.//mycontent/bibliography/school/myclass'))])]
print(dicty)
print(pd.DataFrame(dicty))

输出：

   myreaders                                             school
0  (john-95)  (12-Laura),(14-Frank),(144-Jonny),(222-Alex),(...

没有真正简单的方法来解析xml，您需要对数据结构进行大量的分析。你知道吗

相关问题更多 >

编程相关推荐

热门问题

热门文章