如何在Python中获取xml中元素的值问题的回答

如何在Python中获取xml中元素的值

回答此问题可获得 20 贡献值，回答如果被采纳可获得 50 分。

<pre><code><?xml version="1.0" encoding="utf-8"?> <bookstore name="Libreria Pastor"> <book category="COOKING"> <title lang="en">Everyday Italian</title> <author> <writer>Giada De Laurentiis</writer> <resumer>Pepe Lopez</resumer> </author> <year>2005</year> <price>30.00</price> </book> <book category="CHILDREN"> <title lang="en">Harry Potter</title> <author> <writer>J K. Rowling</writer> <resumer>Ana Martinez</resumer> </author> <year>2005</year> <price>29.99</price> </book> <book category="PROGRAMMING"> <title lang="en">Python for All</title> <author> <writer>M.L. Jobs</writer> <resumer>Delton Jones</resumer> </author> <year>2015</year> <price>39.99</price> </book> </bookstore> from xml.dom import minidom arbol_dom = minidom.parse('C:\\Users\\MiguelRG\\Desktop\\sge\\Pythons\\e3.xml') listaBibliotecas = arbol_dom.getElementsByTagName("bookstore"); listaLibros = arbol_dom.getElementsByTagName("book"); listaAutores = arbol_dom.getElementsByTagName("author"); for biblioteca in listaBibliotecas: print(biblioteca.tagName); print("Nombre : " +biblioteca.getAttribute("name")); print("Tiene hijos:"+str(biblioteca.hasChildNodes())); for l in listaLibros: print("Tipo: "+l.tagName); print("Categoria: "+l.getAttribute("category")); print("Titulo : " +l.childNodes[0].nodeValue); print("Lenguaje : "+l.getAttribute("lang")); for a in listaAutores: **print("Escritor : " + str(a.childNodes[0].nodeValue));** **print("Resumen por : "+str(a.childNodes[1].nodeValue));** break; </code></pre> 我想用这个程序或者类似的东西来阅读xml，但是我不能得到书名里面的信息和价格，我需要先打印书店的信息，然后是每本书的信息，然后是作者的信息 任何帮助都将受到感激 谢谢你

0 条评论
分类：Python问答

默认排序时间排序

1 个回答

匿名 1天前

　擅长：python、mysql、java

xml文档中有很多节点。例如，与 <pre><code><book> <title>I Am The Very Model</title> </book> </code></pre> <code>title</code>不是<code>childNodes[0]</code>。这是一个文本节点，换行符和<code><book></code>和<code><title></code>之间的空格。您需要在子节点中搜索title元素，最简单的方法是使用<code>getElementsByTagName</code>。一旦获得正确的元素，可能会有多个节点保存文本。您需要枚举所有这些文本才能找到所需的文本。您还需要决定节点周围的哪些空白位可以被剥离，否则可能会导致输出中出现奇怪的间隙 迁移到<code>ElementTree</code>或<code>lxml</code>的一个原因是，它们倾向于整理这些内容，并为您提供一个更简单的API 您还需要注意调用<code>getElementsByTagName</code>的位置。当你做了<code>listaAutores = arbol_dom.getElementsByTagName("author");</code>你得到了文档中所有的作者，而你真的只是想要一本书的作者 作为旁白，去掉行末多余的分号。它们是不必要的，会让python程序员发疯 另一方面，<code>print</code>添加空格并将对象转换为字符串。只需使用它的功能，而不是字符串串联，这样您的代码就具有一致的外观和感觉 <pre><code>from xml.dom import minidom arbol_dom = minidom.parse('test.xml') def get_elem_text(elem): """join text in all immediate child text nodes""" return ''.join(node.data for node in elem.childNodes if node.nodeType == node.TEXT_NODE) for biblioteca in arbol_dom.getElementsByTagName("bookstore"): print(biblioteca.tagName) print("Nombre :", biblioteca.getAttribute("name")) print("Tiene hijos:", biblioteca.hasChildNodes()) for l in biblioteca.getElementsByTagName("book"): print("Tipo:", l.tagName) print("Categoria:", l.getAttribute("category")) print("Titulo :", get_elem_text(l.getElementsByTagName("title")[0])) print("Lenguaje :", l.getAttribute("lang")) for a in l.getElementsByTagName("author"): print("Escritor :", get_elem_text(a.getElementsByTagName("writer")[0])) print("Resumen por :", get_elem_text(a.getElementsByTagName("resumer")[0])) break </code></pre>

如何在Python中获取xml中元素的值

1 个回答

相关Python问题