如何在使用Python保存页眉和页脚的同时拆分XML文件（在特定的N个节点上）？

<?xml version="1.0"?> <catalog catalogName="cat1" catalogType="bestsellers"> <headerNode node="1"> <param1>value1</param1> <param2>value2</param2> </headerNode> <book id="bk101"> <author>Gambardella, Matthew</author> <title>XML Developer's Guide</title> <genre>Computer</genre> <price>44.95</price> <publish_date>2000-10-01</publish_date> <description>An in-depth look at creating applications with XML.</description> </book> <book id="bk102"> <author>Ralls, Kim</author> <title>Midnight Rain</title> <genre>Fantasy</genre> <price>5.95</price> <publish_date>2000-12-16</publish_date> <description>A former architect battles corporate zombies, an evil sorceress, and her own childhood to become queen of the world.</description> </book> <book id="bk103"> <author>Corets, Eva</author> <title>Maeve Ascendant</title> <genre>Fantasy</genre> <price>5.95</price> <publish_date>2000-11-17</publish_date> <description>After the collapse of a nanotechnology society in England, the young survivors lay the foundation for a new society.</description> </book> <footerNode node="2"> <param1>value1</param1> <param2>value2</param2> </footerNode> </catalog>

import sys import xml.etree.ElementTree as ET import os # Get the current directory cwd = os.getcwd() # Load the xml doc = ET.parse(r"%s/%s.xml" % (cwd,sys.argv[1])) root = doc.getroot() # Get the header element header = root.find("headerNode") # Get the footer element footer = root.find("footerNode") # loop over the books and create the new xml file for idx,book in enumerate(root.findall(sys.argv[2])): top = ET.Element(root.tag) top.append(header) top.append(book) top.append(footer) out_book = ET.ElementTree(top) # the output file name will be the ID of the book out_path = "%s/%s_%s.xml" % (cwd,sys.argv[1],idx) out_book.write(open(out_path, "wb"))

2条回答

网友

1楼 · 编辑于 2024-10-04 01:28:40

从我的头顶上你可以这样做：

import xml.etree.ElementTree as ET

# Load the xml
doc = ET.parse(r"d:\books.xml")
root = doc.getroot()
# Get the header element
header = root.find("headerNode")
# Get the footer element
footer = root.find("footerNode")
# loop over the books and create the new xml file
for book in root.findall('book'):
    top = ET.Element(root.tag)
    top.append(header)
    top.append(book)
    top.append(footer)
    out_book = ET.ElementTree(top)
    # the output file name will be the ID of the book
    out_path = "%s.xml" % book.attrib["id"]
    out_book.write(open(out_path, "wb"))

网友

2楼 · 编辑于 2024-10-04 01:28:40

算法如下：

解析xml文件并获取现有根目录
这样，就形成了所有书籍的基础-有页眉和页脚的目录-新的根目录。在
现在，遍历根标记以获取标记为“book”的所有元素
然后，将book元素插入到新的\u根目录并将其写入一个文件-这里我已经写入了一个与您的id同名的文件！在

#question 2 - tag name as input from user!
tag_name=raw_input("Enter tag name:")
from xml.etree.ElementTree import ElementTree,parse,Element
root = parse('sample.xml').getroot()
new_root=Element(root.tag)
#question 1 - multiple header and footer!
new_root.extend(root.findall('.//headerNode'))
new_root.extend(root.findall('.//footerNode'))
for elem in root:
    if elem.tag == tag_name:
        new_root.insert(1,elem)
        #question 3 - write output to file!
        ElementTree(new_root).write(open('path/to/folder'+elem.get('id')+'.xml', 'wb'))
        new_root.remove(elem)

样本输出：

文件名：bk101.xml

^{pr2}$

编码快乐！在

相关问题更多 >

编程相关推荐

热门问题

热门文章