在Python中加速合并多个XML文件

import lxml.etree as ET import argparse import os ap = argparse.ArgumentParser() ap.add_argument("-x", "--xmlreffile", required=True, help="Path to list of xmls") ap.add_argument("-s", "--xslfile", required=True, help="Path to the xslfile") args = vars(ap.parse_args()) dom = ET.parse(args["xmlreffile"]) xslt = ET.parse(args["xslfile"]) transform = ET.XSLT(xslt) newdom = transform(dom) print(ET.tostring(newdom, pretty_print=True))

1条回答

网友

1楼 · 发布于 2024-09-30 08:34:45

为什么不制作一个多处理版本的脚本呢。有几种方法你可以做到，但据我所知，这里是我会做的

list = open("listofxmls.xml","r")# assuming this gives you a list of files (adapt if necessary)

yourFunction(xml):
    steps 
    of your
    parse funct
    return(ET.tostring(newdom, pretty_print=True))

from multiprocessing.dummy import Pool as ThreadPool
pool = ThreadPool(4) # number of threads (adapt depending on the task and your CPU)
mergedXML = pool.map(yourFunction,list) # execute the function in parallel
pool.close()
pool.join()

然后，根据需要保存mergedXML。在

希望它能帮助你，或者至少能引导你朝着正确的方向前进

相关问题更多 >

编程相关推荐

热门问题

热门文章