解析大型压缩xml文件，python

网友

1楼 · 编辑于 2024-10-04 01:27:03

你能传入一个mmap（）的文件吗？这样可以自动分页文件所需的部分，避免内存溢出。当然，如果expat构建了一个解析树，它可能仍然会耗尽内存。在

http://docs.python.org/library/mmap.html

Memory-mapped file objects behave like both strings and like file objects. Unlike normal string objects, however, these are mutable. You can use mmap objects in most places where strings are expected; for example, you can use the re module to search through a memory-mapped file.

网友

2楼 · 编辑于 2024-10-04 01:27:03

只需使用p.ParseFile（file）而不是p.Parse（file）。在

Parse（）接受字符串，ParseFile（）接受文件句柄，并根据需要读取数据。在

参考号：http://docs.python.org/library/pyexpat.html#xml.parsers.expat.xmlparser.ParseFile

网友

3楼 · 编辑于 2024-10-04 01:27:03

在file对象上使用^{}以字符串形式读入整个文件，然后将其传递给Parse？在

file  = BZ2File(SOME_FILE_PATH)
p = xml.parsers.expat.ParserCreate()
p.Parse(file.read())

相关问题更多 >

编程相关推荐

热门问题

热门文章

解析大型压缩xml文件，python

相关问题 更多 >

编程相关推荐

热门问题

热门文章

相关问题更多 >