无法用io modu序列化minidom树

import codecs import xml.dom.minidom def transform1(src_path, dst_path): tree = xml.dom.minidom.parse(src_path) # ... with codecs.open(dst_path, mode="w", encoding="utf-8") as fd: tree.writexml(fd, encoding="utf-8")

Traceback (most recent call last): File "/path/to/minidom_demo.py", line 23, in <module> transform2("sample.xml", "result.xml") File "/path/to/minidom_demo.py", line 18, in transform2 tree.writexml(fd, encoding="utf-8") File "/System/Library/Frameworks/Python.framework/Versions/2.7/lib/python2.7/xml/dom/minidom.py", line 1747, in writexml writer.write('<?xml version="1.0" encoding="%s"?>%s' % (encoding, newl)) TypeError: must be unicode, not str

Traceback (most recent call last): File "/path/to/minidom_demo.py", line 23, in <module> transform2("sample.xml", "result.xml") File "/path/to/minidom_demo.py", line 18, in transform2 tree.writexml(fd, encoding="utf-8") ... File "/System/Library/Frameworks/Python.framework/Versions/2.7/lib/python2.7/xml/dom/minidom.py", line 298, in _write_data writer.write(data) UnicodeEncodeError: 'ascii' codec can't encode character u'\xe7' in position 4: ordinal not in range(128)

1条回答

网友

1楼 · 发布于 2024-10-06 12:29:36

writexml方法似乎总是转储str。阅读文档告诉我，它的encoding参数只将编码属性添加到XML头中。你知道吗

Changed in version 2.3: For the Document node, an additional keyword argument encoding can be used to specify the encoding field of the XML header.

您可以尝试：

fd.write(tree.toxml(encoding="utf-8").decode("utf-8"))

上面的代码将XML保存为UTF-8，并在XML头中指定编码。你知道吗

如果不指定编码，它仍将另存为UTF-8，但编码属性不会包含在标头中。你知道吗

fd.write(tree.toxml())

如果指定编码，但不decode()，它将引发异常，因为toxml()返回str，这很奇怪。你知道吗

TypeError: write() argument 1 must be unicode, not str

相关问题更多 >

编程相关推荐

热门问题

热门文章