UnicodeDecodeError: 'utf8' codec can't decode byte

Posted 2024-09-27 07:19:16

I am parsing an XML file whose encoding is "iso-8859-15".

Words like "Zürich" and "Aktienr ruckk" get converted to "&#228;" and so on.

I tried the following suggestion:

>>> p = ElementTree.fromstring(u'<p>found "\u62c9\u67cf \u591a\u516c \u56ed"</p>'.encode('utf8'))
>>> p.text
u'found "\u62c9\u67cf \u591a\u516c \u56ed"'
>>> print p.text

But I get errors such as UnicodeDecodeError: 'ascii' codec can't decode byte.

Even this does not help:

[code snippet missing from the source]

I have tried many suggestions from Stack Overflow, but I cannot find a solution.

I need to write the parsed content back to an HTML file that contains the same characters, such as 'ü'.

1 answer

#1 · Posted 2024-09-27 07:19:16

Try this:

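# Python 2
from xml.etree import ElementTree
# Parse UTF-8 encoded bytes; p.text is returned as a unicode object.
p = ElementTree.fromstring(u'<p>found "\u62c9\u67cf \u591a\u516c \u56ed"</p>'.encode('utf8'))
# Encode the text to UTF-8 bytes explicitly before printing.
print p.text.encode('utf8')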

found "拉柏 多公 园"
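The snippet above is Python 2. For readers on Python 3, a rough equivalent might look like the sketch below. It assumes the same markup as the answer; the variable names `xml_bytes`, `text`, and `utf8_bytes` are illustrative and not part of the original answer.

```python
from xml.etree import ElementTree

# Same markup as in the answer, encoded to UTF-8 bytes before parsing.
xml_bytes = '<p>found "\u62c9\u67cf \u591a\u516c \u56ed"</p>'.encode('utf-8')
p = ElementTree.fromstring(xml_bytes)

# In Python 3, p.text is already a str (Unicode text), so no decode step is needed.
text = p.text

# Encode only at the boundary where bytes are required (file, socket, ...).
utf8_bytes = text.encode('utf-8')
print(utf8_bytes)
```

In Python 3 the text and bytes types are separate, so the implicit 'ascii' conversion that raises the error in the question does not occur here.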

For example:

[code snippet missing from the source]

Don't forget to put # -*- coding: utf-8 -*- at the top of the file.
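To tie this back to the question, here is a minimal Python 3 sketch of the whole round trip: parse ISO-8859-15 XML, then write the text to an HTML file in the same encoding. The sample document, the `out.html` file name, and the HTML wrapper are all hypothetical stand-ins for the asker's data. It also shows one likely source of the "&#228;"-style output: `ElementTree.tostring` serializes to ASCII by default and replaces non-ASCII characters with numeric character references.

```python
import os
import tempfile
from xml.etree import ElementTree

# Hypothetical sample document standing in for the asker's file.
xml_text = '<?xml version="1.0" encoding="iso-8859-15"?><city>Zürich</city>'
xml_bytes = xml_text.encode('iso-8859-15')

# Pass raw bytes so the parser reads the encoding from the XML declaration.
root = ElementTree.fromstring(xml_bytes)

# Default serialization is ASCII: non-ASCII characters become character references.
as_ascii = ElementTree.tostring(root)

# Naming an encoding keeps the characters themselves in the output bytes.
as_latin9 = ElementTree.tostring(root, encoding='iso-8859-15')

# Write the parsed text into an HTML file using the same encoding.
html = '<html><body><p>{}</p></body></html>'.format(root.text)
with tempfile.TemporaryDirectory() as tmp:
    path = os.path.join(tmp, 'out.html')  # hypothetical output path
    with open(path, 'w', encoding='iso-8859-15') as f:
        f.write(html)
    with open(path, 'rb') as f:
        written = f.read()
```

The design choice is to keep text as `str` everywhere inside the program and to name the encoding explicitly at each point where bytes are read or written.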
