Python格式UnicodeDecodeE

网友

1楼 · 编辑于 2024-10-03 00:24:29

解决方案非常简单，我使用了get_absolute_url而不是{}。抱歉打扰你了。在

网友

2楼 · 编辑于 2024-10-03 00:24:29

检查要格式化的参数类型，我猜它们是'str'，不是'unicode'。在使用它们之前，对它们进行适当的编码，例如：

url = request.user.get_absolute_url
if isinstance(url, str):
    print 'url was str'
    a = url.decode('utf-8')
msg = u'Uživatel <a href="{0}">...</a>').format(url)

（if和{}语句仅用于演示目的）相应地使用其他值。在

网友

3楼 · 编辑于 2024-10-03 00:24:29

您的user查找返回的是编码的bytestring，而不是Unicode对象。在

当Python2.x被要求连接Unicode和bytestrings编码时，它通过使用默认编码将bytestring解码为Unicode来实现的，默认编码是ascii，除非您进行一些更改。# -*- coding: utf-8 -*-指令设置源代码的编码，而不是系统默认编码。在

从测试format来看，它似乎试图转换参数以匹配左侧的类型。在

在2.x下，只要您使用的bytestring可以使用ascii解码，一切都将正常工作：

>>> u'test\u270c {0}'.format('bar')
u'test\u270c bar'

当然，您正在另一个Unicode对象中格式化：

^{pr2}$

如果在格式化之前省略u，则会得到一个UnicodeEncodeError：

>>> 'foo {0}'.format(u'test\u270c')
Traceback (most recent call last):
  File "<stdin>", line 1, in <module>
UnicodeEncodeError: 'ascii' codec can't encode character u'\u270c' in position 4: ordinal not in range(128)

相反，如果将非ascii字节的编码字符串格式化为Unicode对象，则会得到一个UnicodeDecodeError：

>>> u'foo {0}'.format(test.encode('utf-8'))
Traceback (most recent call last):
  File "<stdin>", line 1, in <module>
UnicodeDecodeError: 'ascii' codec can't decode byte 0xe2 in position 4: ordinal not in range(128)

我首先检查get_absolute_url实现。有效的url永远不能包含未转义的非ascii字符，因此它们应该始终可以被ascii解码，但是如果您使用的是标准Django模型first_name和{}构建的东西应该是Unicode对象，所以我首先打赌get_absolute_url的错误实现。在

相关问题更多 >

编程相关推荐

热门问题

热门文章