在带有“safe”参数的utf-8字符串上使用python的urllib.quote撸plus

quote_plus(name.encode('utf-8'), safe=':/') Output: --------------------------------------------------------------------------- UnicodeDecodeError Traceback (most recent call last) <ipython-input-164-556248391ee1> in <module>() ----> 1 quote_plus(v, safe=':/') /usr/lib/python2.7/urllib.pyc in quote_plus(s, safe) 1273 s = quote(s, safe + ' ') 1274 return s.replace(' ', '+') -> 1275 return quote(s, safe) 1276 1277 def urlencode(query, doseq=0): /usr/lib/python2.7/urllib.pyc in quote(s, safe) 1264 safe = always_safe + safe 1265 _safe_quoters[cachekey] = (quoter, safe) -> 1266 if not s.rstrip(safe): 1267 return s 1268 return ''.join(map(quoter, s)) UnicodeDecodeError: 'ascii' codec can't decode byte 0xc3 in position 10: ordinal not in range(128)

3条回答

网友

1楼 · 编辑于 2024-05-09 23:10:40

#!/usr/bin/env python
# -*- coding: utf-8 -*-
from __future__ import unicode_literals
import urllib
name = u'Mayte_Martín'
print urllib.quote_plus(name.encode('utf-8'), safe=':/')

对我来说没问题（Py 2.7.9，Debian）

（我不知道答案，但我不能就名誉发表评论）

网友

2楼 · 编辑于 2024-05-09 23:10:40

我在回答我自己的问题，这样可以帮助其他面临同样问题的人。

当您在执行任何其他操作之前在当前工作区中进行以下导入时，会出现此特定问题。

from __future__ import unicode_literals

这与下面的代码序列不兼容。

from urllib import quote_plus

name = u'Mayte_Martín'
quote_plus(name.encode('utf-8'), safe=':/')

不导入unicode字符的相同代码可以正常工作。

网友

3楼 · 编辑于 2024-05-09 23:10:40

根据this bug，以下是解决方法：

#!/usr/bin/env python
# -*- coding: utf-8 -*-
from __future__ import unicode_literals
from urllib import quote_plus
name = u'Mayte_Martín'
quote_plus(name.encode('utf-8'), safe=':/'.encode('utf-8'))

必须在quote或quote_plus方法中同时使用encode两个参数才能utf-8

相关问题更多 >

编程相关推荐

热门问题

热门文章