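<p>What exactly is a unicode string?</p>
<p>What is the difference between a regular string and a unicode string?</p>
<p>What is utf-8?</p>
<p>I'm trying to learn Python right now, and I keep hearing this buzzword. What does the code below do?</p>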
<p><strong>i18n Strings (Unicode)</strong></p>
<pre><code>> ustring = u'A unicode \u018e string \xf1'
> ustring
u'A unicode \u018e string \xf1'
## (ustring from above contains a unicode string)
> s = ustring.encode('utf-8')
> s
'A unicode \xc6\x8e string \xc3\xb1' ## bytes of utf-8 encoding
> t = unicode(s, 'utf-8') ## Convert bytes back to a unicode string
> t == ustring ## It's the same as the original, yay!
True
</code></pre>
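<p>For comparison, here is a minimal sketch of the same round trip written to run on Python 3. It assumes that on Python 3 every <code>str</code> is already a Unicode string and that <code>encode</code> returns a separate <code>bytes</code> object. The names <code>text</code>, <code>encoded</code>, and <code>decoded</code> are illustrative and do not appear in the snippet above.</p>

```python
# Assumes Python 3: str holds Unicode code points, bytes holds raw bytes.
text = u'A unicode \u018e string \xf1'

# Encoding turns the code points into UTF-8 bytes.
encoded = text.encode('utf-8')

# Decoding turns those bytes back into a Unicode string.
decoded = encoded.decode('utf-8')

print(repr(encoded))             # the raw UTF-8 bytes
print(decoded == text)           # the round trip preserves the text
print(len(text), len(encoded))   # character count vs. byte count
```

<p>The two lengths differ because each of the two non-ASCII characters takes two bytes in UTF-8, while every ASCII character takes one.</p>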
<p><strong>Files and Unicode</strong></p>
<pre><code>import codecs
f = codecs.open('foo.txt', 'rU', 'utf-8')
for line in f:
    # here line is a *unicode* string
    pass
</code></pre>
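<p>A self-contained variant of the file example, as a sketch only: it swaps <code>codecs.open</code> for <code>io.open</code>, which also accepts an <code>encoding</code> argument, and it first writes a small sample file so that there is something to read. The temporary path and the sample contents are made up for illustration.</p>

```python
import io
import os
import tempfile

# Hypothetical sample file, created here so the sketch runs on its own.
path = os.path.join(tempfile.mkdtemp(), 'foo.txt')
with io.open(path, 'w', encoding='utf-8') as out:
    out.write(u'caf\xe9\n\u018e\n')

# io.open decodes the UTF-8 bytes on disk while the file is being read,
# so each line arrives as a Unicode string rather than raw bytes.
with io.open(path, 'r', encoding='utf-8') as f:
    lines = [line.rstrip(u'\n') for line in f]

print(lines == [u'caf\xe9', u'\u018e'])
```

<p>Using <code>with</code> closes the file automatically, which the original loop leaves to the caller.</p>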