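<p>If you cannot use <code>chardet</code> and have no way of specifying the encoding in advance, then I think the only option left is to guess it. You could do it like this:</p>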
<pre><code># Add whichever codecs you want to the list, but end it with one like latin1 that never fails
codecs = ["utf-8", "euc-kr", "shift-jis", "latin1"]

def try_decode(text):
    # text must be a byte string (bytes in Python 3, str in Python 2)
    for codec in codecs:
        try:
            # The first codec that decodes without error wins
            return text.decode(codec)
        except UnicodeError:
            continue
</code></pre>
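<p>Because this is only a guess, it can help to know which codec actually matched. The following is a minimal sketch of that idea, assuming Python 3 and byte-string input; the names <code>CANDIDATE_CODECS</code> and <code>decode_with_codec</code> are hypothetical and not part of any library. A successful decode does not prove the guess is right, since a permissive codec such as latin1 accepts any byte sequence.</p>

```python
# Hypothetical variant of the fallback loop: it also reports which codec succeeded.
# Order matters: stricter codecs go first, a never-failing one (latin1) goes last.
CANDIDATE_CODECS = ["utf-8", "euc-kr", "shift-jis", "latin1"]


def decode_with_codec(data, codecs=CANDIDATE_CODECS):
    """Return (decoded_text, codec_name) for the first codec that decodes data."""
    for codec in codecs:
        try:
            return data.decode(codec), codec
        except UnicodeError:
            continue
    # Only reachable if the list does not end with a codec that accepts any bytes
    raise ValueError("no candidate codec could decode the data")


if __name__ == "__main__":
    korean_bytes = "안녕하세요".encode("euc-kr")
    text, used = decode_with_codec(korean_bytes)
    print(used, text)
```

<p>Returning the codec name alongside the text makes it easy to log or flag results that fell through to the last-resort codec, which are the ones most likely to be wrong.</p>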