<p>According to the <a href="http://www.crummy.com/software/BeautifulSoup/documentation.html#Beautiful%20Soup%20Gives%20You%20Unicode,%20Dammit" rel="nofollow">BeautifulSoup</a> documentation, all input is converted to Unicode internally:</p>
<pre><code>from BeautifulSoup import BeautifulSoup
soup = BeautifulSoup("Hello")
soup.contents[0]
# u'Hello'
soup.originalEncoding
# 'ascii'
</code></pre>
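<p>If your input does not declare its encoding (for example, in a meta tag), BeautifulSoup will guess. You can disable the guessing by specifying the input's encoding through the <code>fromEncoding</code> parameter.</p>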
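<p>To see why an explicit encoding can matter, here is a minimal stand-alone sketch that uses only the standard library, not BeautifulSoup itself. The sample bytes and the helper name <code>decode_html</code> are hypothetical and chosen purely for illustration.</p>

```python
# -*- coding: utf-8 -*-
# Stand-alone illustration using only the standard library (no BeautifulSoup).
# The sample bytes and the helper name `decode_html` are hypothetical.

# Bytes as they might arrive from a server that sends ISO-8859-1 without declaring it.
raw = u"<p>caf\u00e9</p>".encode("iso-8859-1")


def decode_html(data, encoding):
    """Decode raw bytes with an explicitly chosen encoding instead of guessing."""
    return data.decode(encoding)


# Correct encoding: the accented character survives the round trip.
good = decode_html(raw, "iso-8859-1")

# Wrong guess: the byte 0xE9 followed by '<' is not valid UTF-8,
# so strict decoding raises UnicodeDecodeError.
try:
    decode_html(raw, "utf-8")
    wrong_guess_failed = False
except UnicodeDecodeError:
    wrong_guess_failed = True
```

<p>Passing <code>fromEncoding</code> to the BeautifulSoup constructor plays the same role as the explicit <code>encoding</code> argument here: it tells the parser which encoding to use so that it does not have to guess.</p>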
<p>Or is your real problem that the result looks "broken" when it is printed to the console?</p>