<p>使用“UTF-8”解码字符串</p>
<pre><code>print test_str.decode("utf-8")
u'Question: The cryptocurrency Bitcoin Cash (BCH/USD) settled at 1368 USD at 07:00 AM UTC at the Bitfinex exchange on Monday, April 23. In your opinion, will BCH/USD trade above 1500 USD (+9.65%) at an\u0443 tim\u0435 b\u0435fore \u0410\u0440ril 28? Indic\u0430t\u043er: 60.76%'
</code></pre>
<p>由于它仍然有一些非ASCII字符(如<a href="https://www.fileformat.info/info/unicode/char/0443/index.htm" rel="nofollow noreferrer">CYRILLIC SMALL LETTER U</a>),我们可以进一步翻译它。完整列表:<a href="https://en.wikipedia.org/wiki/Cyrillic_script_in_Unicode" rel="nofollow noreferrer">Cyrillic Script Wiki</a></p>
<p>使用<a href="https://pypi.org/project/Unidecode/" rel="nofollow noreferrer">unidecode</a></p>
<pre><code>import unidecode
unidecode.unidecode(test_str.decode("utf-8"))
'Question: The cryptocurrency Bitcoin Cash (BCH/USD) settled at 1368 USD at 07:00 AM UTC at the Bitfinex exchange on Monday, April 23. In your opinion, will BCH/USD trade above 1500 USD (+9.65%) at anu time before Arril 28? Indicator: 60.76%'
unidecode.unidecode(test_str.decode("utf-8")).split("before ")
['Question: The cryptocurrency Bitcoin Cash (BCH/USD) settled at 1368 USD at 07:00 AM UTC at the Bitfinex exchange on Monday, April 23. In your opinion, will BCH/USD trade above 1500 USD (+9.65%) at anu time ',
'Arril 28? Indicator: 60.76%']
</code></pre>
<p>注意:如果您不想使用unidecode,我发现本文详细地解释了另一种方法:<a href="https://programminghistorian.org/lessons/transliterating" rel="nofollow noreferrer">Transliterating non-ASCII characters with Python</a></p>