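<p>Consider downloading the Natural Language Toolkit (<a href="http://www.nltk.org/" rel="noreferrer">NLTK</a>). Then you can split text into sentences without breaking on things like "U.S.A." or failing to split sentences that end in "?!".</p>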
<pre><code>>>> import nltk
>>> # sent_tokenize relies on NLTK's Punkt tokenizer data (a one-time download via nltk.download)
>>> paragraph = "Hi, this is my first sentence. And this is my second. Yet this is my third."
>>> sentences = nltk.sent_tokenize(paragraph)
>>> sentences
['Hi, this is my first sentence.', 'And this is my second.', 'Yet this is my third.']
</code></pre>
<p>Your code becomes much more readable. To access the second sentence, you use the index notation you are already used to.</p>
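<p>A minimal sketch of that indexing step, written for Python 3. The helper name <code>split_sentences</code> is hypothetical and not part of NLTK. It calls <code>nltk.sent_tokenize</code> when NLTK and its tokenizer data are available; otherwise it falls back to a naive regex split, which is only an assumption for illustration and will not handle abbreviations such as "U.S.A." correctly.</p>

```python
import re


def split_sentences(text):
    """Hypothetical helper: split text into a list of sentences."""
    try:
        import nltk
        # Raises LookupError if the Punkt tokenizer data is not installed.
        return nltk.sent_tokenize(text)
    except (ImportError, LookupError):
        # Naive fallback (assumption, for illustration only): split on
        # whitespace that follows '.', '!' or '?'.
        return re.split(r"(?<=[.!?])\s+", text.strip())


paragraph = "Hi, this is my first sentence. And this is my second. Yet this is my third."
sentences = split_sentences(paragraph)

# The result is an ordinary list, so ordinary zero-based indexing applies.
second = sentences[1]
print(second)
```

<p>Because the result is a plain Python list, slicing (<code>sentences[1:]</code>) and iteration work the same way.</p>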