<p>下面是我该怎么做的。在</p>
<pre><code>stopwords = ['a', 'agora', 'algum', 'alguma', 'aquele', 'aqueles', 'de', 'deu', 'do', 'e', 'estou', 'esta', 'esta',
'ir', 'meu', 'muito', 'mesmo', 'no', 'nossa', 'o', 'outro', 'para', 'que', 'sem', 'talvez', 'tem', 'tendo',
'tenha', 'teve', 'tive', 'todo', 'um', 'uma', 'umas', 'uns', 'vou']
def remove_stopwords(text):
phrases = []
for (sentence, _) in text:
sentence_without_stopwords = [word for word in sentence.split() if word not in stopwords]
phrases.append(sentence_without_stopwords)
return phrases
with open('input.txt') as raw_text:
sentence_sentiments = []
lines = [line for line in raw_text]
for line in lines:
sentence, sentiment = line.split(',')
sentence_sentiments.append((sentence[1:-1], sentiment[1:-1]))
print(remove_stopwords(sentence_sentiments))
</code></pre>
<p>请注意,在您提供的代码中,<code>baseTeste</code>是一个数组,它包含字符串列表,表示输入文件的行。这不是您想要的,因为您试图在这些行中的<code>(sentence, sentiment)</code>对上循环(<code>for(palavras, emocao) in texto:</code>)。因此,您错过了将每一行拆分为<code>(sentence, sentiment)</code>对的中间步骤。在</p>