<p>I looked at the link S.Ghoshal provided and came up with the following:</p>
<pre><code># Open the input file and read every line
with open('foods.txt') as your_file:
    reviews = your_file.readlines()

reviews_array = []
dictionary = {}
# Go through every line; a line without ":" (such as a blank line) ends the current review
for review in reviews:
    # Split only on the first ":" so colons inside the review text are kept
    this_line = review.split(":", 1)
    if len(this_line) > 1:
        # The part before ":" is the dictionary key, the part after it is the content
        dictionary[this_line[0]] = this_line[1].strip()
    elif dictionary:
        # A blank line was found: save the current review and reset for the next one
        reviews_array.append(dictionary)
        dictionary = {}
# Append the last review if the file does not end with a blank line
if dictionary:
    reviews_array.append(dictionary)

# Write the text of every review to the output file, one review per line
with open("output.txt", "a") as f1:
    for r in reviews_array:
        # Skip records that have no review/text field
        if 'review/text' in r:
            print(r['review/text'], file=f1)
</code></pre>
<p>Now all the words from the lines starting with review/text are dumped into a file. Next, I need to build a list of all the unique words.</p>
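<p>One possible way to approach that next step is sketched here. It is only a sketch under some assumptions: the helper name <code>unique_words</code> is hypothetical, a "word" is taken to be a run of letters, digits, or apostrophes, and words are compared case-insensitively. Two sample lines stand in for the contents of output.txt.</p>

```python
import re


def unique_words(lines):
    """Return the unique lowercase words found in `lines`, in first-seen order."""
    seen = set()
    words = []
    for line in lines:
        # Assumption: a word is a run of letters, digits, or apostrophes
        for word in re.findall(r"[a-z0-9']+", line.lower()):
            if word not in seen:
                seen.add(word)
                words.append(word)
    return words


# Sample lines standing in for the review text written to output.txt
sample = ["I love this tea.", "This tea is great, I love it!"]
print(unique_words(sample))
```

<p>To run it on the real data, the file object can probably be passed in directly, for example <code>with open("output.txt") as f: words = unique_words(f)</code>. A set does the duplicate check, while the separate list keeps the words in the order they first appear.</p>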