擅长:python、mysql、java
<p>在新文件中没有转储<code>res</code>(乏味)。第一个元素是短语,另外两个是频率。使用<code>dict</code>代替<code>set</code>一起进行匹配和映射。在</p>
<pre><code>import csv
import io
alist, blist = [], []
with open("ngrams.csv", "rb") as fileA:
reader = csv.reader(fileA, delimiter=',')
for row in reader:
alist.append(row)
with open("ngramstest.csv", "rb") as fileB:
reader = csv.reader(fileB, delimiter=',')
for row in reader:
blist.append(row)
f_dict = {e[0]:e[1:] for e in alist}
s_dict = {e[0]:e[1:] for e in blist}
res = {}
for k,v in f_dict.items():
if k in s_dict:
res[k] = float(v[1])/float(s_dict[k][1])
print(res)
</code></pre>