temp1 = open('yourfile1.txt')
file1 = temp1.read().split(',')
temp1.close()
temp2 = open('yourfile2.txt')
file2 = temp2.read.split(',')
dict1 = {x : file1.count(x) for x in file1}
dict2 = {x : file2.count(x) for x in file2}
similars = set(file1).intersection(set(file2))
rate1 = sum(dict1[w] for w in similars)/len(file1)
rate2 = sum(dict2[w] for w in similars)/len(file2)
print("The rate of similarity is: ", rate1*rate2*100, '%')
print("The similar words are: ", similars)
一个简单的解决方案是,在读取文件并将其拆分为单词后,只需循环浏览这些文件,首先将它们分配到字典以计算每个文件出现的次数,然后计算相对比率,检查哪些单词同时出现在两个文件中,并计算最终比率:
当然,
yourfile1.txt
和yourfile2.txt
是文件名。你知道吗相关问题 更多 >
编程相关推荐