Python查找CSV中出现最多的单词

[(' Trojan.PowerShell.LNK.Gen.2', 3), (' Suspicious ZIP!lnk', 2), (' HEUR:Trojan-Downloader.WinLNK.Powedon.a', 2), (' TROJ_FR.8D496570', 2), ('Trojan.PowerShell.LNK.Gen.2', 1), (' Trojan.PowerShell.LNK.Gen.2 (B)', 1), (' Win32.Trojan-downloader.Powedon.Lrsa', 1), (' PowerShell.DownLoader.466', 1), (' malware (ai score=86)', 1), (' Probably LNKScript', 1), (' virus.lnk.powershell.a', 1), (' Troj/LnkPS-A', 1), (' Trojan.LNK', 1)]

1条回答

网友

1楼 · 发布于 2024-07-03 07:58:47

让，my_values = ['A', 'B', 'C', 'A', 'Z', 'Z' ,'X' , 'A' ,'X','H','D' ,'A','S', 'A', 'Z']是要排序的单词列表。你知道吗

现在取一个列表，它将存储每个单词出现的信息。你知道吗

count_dict={}

用适当的值填充字典：

for i in my_values:
    if count_dict.get(i)==None: #If the value is not present in the dictionary then this is the first occurrence of the value
        count_dict[i]=1
    else:
        count_dict[i] = count_dict[i]+1 #If previously found then increment it's value

现在根据dict的出现情况对其值进行排序：

sorted_items= sorted(count_dict.items(),key=operator.itemgetter(1),reverse=True)

现在你有你的预期结果了！最常见的3个值是：

print(sorted_items[:3])

输出：

[('A', 5), ('Z', 3), ('X', 2)]

最常见的两个值是：

print(sorted_items[:3])

输出：

[('A', 5), ('Z', 3)]

等等。你知道吗

相关问题更多 >

编程相关推荐

热门问题

热门文章