如何有效地统计字典列表中每个键的出现次数？

import numpy as np positive_feature=[[{'a':2,'b':1},1], [{'b':2,'c':1},1] ] negative_feature=[[{'e':2,'b':1},0] ] alltokens=['a','b','c','e'] dic=dict((t,i) for i,t in enumerate(alltokens)) vacabulary_size=len(dic) positive_doc_frequency,negative_doc_frequency=np.zeros(vacabulary_size), np.zeros(vacabulary_size) for t in alltokens: for x in positive_feature: if t in x[0].keys(): positive_doc_frequency[dic[t]]+=1 for x in negative_feature: if t in x[0].keys(): negative_doc_frequency[dic[t]]+=1

2条回答

网友

1楼 · 编辑于 2024-10-05 10:06:02

from itertools import chain
from collections import Counter
c = Counter(chain.from_iterable(d for d, x in positive_feature))
print(*sorted(c.items()))

这将列出positive_feature中的所有键，然后统计每个键的数量，然后打印计数。在

想要得到你想要的计数，就去做

^{pr2}$

网友

2楼 · 编辑于 2024-10-05 10:06:02

我不知道为什么你的代码会运行那么长的时间，即使有一个大的数据集。在

在Python中，count occurrences of things有很多方法。我发现标准库中的collections.Counter是最快的方法（毫不奇怪，因为它只针对这个用例进行了优化）。在

在代码中使用collections.Counter将如下所示：

from collections import Counter

positive_doc_frequency = Counter()
negative_doc_frequency = Counter()

for t in alltokens:
    for x in positive_feature:
        positive_doc_frequency.update(x[0].keys())
    for x in negative_feature:
        negative_doc_frequency.update(x[0].keys())

相关问题更多 >

编程相关推荐

热门问题

热门文章