如何迭代嵌套的dict（计数器）并递归地更新键

[{SOURCE1:{TOPIC_A:SCORE1,SCORE2,SCORE3}, {TOPIC_B:SCORE1,SCORE2,SCORE3}, {TOPIC_C:SCORE1,SCORE2,SCORE3}}, {SOURCE2:{TOPIC_A:SCORE1,SCORE2,SCORE3}, {TOPIC_B:SCORE1,SCORE2,SCORE3}, {TOPIC_C:SCORE1,SCORE2,SCORE3}}...]

sourceDict = {} sourceDictList = [] for row in sourceData: source = row[0] score = row[1] topic = row[2] sourceDict = [source,{topic:score}] sourceDictList.append(sourceDict) sourceList.append(source)

sourceCounter = Counter(sourceList) for key,val in sourceCounter.items(): for dictitem in sourceDictList: if dictitem[0] == key: sourceCounter[key] = dictitem[1]

2条回答

网友

1楼 · 编辑于 2024-10-03 23:19:26

您可以简单地使用集合的defaultdict

sourdata = [['source', 'topic', 2],['source', 'topic', 3], ['source', 'topic2', 3],['source2', 'topic', 4]]

from collections import defaultdict

sourceDict = defaultdict(dict)


for source, topic, score in sourdata:
    topicScoreDict = sourceDict[source]
    topicScoreDict[topic] = topicScoreDict.get(topic, []) + [score]

>>> print(sourceDict)
>>> defaultdict(<class 'dict'>, {'source': {'topic': [2, 3], 'topic2': [3]}, 'source2': {'topic': [4]}})
>>> print(dict(sourceDict))
>>> {'source': {'topic': [2, 3], 'topic2': [3]}, 'source2': {'topic': [4]}}

网友

2楼 · 编辑于 2024-10-03 23:19:26

我们可以做到：

sourceData = [
    ['source1', 'topic1', 'score1'],
    ['source1', 'topic2', 'score1'],
    ['source1', 'topic1', 'score2'],

    ['source2', 'topic1', 'score1'],
    ['source2', 'topic2', 'score2'],
    ['source2', 'topic1', 'score3'],
]

sourceDict = {}

for row in sourceData:
    source = row[0]
    topic = row[1]
    score = row[2]

    if source not in sourceDict:
        # This will be executed when the source
        # comes for the first time.
        sourceDict[source] = {}

    if topic not in sourceDict[source]:
        # This will be executed when the topic
        # inside that source comes for the first time.
        sourceDict[source][topic] = []

    sourceDict[source][topic].append(score)

print(sourceDict)

相关问题更多 >

编程相关推荐

热门问题

热门文章