How to find identical source texts that have different translations

import collections

source = [1, 1, 2, 3, 4, 4, 4, 5, 6]
target = [1, 2, 2, 3, 1, 2, 3, 5, 6]

duplicate_1 = [item for item, count in collections.Counter(source).items() if count > 1]
duplicate_2 = [item for item, count in collections.Counter(target).items() if count > 1]

def getIndexPositions(listOfElements, element):
    ''' Returns the indexes of all occurrences of the given element in the list listOfElements '''
    indexPosList = []
    indexPos = 0
    while True:
        try:
            # Search for item in list from indexPos to the end of list
            indexPos = listOfElements.index(element, indexPos)
            # Add the index position in list
            indexPosList.append(indexPos)
            indexPos += 1
        except ValueError as e:
            break
    return indexPosList

indexPosList = []
for i in duplicate_1:
    indexPosList = getIndexPositions(source, i)
    print(indexPosList)
    for i in indexPosList:
        print(source[i])
        for x in indexPosList:
            if target[i] == target[x]:
                print('same target')
            else:
                print(source[i], 'different target : first value is : ', target[i], ' ##### and ###### second value is: ', target[x])

2 answers

User
#1 · edited 2024-09-30 20:20:49

I believe this does something similar to what you want:
def get_translation(source, target):
    output = {}
    for name, trans in zip(source, target):
        if name in output:
            output[name].append(trans)
        else:
            output[name] = [trans]
    return output
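Rather than collecting indices and building a dictionary of integers, this copies the strings contained in source and target.
It iterates over both lists at the same time. If name is not yet in the dictionary output, it is added with a list containing trans as its value. If name is already in the dictionary, trans is appended to the end of that list.
So the input:
source = ['text1', 'text2', 'text3', 'text2']
target = ['trans1', 'trans2', 'trans3', 'trans4']
produces the output:
{'text1': ['trans1'], 'text2': ['trans2', 'trans4'], 'text3': ['trans3']}
Input:
source = [1, 1, 2, 3, 4, 4, 4, 5, 6]
target = [1, 2, 2, 3, 1, 2, 3, 5, 6]
Output:
{1: [1, 2], 2: [2], 3: [3], 4: [1, 2, 3], 5: [5], 6: [6]}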
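
To get only the source entries that were translated in more than one way, which is what the question title asks for, the grouped dictionary can be filtered. The following is a minimal sketch, not part of the answer above: the name find_conflicting_translations is hypothetical, collections.defaultdict stands in for the explicit if/else, and it assumes source and target are aligned lists of equal length.

```python
from collections import defaultdict


def find_conflicting_translations(source, target):
    """Map each source entry to its distinct translations, keeping only
    entries that have more than one distinct translation.

    Hypothetical helper for illustration; assumes source and target are
    aligned item by item.
    """
    grouped = defaultdict(list)
    for name, trans in zip(source, target):
        # Record each distinct translation once, in first-seen order.
        if trans not in grouped[name]:
            grouped[name].append(trans)
    # Keep only the entries with conflicting translations.
    return {name: translations
            for name, translations in grouped.items()
            if len(translations) > 1}


print(find_conflicting_translations(
    ['text1', 'text2', 'text3', 'text2'],
    ['trans1', 'trans2', 'trans3', 'trans4']))
# prints {'text2': ['trans2', 'trans4']}

print(find_conflicting_translations(
    [1, 1, 2, 3, 4, 4, 4, 5, 6],
    [1, 2, 2, 3, 1, 2, 3, 5, 6]))
# prints {1: [1, 2], 4: [1, 2, 3]}
```

A source entry that repeats with the same translation every time (for example 'a' translated twice as 'x') is dropped, because its list of distinct translations has length 1.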

User
#2 · edited 2024-09-30 20:20:49

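Step 1: find the duplicated entries.
Step 2: get the indices of each duplicated entry.
Step 3: check whether the translations are equal. In the example here, text3 is used twice but has the same translation both times.
Step 4: append to a list of dictionaries? (I did not understand that format.)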
source = ['text1', 'text2', 'text3', 'text2', 'text3', 'text1']
target = ['trans1', 'trans2', 'trans3', 'trans4', 'trans3', 'trans6']

def get_repeated_translations(source, target):
    double_translation_entries = []
    # Step 1: collect every entry that occurs more than once.
    # dict.fromkeys removes duplicates while keeping first-seen order,
    # so the order of the result is reproducible (a set would not guarantee it).
    repeated_entries = list(dict.fromkeys(
        x for i, x in enumerate(source)
        if source.count(x) > 1 and source.index(x) < i))
    for repeated_entry in repeated_entries:
        # Step 2: all indices at which this entry occurs.
        indices_of_repeated_entry = [i for i, x in enumerate(source) if x == repeated_entry]
        entry_translation = target[indices_of_repeated_entry[0]]
        # Step 3: compare every translation with the first one.
        for translated_index in indices_of_repeated_entry:
            if target[translated_index] != entry_translation:
                # Step 4: record {first index: all indices} for this entry.
                double_translation_entries.append({source.index(repeated_entry): indices_of_repeated_entry})
                break
    return double_translation_entries

print(get_repeated_translations(source, target))
Result:
[{1: [1, 3]}, {0: [0, 5]}]
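
The function above calls source.count and source.index inside loops, so its running time grows quadratically with the list length. The following is a rough sketch of the same four steps in a single pass over the data. It is not the answerer's code: the name get_repeated_translations_single_pass is hypothetical, and it assumes the entries of both source and target are hashable. It returns the same {first index: all indices} format, but always reports entries in order of first occurrence.

```python
def get_repeated_translations_single_pass(source, target):
    """Return [{first_index: all_indices}, ...] for every source entry that
    occurs with at least two different translations.

    Hypothetical variant for illustration; assumes source and target are
    aligned item by item and that their entries are hashable.
    """
    # Steps 1 and 2: group the indices of each source entry in one pass.
    indices_by_entry = {}
    for i, entry in enumerate(source):
        indices_by_entry.setdefault(entry, []).append(i)

    result = []
    for indices in indices_by_entry.values():
        # Step 3: more than one distinct translation means a conflict.
        distinct_translations = {target[i] for i in indices}
        if len(distinct_translations) > 1:
            # Step 4: record {first index: all indices}.
            result.append({indices[0]: indices})
    return result


source = ['text1', 'text2', 'text3', 'text2', 'text3', 'text1']
target = ['trans1', 'trans2', 'trans3', 'trans4', 'trans3', 'trans6']
print(get_repeated_translations_single_pass(source, target))
# prints [{0: [0, 5]}, {1: [1, 3]}]
```

An entry that occurs only once has a single translation, so it never passes the conflict check and needs no separate duplicate test.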
