Set不返回字母数字元组列表中的唯一元素为什么？（Python 3.6版）

#Import modules import numpy as np import pandas as pd #Define trial sets s1 = ["A", "B", "C", "D", "E"] s2 = ["A", "B", "C"] s3 = ["A", "B", "F"] s4 = ["A", "B", "G", "H", "I"] s5 = ["X", "Y", "Z"] slist = [s1,s2,s3,s4,s5] #Create an empty list to append results to result1 = [] #Calculate Jaccard index between every entry #This is computationally inefficient as most computations are performed twice to generate a full results matrix to make mapping easy. Making half a matrix is more complicated but would be possible within the loop. Empty values would still have to be coded for though so in terms of storage of the final results matrix I don't think there should be much difference for i in range(len(slist)): for j in range(len(slist)): result1.append(len(set(slist[i]).intersection(slist[j]))/len(set(slist[i]).union(slist[j]))) #Define result matrix dimensions shape = (len(slist), len(slist)) #Convert list to array for numpy rarray = np.array(result1) pathway_names = ["Pathway1", "Pathway2", "Pathway3", "Pathway4", "Pathway5"] dataframe = pd.DataFrame(data = rmatrix, index = pathway_names, columns = pathway_names) #List all pathways with Jaccard index > x unless PathwayName = PathwayName x = 0.5 temp =[] #A temporary list for holding lists of tuples which will contain permutations

for k in range(len(slist)): index = dataframe.index[dataframe.iloc[k]>x] for l in range(len(index)): if index[l] != dataframe.columns[k]: temp.append((index[l], dataframe.columns[k], dataframe.iloc[l,k])) print(set(temp))

1条回答

网友

1楼 · 发布于 2024-10-01 02:25:15

问题是元组是有序的，因此('Pathway1', 'Pathway2', 0.6)不等于('Pathway2', 'Pathway1', 0.6)。你知道吗

要解决此问题，请将temp初始化为set并对任何元组排序，然后再将其添加到元组中。你知道吗

temp = set()
for ...:
    ...
    the_tuple = ...
    temp.add(tuple(sorted(the_tuple)))
print(temp)

相关问题更多 >

编程相关推荐

热门问题

热门文章