如何搜索列表中某一组项目是否存在？

from pprint import pprint as pp targets = open(file) longest_UTR = [] counter = 0 for line in targets: (chromosome, locus, mir, gene, transcript, UTR_length) = line.strip("\n").split("\t") if [locus, mir, gene] not in longest_UTR: longest_UTR.append([locus, mir, gene, transcript, UTR_length]) counter += 1 if counter == 100: break pp (longest_UTR)

['CFI', 'hsa-miR-576-5p', 'DIS3', 'ENST00000490646', '2934'], ['APOE', 'hsa-miR-642a-5p', 'WDR64', 'ENST00000425826', '2122'], >['C2/CFB/SKIV2L', 'hsa-miR-219a-1-3p', 'GLG1', 'ENST00000422840', '4748'], ['C2/CFB/SKIV2L', 'hsa-miR-219a-1-3p', 'GLG1', 'ENST00000422840', '4748']<, ['APOE', 'hsa-miR-330-3p', 'DCAF4L1', 'ENST00000333141', '4764'], ['TMEM97/VTN', 'hsa-miR-144-3p', 'DCAF4L1', 'ENST00000333141', '4764']]

2条回答

网友

1楼 · 编辑于 2024-09-30 14:34:02

列表是不可散列的，因此不能按照您认为的方式来比较两者之间的相等性。列表比较可以使用sets代替

从pprint导入pprint作为pp

targets = open(file)

longest_UTR = []

for line in targets:
    chromosome, locus, mir, gene, transcript, UTR_length = line.strip("\n").split("\t")

    if not [set([locus, mir, gene]) < set(utr) for utr in longest_UTR]:
        longest_UTR.append([locus, mir, gene, transcript, UTR_length)])
pp (longest_UTR)

网友

2楼 · 编辑于 2024-09-30 14:34:02

看起来longest_UTR将是一个列表列表。if语句if [locus, mir, gene] not in longest_UTR将在longest_UTR中搜索列表[locus, mir, gene]，并且永远不会找到它，因为longest_UTR中的子列表都是长度为5的

相反，您可以只搜索每个子列表的前3个元素：

if not any(x[:3] == [locus, mir, gene] for x in longest_UTR):

你应该知道元素的顺序在这里很重要。类似地，如果longest_UTR有一些列表，前3个元素为[mir, locus, gene]，那么这个if语句将返回False

相关问题更多 >

编程相关推荐

热门问题

热门文章