Adding up all matches in a string

User

Reply #1 · edited 2024-10-04 01:29:19

There are many ways to do this. For example:

You can keep a running total

total = 0
for aminos in aa:
    # No need to check if aminos in protein because .count() returns 0 if that's the case
    total += protein.count(aminos)

You can write a generator expression and use sum() to add up the count() value for each amino in aa

total = sum(protein.count(amino) for amino in aa)

You can iterate over protein and check whether each character is in aa. First, though, convert aa to a set to make the membership checks cheaper

s_aa = set(aa)
total = sum(p in s_aa for p in protein)

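This works because p in s_aa evaluates to True if p is in s_aa and to False otherwise. True counts as one and False counts as zero, so when you sum a collection of True/False values, you get the number of True values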
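To make the True/False summing concrete, here is a small sketch. The protein and aa values are sample inputs assumed for illustration (the protein string is the one used in another reply in this thread), and the flags list is a hypothetical intermediate added only so the individual results can be inspected.

```python
# Sample inputs, assumed for illustration only
protein = "MSRSLLLRFLLFLLLLPPLP"
aa = ["M", "L"]

s_aa = set(aa)

# bool is a subclass of int, so True adds 1 and False adds 0 when summed
flags = [p in s_aa for p in protein]
total = sum(flags)

print(flags[:5])  # membership results for the first five characters
print(total)      # number of characters of protein that are in s_aa
```

Building the list is only for inspection; the generator expression version avoids storing the intermediate values.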

Count every character in protein, then sum the counts of the characters you care about:

counts = {}
for p in protein:
    ct = counts.get(p, 0) # get counts[p], default to 0 if not exists
    counts[p] = ct + 1

total = sum(counts.get(amino, 0) for amino in aa)

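Vignesh's technique is the same as this approach. Counting the elements is better than Hamza's approach because it iterates over the protein string only once, rather than once for each element of aa. This is also why my third and fourth approaches are better than #1 and #2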
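As a rough check, the four approaches from this reply can be put side by side on one sample input. This is only a sketch: the function names are hypothetical, and the protein and aa values are assumed sample inputs (borrowed from another reply in this thread).

```python
def total_by_loop(protein, aa):
    # Approach 1: running total, one pass over protein per element of aa
    total = 0
    for amino in aa:
        total += protein.count(amino)
    return total


def total_by_sum(protein, aa):
    # Approach 2: same work as approach 1, written as a generator expression
    return sum(protein.count(amino) for amino in aa)


def total_by_membership(protein, aa):
    # Approach 3: a single pass over protein with set membership checks
    s_aa = set(aa)
    return sum(p in s_aa for p in protein)


def total_by_counts(protein, aa):
    # Approach 4: count every character once, then look up the ones in aa
    counts = {}
    for p in protein:
        counts[p] = counts.get(p, 0) + 1
    return sum(counts.get(amino, 0) for amino in aa)


# Sample inputs, assumed for illustration only
protein = "MSRSLLLRFLLFLLLLPPLP"
aa = ["M", "L"]

approaches = (total_by_loop, total_by_sum, total_by_membership, total_by_counts)
results = [f(protein, aa) for f in approaches]
print(results)
```

The approaches agree when aa holds distinct single characters. If aa contains a duplicate, the versions that sum over aa count that character twice, while the set-based version counts it once.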

User

Reply #2 · edited 2024-10-04 01:29:19

The simplest way is probably:

sum(protein.count(a) for a in aa)

You can also get the individual counts, like this:

all_counts = {a:protein.count(a) for a in aa}

Result: {'M': 1, 'L': 10}

If you only need the total, you can sum the values:

sum(all_counts.values())

Result: 11

User

Reply #3 · edited 2024-10-04 01:29:19

There are many ways to do this. Here is one of them

from collections import Counter

protein = "MSRSLLLRFLLFLLLLPPLP"
aminos = ['M', 'L']

# Count occurrences of all characters
amino_counter = Counter(protein)
total_count = 0

# Only consider the counts of aminos that matter
for amino in aminos:
    total_count += amino_counter.get(amino, 0)

print(total_count)
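A more compact variant of the same idea, as a sketch: a Counter returns 0 for a key it has not seen, so plain indexing works in place of .get(amino, 0). The extra 'W' entry is not part of the original answer; it is added here only to show the missing-key behaviour.

```python
from collections import Counter

protein = "MSRSLLLRFLLFLLLLPPLP"
# 'W' is an assumption added for illustration; it does not occur in protein
aminos = ['M', 'L', 'W']

# Count occurrences of all characters in one pass
amino_counter = Counter(protein)

# Indexing a Counter with a missing key yields 0 and does not insert the key
total_count = sum(amino_counter[amino] for amino in aminos)

print(total_count)
```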
