How do I remove items from a list based on another list in Python?

all_words_merged = ['ego', 'femina', 'incenderare', 'tuus', 'casa', 'et', 'cutullus', 'incipere', 'et', 'wingardium', 'leviosa']
class_words_merged = ['femina', 'incenderare', 'incipere', 'wingardium']

3 Answers

User

#1 · Edited 2024-09-30 12:26:37

You can also do this with the built-in filter function, as shown below:

>>> all_words_merged = ['ego', 'femina', 'incenderare', 'tuus', 'casa', 'et', 'cutullus', 'incipere', 'et', 'wingardium', 'leviosa']
>>> class_words_merged = ['femina', 'incenderare', 'incipere', 'wingardium']
>>>
>>> list(filter(lambda x: x not in class_words_merged, all_words_merged))
['ego', 'tuus', 'casa', 'et', 'cutullus', 'et', 'leviosa']

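The list() call is required in Python 3 because filter returns a lazy filter object. In Python 2 it is not needed, since filter already returns a list, so this is enough: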

>>> filter(lambda x: x not in class_words_merged, all_words_merged)
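A small sketch of the Python 3 behaviour described above, assuming a Python 3 interpreter: filter returns an iterator that can be consumed only once, so it has to be wrapped in list() when a real list is needed. The names lazy, first_pass and second_pass are only illustrative.

```python
all_words_merged = ['ego', 'femina', 'incenderare', 'tuus', 'casa', 'et',
                    'cutullus', 'incipere', 'et', 'wingardium', 'leviosa']
class_words_merged = ['femina', 'incenderare', 'incipere', 'wingardium']

# In Python 3, filter() returns a lazy iterator, not a list.
lazy = filter(lambda x: x not in class_words_merged, all_words_merged)
is_list = isinstance(lazy, list)

# Materialize it once; a second pass over the same iterator yields nothing.
first_pass = list(lazy)
second_pass = list(lazy)

print(is_list)
print(first_pass)
print(second_pass)
```

If the result is needed more than once, keep the list rather than the filter object.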

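Edit:

Of course, this is not the fastest approach, because the lambda is called for every element and the filter object then has to be converted to a list, as the timings below suggest: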

>>> import timeit
>>> timeit.timeit(stmt='list(filter(lambda x: x not in c, a))', globals={'a':all_words_merged, 'c':class_words_merged})
2.6026250364160717
>>> timeit.timeit(stmt='[x for x in a if x not in c]', globals={'a':all_words_merged, 'c':class_words_merged})
1.3826178676799827

User

#2 · Edited 2024-09-30 12:26:37

Iterate over all_words_merged and keep only the words that are not in class_words_merged:

result = [x for x in all_words_merged if x not in class_words_merged]

Output:

['ego', 'tuus', 'casa', 'et', 'cutullus', 'et', 'leviosa']
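The comprehension above builds a new list. If the existing list object has to be updated in place (for example because other code holds a reference to it), slice assignment is one option. This is only a sketch under that assumption; the name alias is hypothetical and stands in for such a second reference.

```python
all_words_merged = ['ego', 'femina', 'incenderare', 'tuus', 'casa', 'et',
                    'cutullus', 'incipere', 'et', 'wingardium', 'leviosa']
class_words_merged = ['femina', 'incenderare', 'incipere', 'wingardium']

alias = all_words_merged  # hypothetical second reference to the same list

# Slice assignment replaces the contents without creating a new list object,
# so every reference to the list sees the filtered result.
all_words_merged[:] = [x for x in all_words_merged if x not in class_words_merged]

print(alias is all_words_merged)
print(alias)
```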

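Edit

If class_words_merged is large or may contain duplicates, converting it to a set first gives better performance, because set membership tests take constant time on average: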

cwm_set = set(class_words_merged)
result = [x for x in all_words_merged if x not in cwm_set]
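The set-based version can be wrapped in a small helper. The function name remove_items is hypothetical, and the sketch assumes the items are hashable; the order and the duplicates of the first list are preserved.

```python
def remove_items(items, to_remove):
    """Return a new list with every element of to_remove dropped from items."""
    blocked = set(to_remove)  # requires hashable items; duplicates collapse here
    return [x for x in items if x not in blocked]


all_words_merged = ['ego', 'femina', 'incenderare', 'tuus', 'casa', 'et',
                    'cutullus', 'incipere', 'et', 'wingardium', 'leviosa']
class_words_merged = ['femina', 'incenderare', 'incipere', 'wingardium']

# Repeating the removal list does not change the result.
result = remove_items(all_words_merged, class_words_merged * 2)
print(result)
```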

User

#3 · Edited 2024-09-30 12:26:37

If class_words_merged is large, converting it to a set first speeds things up:

>>> to_remove = set(class_words_merged)
>>> [word for word in all_words_merged if word not in to_remove]
['ego', 'tuus', 'casa', 'et', 'cutullus', 'et', 'leviosa']

Some timings

With a removal list 100 times larger:

large_class_words_merged = class_words_merged * 100

Building the set first:

%%timeit
to_remove = set(large_class_words_merged)
[word for word in all_words_merged if word not in to_remove]
1000 loops, best of 3: 493 µs per loop

Searching the list every time:

%timeit [word for word in all_words_merged if word not in large_class_words_merged]
100 loops, best of 3: 3.18 ms per loop

Note:

%timeit and %%timeit are IPython magic commands that I use in a Jupyter Notebook.
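Outside IPython, a similar comparison can be approximated with the standard timeit module. This is a rough sketch: the function names with_set and with_list and the run count of 200 are arbitrary choices, and absolute timings depend on the machine.

```python
import timeit

all_words_merged = ['ego', 'femina', 'incenderare', 'tuus', 'casa', 'et',
                    'cutullus', 'incipere', 'et', 'wingardium', 'leviosa']
class_words_merged = ['femina', 'incenderare', 'incipere', 'wingardium']
large_class_words_merged = class_words_merged * 100


def with_set():
    # Build the set on every run, as in the %%timeit cell.
    to_remove = set(large_class_words_merged)
    return [word for word in all_words_merged if word not in to_remove]


def with_list():
    # Membership test scans the large list for each word.
    return [word for word in all_words_merged if word not in large_class_words_merged]


# Both variants must produce the same result.
same = with_set() == with_list()

set_time = timeit.timeit(with_set, number=200)
list_time = timeit.timeit(with_list, number=200)
print(f"set first: {set_time:.4f} s, list only: {list_time:.4f} s")
```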
