Python返回字符串列表（如果其他列表中存在任何子字符串）

companies = [['zmpEVqsbCUO1aXStxHkSVA', 'palms-car-wash'], ['5T0vKfIJWP1xTnxA7fJ17w', 'meat-and-bread'], ['C0d5kzUx6C19mLcxQyhxCA', 'alamo-drafthouse-cinema-'], ['ch1ercqwoNLpQLxpTb90KQ', 'boston-tea-stop']]

# KEEPING ONLY RESULTS WHERE WE DO NOT FIND THE SUBSTRINGS [x for x in companies if (no_interest[0] not in x[1]) & (no_interest[1] not in x[1]) & (no_interest[2] not in x[1])] # RETURN [['5T0vKfIJWP1xTnxA7fJ17w', 'meat-and-bread'], ['ch1ercqwoNLpQLxpTb90KQ', 'boston-tea-stop']]

2条回答

网友

1楼 · 编辑于 2024-10-02 02:36:32

下面是另一个使用正则表达式的例子，但是（正如亨利·埃克的回答）它假设在任何“无兴趣”元素中都没有干扰正则表达式的特殊字符

import regex as re
pattern = re.compile("|".join(no_interest))
out = [c for c in companies if ((pattern.search(c[0]) == None) and (pattern.search(c[1]) == None))]

网友

2楼 · 编辑于 2024-10-02 02:36:32

Why is the 'AND' statement acting like a 'OR'?

见：DeMorgan's Laws

$DeMorgan's Law$

How can we make this code more pythonic and more efficient?

更像Python：

一种选择是在单独的列表中使用all：

companies = [['zmpEVqsbCUO1aXStxHkSVA', 'palms-car-wash'],
             ['5T0vKfIJWP1xTnxA7fJ17w', 'meat-and-bread'],
             ['C0d5kzUx6C19mLcxQyhxCA', 'alamo-drafthouse-cinema-'],
             ['ch1ercqwoNLpQLxpTb90KQ', 'boston-tea-stop']]

no_interest = ['museum', 'cinema', 'car']

out = [x for x in companies if all([ni not in x[1] for ni in no_interest])]
print(out)

或与not{a4}一起：

out = [x for x in companies if not any([ni in x[1] for ni in no_interest])]

[['5T0vKfIJWP1xTnxA7fJ17w', 'meat-and-bread'],
 ['ch1ercqwoNLpQLxpTb90KQ', 'boston-tea-stop']]

更有效率：

使用类似pandas的库：

import pandas as pd

companies = [['zmpEVqsbCUO1aXStxHkSVA', 'palms-car-wash'],
             ['5T0vKfIJWP1xTnxA7fJ17w', 'meat-and-bread'],
             ['C0d5kzUx6C19mLcxQyhxCA', 'alamo-drafthouse-cinema-'],
             ['ch1ercqwoNLpQLxpTb90KQ', 'boston-tea-stop']]

df = pd.DataFrame(data=companies, columns=['id', 'val'])

no_interest = ['museum', 'cinema', 'car']

out = df[~df['val'].str.contains('|'.join(no_interest))]
print(out)

输出为数据帧

                       id              val
1  5T0vKfIJWP1xTnxA7fJ17w   meat-and-bread
3  ch1ercqwoNLpQLxpTb90KQ  boston-tea-stop

输出为列表

print(out.to_numpy().tolist())

[['5T0vKfIJWP1xTnxA7fJ17w', 'meat-and-bread'],
 ['ch1ercqwoNLpQLxpTb90KQ', 'boston-tea-stop']]

所以我有两个问题：

相关问题更多 >

编程相关推荐

热门问题

热门文章