在python中只从列表中提取相关信息

info = [['Price: 5000', 'In warranty', 'Weight: 8 kg'], ['Refundable', 'Price: 2800', 'Weight: 5.5 kg', 'Extra battery power'], ['Price: 9000', 'Non-exchangeable', 'Weight: 8 kg', 'High-Quality']..]

3条回答

网友

1楼 · 编辑于 2024-10-01 22:42:34

可以使用in关键字查看一个字符串（或列表）是否包含另一个字符串。您可以使用any关键字一次检查多个项目。在

info = [
    ['Price: 5000', 'In warranty', 'Weight: 8 kg'], 
    ['Refundable', 'Price: 2800', 'Weight: 5.5 kg', 'Extra battery power'], 
    ['Price: 9000', 'Non-exchangeable', 'Weight: 8 kg', 'High-Quality']
]

keywords = ['Price', 'Weight']

for item in info:
    print([x for x in item if any(kw in x for kw in keywords)])

输出：

^{pr2}$

此数据的更干净的格式可能是使用字典。在

info = [
    {
        'Price': 5000, 
        'Weight': '8 kg',
        'Attributes': ['In warranty'] 
    },
    {
        'Price': 2800, 
        'Weight': '5.5 kg',
        'Attributes': ['Refundable', 'Extra battery power'] 
    },
    {
        'Price': 9000, 
        'Weight': '8 kg',
        'Attributes': ['Non-exchangeable', 'High-Quality'] 
    }
]

keywords = ['Price', 'Weight']

info_filterd = [{k: v for k, v in item.items() if k in keywords} for item in info]
print(info_filterd)

输出：

[
    {
        "Price": 5000,
        "Weight": "8 kg"
    },
    {
        "Price": 2800,
        "Weight": "5.5 kg"
    },
    {
        "Price": 9000,
        "Weight": "8 kg"
    }
]

网友

2楼 · 编辑于 2024-10-01 22:42:34

使用函数编程的一个线性函数（map、filter和any）

info = [
    ['Price: 5000', 'In warranty', 'Weight: 8 kg'], 
    ['Refundable', 'Price: 2800', 'Weight: 5.5 kg', 'Extra battery power'], 
    ['Price: 9000', 'Non-exchangeable', 'Weight: 8 kg', 'High-Quality']
]

keywords = ['Price', 'Weight']

l = map(lambda sub_list: list(filter(lambda element: any(map(lambda keyword: keyword in element, keywords)), sub_list)), info)

print(list(l))

输出：

^{pr2}$

一层衬里各部分的说明

map(lambda sub_list: list(filter(lambda element: any(map(lambda keyword: keyword in element, keywords)), sub_list)), info)

迭代应用lambda函数的所有信息元素

filter(lambda element: any(map(lambda keyword: keyword in element, keywords)), sub_list)

在sub_list的所有值中，获取至少包含一个关键字的值（filter）

any(map(lambda keyword: keyword in element, keywords))

如果关键字中的任何关键字出现在元素中，则返回true或false

注意：list（）用于展开生成器

网友

3楼 · 编辑于 2024-10-01 22:42:34

使用difflib.SequenceMatcher（doc）的一个可能的解决方案。但是，可能需要对比率进行一些调整：

from difflib import SequenceMatcher

info = [['Price: 5000', 'In warranty', 'Weight: 8 kg'],
        ['Refundable', 'Price: 2800', 'Weight: 5.5 kg', 'Extra battery power'],
        ['Price: 9000', 'Non-exchangeable', 'Weight: 8 kg', 'High-Quality']]

keywords = ['Price', 'Weight']

out = []
for i in info:
    out.append([])
    for item in i:
        if any(SequenceMatcher(None, item.lower(), kw.lower()).ratio() > 0.5 for kw in keywords):
            out[-1].append(item)

from pprint import pprint
pprint(out)

印刷品：

^{pr2}$

一层衬里各部分的说明

相关问题更多 >

编程相关推荐

热门问题

热门文章