如何在字符串中搜索关键字、提取该字符串并将其放置在新列中？

def get_category(product): if df['Product Name'].str.contains('Pegasus') or df['Product Name'].str.contains('Metcon'): return product df['Category'] = df['Product Name'].apply(lambda x: get_category(x)) ValueError: The truth value of a Series is ambiguous. Use a.empty, a.bool(), a.item(), a.any() or a.all().

3条回答

网友

1楼 · 编辑于 2024-06-26 02:16:45

您的代码存在以下问题：

您正在传递产品，但在检查时使用的是df["Product Name"]，它返回整个系列
此外，返回值是product。但根据预期的答案，可能是Pegasus或Metcon

我想你想要这样的东西

def get_category(product):
    if "Pegasus" in product:
        return "Pegasus" 
    elif "Metcon" in product:
        return "Metcon"

网友

2楼 · 编辑于 2024-06-26 02:16:45

这个解决方案怎么样，当你有一个新的类别时，你所要做的就是将新的类别添加到cats数组中

import pandas as pd
import numpy as np

df = pd.DataFrame({'Product Name': ['Nike Zoom Pegasus', 'All New Nike Zoom Pegasus 4', 'Metcon 3', 'Nike Metcon 5']})
cats = ["Pegasus","Metcon"]
df["Category"] = df["Product Name"].apply(lambda x: np.intersect1d(x.split(" "),cats)[0])


output
                  Product Name Category
0            Nike Zoom Pegasus  Pegasus
1  All New Nike Zoom Pegasus 4  Pegasus
2                     Metcon 3   Metcon
3                Nike Metcon 5   Metcon

网友

3楼 · 编辑于 2024-06-26 02:16:45

那么：

import pandas as pd

df = {'Product Name': ['Nike Zoom Pegasus', 'All New Nike Zoom Pegasus 4', 'Metcon 3', 'Nike Metcon 5']}

c = set(['Metcon', 'Pegasus'])
categories = [c.intersection(pn.split(' ')) for pn in df['Product Name']]
df['Categories'] = categories

print(df)

>> {'Product Name': ['Nike Zoom Pegasus', 'All New Nike Zoom Pegasus 4', 'Metcon 3', 'Nike Metcon 5'], 'Categories': [{'Pegasus'}, {'Pegasus'}, {'Metcon'}, {'Metcon'}]}

相关问题更多 >

编程相关推荐

热门问题

热门文章

如何在字符串中搜索关键字、提取该字符串并将其放置在新列中？

相关问题 更多 >

编程相关推荐

热门问题

热门文章

相关问题更多 >