Pandas: generating a column by matching dataframe columns against several other columns

Code DF

Code1  Code2  Code3  Code4  Code5
Eur    xxx    xxx    xxx    xxx
xxx    xxx    xxx    ESP    xxx
ASI    xxx    xxx    xxx    xxx
xxx    BRA    xxx    xxx    xxx
xxx    AUS    xxx    xxx    xxx
xxx    xxx    NOR    xxx    xxx
xxx    xxx    xxx    PRT    xxx
xxx    xxx    xxx    xxx    SGP

Country1 DF

Country-Code  Region
Eur           Europe
ASI           Asia
BRA           America
AUS           Asia
NOR           Europe

Country2 DF

Country Code  Region
ESP           Europe
PRT           Europe
SGP           Asia
ASI           Asia

2 answers

User

#1 · edited 2024-09-26 22:54:17

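A better way to store the country-code mappings is in dictionaries. I assume country_dict1 and country_dict2 are the code:region mappings built from each of the two country dataframes: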

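import numpy as np
import pandas as pd

def determine_region(row):
    # check the last two values of the row (last one first) against country_dict1
    for item in row.iloc[:-3:-1]:
        if item in country_dict1:
            return country_dict1[item]
    # then check the remaining values, from third-to-last back to the first
    for item2 in row.iloc[-3::-1]:
        if item2 in country_dict2:
            return country_dict2[item2]
    # no value in the row matched either mapping
    return np.nan

df['Region'] = df.apply(determine_region, axis=1)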
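The answer takes country_dict1 and country_dict2 as given. A minimal sketch of how they might be built from the two country dataframes follows. The frame names country1 and country2 and the column name "Country Code" are assumptions made for illustration, and the values are copied from the sample tables in the question.

```python
import pandas as pd

# Hypothetical lookup frames mirroring the question's Country1 and Country2 tables.
country1 = pd.DataFrame({
    "Country-Code": ["Eur", "ASI", "BRA", "AUS", "NOR"],
    "Region": ["Europe", "Asia", "America", "Asia", "Europe"],
})
country2 = pd.DataFrame({
    "Country Code": ["ESP", "PRT", "SGP", "ASI"],  # column name is an assumption
    "Region": ["Europe", "Europe", "Asia", "Asia"],
})

# Use the code column as the index, then export the Region column as a dict.
country_dict1 = country1.set_index("Country-Code")["Region"].to_dict()
country_dict2 = country2.set_index("Country Code")["Region"].to_dict()
```

Each dictionary then maps a code string to its region, which is the shape determine_region expects for its membership tests and lookups.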

User

#2 · edited 2024-09-26 22:54:17

You can do this with a list comprehension:

def determine_region(df_row):
    # if/else chain to make a decision for each row;
    # a Python built-in set could make the membership
    # tests more readable
    ...  # placeholder: return the region for this row

# build a list with one region per row using a comprehension;
# iterating over CodeDF directly would yield column names, so use iterrows()
x = [determine_region(row) for _, row in CodeDF.iterrows()]
# add the list as a new column named Region
CodeDF['Region'] = x
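The determine_region function above leaves the decision logic open. One possible way to fill it in is sketched below, assuming a single merged code-to-region dictionary (the name code_to_region is hypothetical) and a rule that the first recognised code in a row decides the region. The sample rows are illustrative only, shaped like the question's Code DF.

```python
import numpy as np
import pandas as pd

# Illustrative data shaped like the question's Code DF ("xxx" is a non-code value).
CodeDF = pd.DataFrame(
    [
        ["Eur", "xxx", "xxx", "xxx", "xxx"],
        ["xxx", "xxx", "xxx", "ESP", "xxx"],
        ["xxx", "BRA", "xxx", "xxx", "xxx"],
        ["xxx", "xxx", "xxx", "xxx", "xxx"],
    ],
    columns=["Code1", "Code2", "Code3", "Code4", "Code5"],
)

# Assumed merged mapping covering codes from both country tables.
code_to_region = {"Eur": "Europe", "ESP": "Europe", "BRA": "America", "SGP": "Asia"}

def determine_region(df_row):
    # Return the region of the first value in the row that is a known code.
    for value in df_row:
        if value in code_to_region:
            return code_to_region[value]
    # No known code in this row.
    return np.nan

# One region per row, collected with a list comprehension.
regions = [determine_region(row) for _, row in CodeDF.iterrows()]
CodeDF["Region"] = regions
```

Assigning the plain list keeps the values in row order regardless of the dataframe's index, which avoids the index alignment that happens when a pd.Series is assigned instead.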

Other resources

Appending Column to Pandas DF

List Comprehensions

Sets and Operations with Sets
