Pandas: add a column containing the index of the matching row from another DataFrame

-----|---------|----------|-----
 ... | Country | Business | ...
-----|---------|----------|-----
     | A       | 1        |
     | A       | 1        |
     | A       | 2        |
     | A       | 2        |
     | B       | 1        |
     | B       | 1        |
     | B       | 2        |
     | C       | 1        |
     | C       | 2        |
-----|---------|----------|-----

2 answers

User

Answer 1 · edited 2024-10-04 11:25:14

You can use the numpy.where function to match the two DataFrames.

For example:

import numpy as np
import pandas as pd

datadf = pd.DataFrame([['USA','Business1'],['AUS','Business2'],['UK','Business3'],['IND','Business4']],
                      columns=['country','business'])
configdf = pd.DataFrame([['AUS','Business2'],['IND','Business4'],['USA','Business1'],['UK','Business3']],
                        columns=['country','business'])

# For each row of datadf, take the position of the first configdf row that has a matching cell
datadf['new_col'] = datadf.apply(lambda x: np.where(x == configdf)[0][0], axis=1)
print(datadf)

Output:

  country   business  new_col
0     USA  Business1        2
1     AUS  Business2        0
2      UK  Business3        3
3     IND  Business4        1
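To see why the expression ends in `[0][0]`, it helps to look at what `np.where` returns for a two-dimensional boolean mask. The sketch below uses a small hand-written mask (illustrative, not taken from the answer) in place of `x == configdf`:

```python
import numpy as np

# Hypothetical mask standing in for `x == configdf`:
# one row per configdf row, one column per compared column.
mask = np.array([
    [False, False],  # no cell matches
    [True,  True],   # both cells match
    [False, True],   # only the second cell matches
])

# With a single argument, np.where returns a tuple of index arrays,
# one per dimension, listing every True cell in row-major order.
rows, cols = np.where(mask)
print(rows)  # row positions of the True cells
print(cols)  # column positions of the True cells

# [0][0] therefore picks the row position of the first True cell.
first_row = int(rows[0])
print(first_row)
```

Note that the partially matching third row also shows up in `rows`. Comparing the whole row at once only gives the intended result when no earlier row of configdf shares a single value with the row being looked up.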

Edit 1:

OK, in that case you can compare each column explicitly:

datadf['new_col'] = datadf.apply(lambda x: (np.where((x['country'] == configdf['country']) & (x['business'] == configdf['business']))[0][0]),axis=1)

Output for the example DataFrames datadf and configdf:

  country business  new_col
0       A        1        0
1       A        1        0
2       A        2        1
3       A        2        1
4       B        1        2
5       B        1        2
6       B        2        3
7       C        1        4
8       C        2        5
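One caveat with the `[0][0]` lookup: it raises an IndexError for any row that has no match in configdf. The following is a minimal sketch of a more defensive variant, assuming that returning -1 for unmatched rows is acceptable. The helper name `first_match_position` and the unmatched sample row ('D', 1) are illustrative and not part of the original answer.

```python
import numpy as np
import pandas as pd

def first_match_position(row, config, keys):
    """Return the position of the first row in `config` that equals `row`
    on every column in `keys`, or -1 if there is no such row."""
    mask = np.ones(len(config), dtype=bool)
    for key in keys:
        mask &= (config[key] == row[key]).to_numpy()
    hits = np.flatnonzero(mask)
    return int(hits[0]) if hits.size else -1

# Small sample data; the last row ('D', 1) deliberately has no match.
datadf = pd.DataFrame({'country': ['A', 'A', 'B', 'D'],
                       'business': [1, 2, 1, 1]})
configdf = pd.DataFrame({'country': ['A', 'A', 'B'],
                         'business': [1, 2, 1]})

datadf['new_col'] = datadf.apply(
    first_match_position, axis=1, args=(configdf, ['country', 'business'])
)
print(datadf)
```

The values are positions, not index labels. They coincide with configdf's index only while configdf keeps its default 0..n-1 index.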

User

Answer 2 · edited 2024-10-04 11:25:14

Here is a solution using pandas merge.

https://pandas.pydata.org/pandas-docs/stable/reference/api/pandas.DataFrame.merge.html#pandas.DataFrame.merge

import pandas as pd

# make the two dataframes
data = pd.DataFrame({'Country':['A','A','A','A','B','B','B','C','C'],
                     'Business':[1,1,2,2,1,1,2,1,2]})

configdf = pd.DataFrame({'Country':['A','A','B','B','C','C'],
                         'Business':[1,2,1,2,1,2]})

# make a column with the index values
configdf.reset_index(inplace=True)

# merge the two dataframes based on the selected columns.
newdf = data.merge(configdf, on=['Country', 'Business'])
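Since `merge` defaults to `how='inner'`, any row of data without a counterpart in configdf would be dropped from the result. The sketch below is one possible variant, assuming you want to keep such rows: it uses a left merge and renames the generated `index` column. The column name `config_index` and the extra unmatched row ('D', 1) are hypothetical additions for illustration.

```python
import pandas as pd

data = pd.DataFrame({'Country': ['A', 'A', 'B', 'C', 'D'],
                     'Business': [1, 2, 1, 2, 1]})
configdf = pd.DataFrame({'Country': ['A', 'A', 'B', 'B', 'C', 'C'],
                         'Business': [1, 2, 1, 2, 1, 2]})

# Turn configdf's index into a regular column without modifying configdf itself.
lookup = configdf.reset_index().rename(columns={'index': 'config_index'})

# A left merge keeps every row of `data`, in its original order;
# rows with no match get NaN in config_index.
newdf = data.merge(lookup, on=['Country', 'Business'], how='left')
print(newdf)
```

Because of the NaN for the unmatched row, `config_index` comes back as a float column in this example.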
