Fill NA values in df.col1 based on df2.col2; the two DataFrames differ in size

import numpy as np
import pandas as pd

data1 = {'Prod_id': ['PR1', 'PR2', 'PR3', 'PR4', 'PR2', 'PR3', 'PR1', 'PR4'],
         'store': ['store1', 'store2', 'store3', 'store6', 'store3', 'store8', 'store45', 'store23'],
         'item_wt': [28, np.nan, 29, 42, np.nan, 34, 87, np.nan]}
df1 = pd.DataFrame(data1)

data2 = {'Item_name': ['PR1', 'PR2', 'PR3', 'PR4'], 'mean_wt': [18, 12, 22, 9]}
df2 = pd.DataFrame(data2)

The final df should look like:

data1 = {'Prod_id': ['PR1', 'PR2', 'PR3', 'PR4', 'PR2', 'PR3', 'PR1', 'PR4'],
         'store': ['store1', 'store2', 'store3', 'store6', 'store3', 'store8', 'store45', 'store23'],
         'item_wt': [28, 12, 29, 42, 12, 34, 87, 9]}
df1 = pd.DataFrame(data1)

1 answer

User

#1 · Posted on 2024-10-03 02:43:57

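You can use Series.fillna and assign the NumPy array obtained through .values, because the filled Series is indexed by Prod_id while df1 keeps its original index: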

df1['item_wt'] = (df1.set_index('Prod_id')['item_wt']
                     .fillna(df2.set_index('Item_name')['mean_wt']).values)
print(df1)
  Prod_id    store  item_wt
0     PR1   store1     28.0
1     PR2   store2     12.0
2     PR3   store3     29.0
3     PR4   store6     42.0
4     PR2   store3     12.0
5     PR3   store8     34.0
6     PR1  store45     87.0
7     PR4  store23      9.0
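
To make the index mismatch concrete, here is a minimal sketch that rebuilds the question's data (the store column is left out for brevity) and inspects the intermediate result. The variable name filled is only illustrative, and .to_numpy() is used here as an equivalent of .values.

```python
import numpy as np
import pandas as pd

df1 = pd.DataFrame({
    'Prod_id': ['PR1', 'PR2', 'PR3', 'PR4', 'PR2', 'PR3', 'PR1', 'PR4'],
    'item_wt': [28, np.nan, 29, 42, np.nan, 34, 87, np.nan],
})
df2 = pd.DataFrame({'Item_name': ['PR1', 'PR2', 'PR3', 'PR4'],
                    'mean_wt': [18, 12, 22, 9]})

# fillna looks up the fill values by index label (here the product ids)
filled = (df1.set_index('Prod_id')['item_wt']
             .fillna(df2.set_index('Item_name')['mean_wt']))

# The result is labelled by Prod_id, while df1 still has its default 0..7 index
print(filled.index.tolist())
print(df1.index.tolist())

# Dropping the labels keeps the row order, so assigning by position is safe
df1['item_wt'] = filled.to_numpy()
print(df1['item_wt'].tolist())
```

Because set_index does not reorder rows, the array lines up with df1 row by row even though the labels no longer match.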

Or use Series.map first, so the replacement values are already aligned with the index of df1:

s = df2.set_index('Item_name')['mean_wt']
df1['item_wt'] = df1['item_wt'].fillna(df1['Prod_id'].map(s))
# alternative:
# df1['item_wt'] = df1['item_wt'].combine_first(df1['Prod_id'].map(s))
print(df1)
  Prod_id    store  item_wt
0     PR1   store1     28.0
1     PR2   store2     12.0
2     PR3   store3     29.0
3     PR4   store6     42.0
4     PR2   store3     12.0
5     PR3   store8     34.0
6     PR1  store45     87.0
7     PR4  store23      9.0
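
The map-based variant can also be wrapped in a small helper. This is a sketch only: the function name fill_from_lookup and the extra product id 'PR9' are hypothetical and not part of the question. The 'PR9' row illustrates that an id missing from df2 simply stays NaN.

```python
import numpy as np
import pandas as pd

def fill_from_lookup(df, key_col, value_col, lookup):
    """Return a copy of df with NaN in value_col filled from lookup[key]."""
    out = df.copy()
    # map() translates each key into its lookup value (NaN if the key is absent)
    out[value_col] = out[value_col].fillna(out[key_col].map(lookup))
    return out

# 'PR9' is a made-up id that has no entry in df2
df1 = pd.DataFrame({'Prod_id': ['PR1', 'PR2', 'PR9'],
                    'item_wt': [28, np.nan, np.nan]})
df2 = pd.DataFrame({'Item_name': ['PR1', 'PR2', 'PR3', 'PR4'],
                    'mean_wt': [18, 12, 22, 9]})

s = df2.set_index('Item_name')['mean_wt']
result = fill_from_lookup(df1, 'Prod_id', 'item_wt', s)
print(result)
```

Working on a copy leaves the input frame untouched, which makes it easy to compare the values before and after the fill.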
