根据各种条件提取数据

Name Segment Axis 1 2 3 4 5 0 Amazon 1 slope NaN 100 120 127 140 1 Amazon 1 x 0.0 1.0 2.0 3.0 4.0 2 Amazon 1 y 0.0 0.4 0.8 1.2 1.6 3 Amazon 2 slope NaN 50 57 58 59 4 Amazon 2 x 0.0 2.0 4.0 6.0 8.0 5 Amazon 2 y 0.0 1.0 2.0 3.0 4.0

s=df.set_index(['Name' , 'Segment','Axis']).stack().unstack('Axis') s=s.dropna(subset=["slope"]).sort_values("slope").reset_index(level=2, drop=True) df3=pd.merge(s, df2, on=['Name', 'Segment'], how='left') df3[df3['slope']>df3['Optimal_Cost']].groupby(['Name', 'Segment']).first().reset_index()

1条回答

网友

1楼 · 发布于 2024-06-17 13:25:25

让我们继续使用@wwnde解决方案，并对其进行一些更改：

s=df.set_index(['Name','Segment','Axis']).stack().unstack(2)
s=s.sort_values("slope").reset_index(level=2, drop=True) 
#In above code we don't have to drop nan
out=pd.merge(s, df2, on=['Name',  'Segment'], how='left')
cond=out['slope'].gt(out['Optimal Cost']) | out['slope'].isna()
#make changes in condition to include nan's
out=out[cond].groupby(['Name','Segment'],as_index=False).first().drop('Optimal Cost',1)

out的输出：

    Name    Segment     slope   x       y
0   Amazon  1           120.0   2.0     0.8
1   Amazon  2           NaN     0.0     0.0

相关问题更多 >

编程相关推荐

热门问题

热门文章