Pandas如何对多指标进行条件选择

!curl -O http://pbpython.com/extras/sales-funnel.xlsx df = pd.read_excel('./sales-funnel.xlsx') df['Status'] = df['Status'].astype('category') df["Status"].cat.set_categories(["won","pending","presented","declined"],inplace=True) table = pd.pivot_table(df, index=['Manager', 'Status'], values=['Price', 'Quantity'], columns=['Product'], aggfunc={'Price':[np.sum, np.mean], 'Quantity':len}, fill_value=0 )

3条回答

网友

1楼 · 编辑于 2024-06-25 23:24:22

你的经典答案由@ScottBoston提供。在

除了@jezrael的IndexSlice方法之外，我还将添加这一点作为广度和视角您也可以使用^{}来获取横截面

table.xs(['Debra Henley', 'won'])

                Product    
Quantity  len   CPU                1
                Maintenance        0
                Monitor            0
                Software           0
Price     mean  CPU            65000
                Maintenance        0
                Monitor            0
                Software           0
          sum   CPU            65000
                Maintenance        0
                Monitor            0
                Software           0
Name: (Debra Henley, won), dtype: int64

网友

2楼 · 编辑于 2024-06-25 23:24:22

可以，您可以使用：

table.loc[[('Debra Henley', 'won')]]

要返回熊猫数据帧，或者可以使用：

^{pr2}$

还一个熊猫系列。在

您可以参考this文档。在

网友

3楼 · 编辑于 2024-06-25 23:24:22

对于更简单的选择（仅索引或仅列），使用^{}方法或按tuples选择。在

另一个更通用的解决方案是slicers：

idx = pd.IndexSlice
#output is df
print (table.loc[[idx['Debra Henley','won']]])
                    Quantity                               Price              \
                         len                                mean               
Product                  CPU Maintenance Monitor Software    CPU Maintenance   
Manager      Status                                                            
Debra Henley won           1           0       0        0  65000           0   


                                        sum                               
Product             Monitor Software    CPU Maintenance Monitor Software  
Manager      Status                                                       
Debra Henley won          0        0  65000           0       0        0

^{pr2}$

但是，对于更复杂的选择，如果需要将筛选索引和列放在一起，一个xs不起作用：

idx = pd.IndexSlice
#select all rows where first level is Debra Henley in index and 
#in columns second level is len and sum
print (table.loc[idx['Debra Henley',:], idx[:, ['len', 'sum'], :]])
                       Quantity                               Price  \
                            len                                 sum   
Product                     CPU Maintenance Monitor Software    CPU   
Manager      Status                                                   
Debra Henley won              1           0       0        0  65000   
             pending          1           2       0        0  40000   
             presented        1           0       0        2  30000   
             declined         2           0       0        0  70000   



Product                Maintenance Monitor Software  
Manager      Status                                  
Debra Henley won                 0       0        0  
             pending         10000       0        0  
             presented           0       0    20000  
             declined            0       0        0

相关问题更多 >

编程相关推荐

热门问题

热门文章