如果一行不存在，则在Python中检查并相应地指定一个值

import pandas as pd import numpy as np df = pd.DataFrame([['Circle', 'Circle', 'Polygon', 'Polygon',"Trapezoid"], [0, 1, 0, 1,1], [28152, 9168, 24741, 11402,5000]], ['nom_1', 'target', 'id']).T

ColumnTarget = df[["id","nom_1","target"]] ColumnGrouped = ColumnTarget.groupby(["nom_1","target"]).count()["id"].reset_index() ColumnCalculation = ColumnGrouped.groupby("nom_1").apply(lambda row: (row[row.target ==1]["id"].iloc[0]) / (row[row.target ==0]["id"].iloc[0] + row[row.target ==1]["id"].iloc[0]))

ColumnTarget = df[["id","nom_1","target"]] ColumnGrouped = ColumnTarget.groupby(["nom_1","target"]).count()["id"].reset_index() ColumnCalculation = ColumnGrouped.groupby("nom_1").apply(lambda row: 0 if row[row.target ==1].all() is False else (1 if row[row.target ==0].all() is False else ((row[row.target ==1]["id"].iloc[0]) / (row[row.target ==0]["id"].iloc[0] + row[row.target ==1]["id"].iloc[0]))))

2条回答

网友

1楼 · 编辑于 2024-09-22 14:39:10

使用transform和div

df['id'].div(df.groupby('nom_1').id.transform('sum'), axis=0)

       nom_1 target     id     ratio
0     Circle      0  28152  0.754341
1     Circle      1   9168  0.245659
2    Polygon      0  24741  0.684531
3    Polygon      1  11402  0.315469
4  Trapezoid      1   5000         1

很明显，您可以编辑这个df来可视化那些带有target == 1的行

df[df.target == 1]

       nom_1 target     id     ratio
1     Circle      1   9168  0.245659
3    Polygon      1  11402  0.315469
4  Trapezoid      1   5000         1

网友

2楼 · 编辑于 2024-09-22 14:39:10

使用index对齐计算（我添加了一个shape missing Target==1）。这假设您在['nom_id', 'target']上没有任何重复的内容：

df = pd.DataFrame([['Circle', 'Circle', 'Polygon', 'Polygon',"Trapezoid", 'Octagon'], 
                   [0, 1, 0, 1, 1, 0], [28152, 9168, 24741, 11402,5000, 6000]], 
                   ['nom_1', 'target', 'id']).T 

df = df.set_index('nom_1')
u = df.loc[df.target.eq(1), 'id']
v = df.loc[df.target.eq(0), 'id']

                                    # - 0 When Target == 1 is missing
                                    # |
s = u.divide(u.add(v, fill_value=0)).fillna(0)
#nom_1
#Circle       0.245659
#Octagon      0.000000
#Polygon      0.315469
#Trapezoid    1.000000
#Name: id, dtype: float64

相关问题更多 >

编程相关推荐

热门问题

热门文章