Pandas：如何找到一个群体的百分比？

网友

1楼 · 编辑于 2024-09-22 10:23:00

您还可以计算和合并数据帧

import pandas as pd

data = {
    "Fund": ["1000", "1000", "2000", "2000", "3000", "3000", "4000", "4000"],
    "State": ["AL", "AL", "FL", "FL", "AL", "AL", "NC", "NC"],
    "Compensation": [2000, 2500, 1500, 1750, 4000, 3200, 1450, 3000],
}
# Create dataframe from dictionary provided
df = pd.DataFrame.from_dict(data)

# first group compensation by state and fund 
df_fund = df.groupby(["Fund", "State"]).Compensation.sum().reset_index()

# Calculate Total by state in new df
df_total = df_fund.groupby("State").Compensation.sum().reset_index()

# Merge dataframes with total column
merged = df_fund.merge(df_total, how="outer", left_on="State", right_on="State")

#Add percentage col to merged dataframe. 
merged["percentage"] = merged["Compensation_x"] / merged["Compensation_y"] * 100

网友

2楼 · 编辑于 2024-09-22 10:23:00

这里有一个解决方案。您可以首先执行groupby以获得最低级别的聚合，然后使用groupby转换将这些值除以状态总数

agg = df.groupby(['Fund','State'],as_index=False)['Compensation'].sum()
agg['percentage'] = (agg['Compensation'] / agg.groupby('State')['Compensation'].transform(sum)) * 100

agg.to_dict()
{'Fund': {0: '1000', 1: '2000', 2: '3000', 3: '4000'},
'State': {0: 'AL', 1: 'FL', 2: 'AL', 3: 'NC'},
 'Compensation': {0: 4500, 1: 3250, 2: 7200, 3: 4450},
 'percentage': {0: 38.46153846153847,
  1: 100.0,
  2: 61.53846153846154,
  3: 100.0}}

网友

3楼 · 编辑于 2024-09-22 10:23:00

这应该可以完成以下工作：

df['total_state_compensataion'] = df.groupby('State')['Compensation'].transform(sum)
df['total_state_fund_compensataion'] = df.groupby(['State','Fund'])['Compensation'].transform(sum)
df['ratio']=df['total_state_fund_compensataion'].div(df['total_state_compensataion'])
>>>df.groupby(['State','Fund'])['ratio'].mean().to_dict()

out[1] {('AL', '1000'): 0.38461538461538464,
 ('AL', '3000'): 0.6153846153846154,
 ('FL', '2000'): 1.0,
 ('NC', '4000'): 1.0}

相关问题更多 >

编程相关推荐

热门问题

热门文章

Pandas：如何找到一个群体的百分比？

相关问题 更多 >

编程相关推荐

热门问题

热门文章

相关问题更多 >