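Can multiple variables be used as conditions when splitting a DataFrame into two groups? (Pandas)

Splitting the death-cases dataset into two groups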

highpop_highdeath = df.iloc[(df['StatePopulation'] > 4342705.0), (df['deaths_to_cases'] > 0.012143070253953211).values]
highpop_highdeath.name = 'States with a high population and high death rate'
highpop_lowdeath = df.iloc[(df['StatePopulation'] > 4342705.0), (df['deaths_to_cases'] <= 0.012143070253953211).values]
highpop_lowdeath.name = 'States with a high population and low death rate'

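3 answers

User

Reply #1 · edited 2024-10-01 15:33:03

To combine several conditions in one filter, wrap each condition in parentheses and join them with the boolean operator &: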

highpop_highdeath = df.loc[(df['StatePopulation'] > 4342705.0) & (df['deaths_to_cases'] > 0.012143070253953211), :]
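The parentheses matter because & binds more tightly than > in Python. Here is a minimal sketch of what goes wrong without them. The toy DataFrame and its values are made up for illustration; only the column names and cutoffs come from the question.

```python
import pandas as pd

# Hypothetical toy data: column names follow the question, values are invented.
df = pd.DataFrame({
    "name": ["A", "B", "C", "D"],
    "StatePopulation": [10_000_000, 8_000_000, 1_000_000, 500_000],
    "deaths_to_cases": [0.020, 0.005, 0.030, 0.004],
})

# With parentheses: each comparison yields a boolean Series, then & combines them.
ok = df.loc[(df["StatePopulation"] > 4342705.0) & (df["deaths_to_cases"] > 0.012143070253953211), :]

# Without parentheses Python first tries 4342705.0 & df["deaths_to_cases"],
# which is not a valid operation on float data, so the expression fails.
error_raised = False
try:
    df.loc[df["StatePopulation"] > 4342705.0 & df["deaths_to_cases"] > 0.012143070253953211, :]
except (TypeError, ValueError):
    error_raised = True

print(ok["name"].tolist())
print(error_raised)
```

So if you see a TypeError or "truth value of a Series is ambiguous" on a filter like this, missing parentheses are the first thing to check.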

User

Reply #2 · edited 2024-10-01 15:33:03

Yes, you can filter on two variables. By the way, could you share the error message? In the meantime, try the following:

highpop_highdeath = df.loc[(df['StatePopulation'] > 4342705.0) & (df['deaths_to_cases'] > 0.012143070253953211)]
highpop_highdeath.name = 'States with a high population and high death rate'
highpop_lowdeath = df.loc[(df['StatePopulation'] > 4342705.0) & (df['deaths_to_cases'] <= 0.012143070253953211)]
highpop_lowdeath.name = 'States with a high population and low death rate'
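A possible variation, sketched on made-up data: build each mask once, reuse it for both groups, and keep the labels in a plain dict. The dict is my own suggestion, not something from the question; custom attributes such as .name are generally not carried along by pandas operations, so a dict can be a safer place for labels.

```python
import pandas as pd

# Hypothetical toy data: column names follow the question, values are invented.
df = pd.DataFrame({
    "name": ["A", "B", "C", "D"],
    "StatePopulation": [10_000_000, 8_000_000, 1_000_000, 500_000],
    "deaths_to_cases": [0.020, 0.005, 0.030, 0.004],
})

high_pop = df["StatePopulation"] > 4342705.0
high_rate = df["deaths_to_cases"] > 0.012143070253953211

# ~high_rate matches "<= cutoff" only if deaths_to_cases has no missing values
# (assumed here); with NaN present, write the <= comparison explicitly.
groups = {
    "States with a high population and high death rate": df.loc[high_pop & high_rate],
    "States with a high population and low death rate": df.loc[high_pop & ~high_rate],
}

for label, group in groups.items():
    print(label, "->", group["name"].tolist())
```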

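User

Reply #3 · edited 2024-10-01 15:33:03

You want to combine the two boolean vectors. That way, pandas evaluates both conditions for every row of the DataFrame and keeps a row only when both are true: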

highpop_highdeath = df.loc[(df['StatePopulation'] > 4342705.0) & (df['deaths_to_cases'] > 0.012143070253953211)]

highpop_lowdeath = df.loc[(df['StatePopulation'] > 4342705.0) & (df['deaths_to_cases'] <= 0.012143070253953211)]
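To see the row-by-row evaluation, it can help to print the two boolean vectors next to their combination. This is only an illustrative sketch; the four rows are invented, and the helper column names (high_pop, high_rate, both) are mine.

```python
import pandas as pd

# Hypothetical toy data: column names follow the question, values are invented.
df = pd.DataFrame({
    "StatePopulation": [10_000_000, 8_000_000, 1_000_000, 500_000],
    "deaths_to_cases": [0.020, 0.005, 0.030, 0.004],
})

high_pop = df["StatePopulation"] > 4342705.0
high_rate = df["deaths_to_cases"] > 0.012143070253953211
both = high_pop & high_rate  # element-wise AND, one result per row

# Side-by-side view of each condition and the combined mask.
print(pd.DataFrame({"high_pop": high_pop, "high_rate": high_rate, "both": both}))
```

Only the first row is True in both columns, so it is the only row df.loc[both] would keep.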

More concisely, if you only need the name column:

highpop_highdeath_names = df.loc[(df['StatePopulation'] > 4342705.0) & (df['deaths_to_cases'] > 0.012143070253953211), 'name']
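Note that passing a single column label as the second argument to .loc gives back a Series rather than a DataFrame. A small sketch, assuming the data really has a column called name (the rows below are invented):

```python
import pandas as pd

# Hypothetical toy data: column names follow the thread, values are invented.
df = pd.DataFrame({
    "name": ["A", "B", "C", "D"],
    "StatePopulation": [10_000_000, 8_000_000, 1_000_000, 500_000],
    "deaths_to_cases": [0.020, 0.005, 0.030, 0.004],
})

mask = (df["StatePopulation"] > 4342705.0) & (df["deaths_to_cases"] > 0.012143070253953211)

# Row mask plus one column label: the result is a Series of names.
highpop_highdeath_names = df.loc[mask, "name"]

print(type(highpop_highdeath_names).__name__)
print(highpop_highdeath_names.tolist())
```

Use a list of labels, e.g. ['name'], in that position if you want a one-column DataFrame instead.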
