Pandas在value_counts（）表格中减少了分类变量的数量

1条回答

网友

1楼 · 发布于 2024-10-02 14:28:08

我认为您可以将^{}与^{}一起使用，其中条件与^{}一起使用：

df = pd.DataFrame({'Color':'Red Red Blue Red Violet Blue'.split(), 
                   'Value':[11,150,50,30,10,40]})
print (df)
    Color  Value
0     Red     11
1     Red    150
2    Blue     50
3     Red     30
4  Violet     10
5    Blue     40

a = df.Color.value_counts()
print (a)
Red       3
Blue      2
Violet    1
Name: Color, dtype: int64

#get top 2 values of index
vals = a[:2].index
print (vals)
Index(['Red', 'Blue'], dtype='object')

^{pr2}$

或者，如果需要替换所有非top值，请使用^{}：

df['new1'] = df.Color.where(df.Color.isin(vals), 'other')
print (df)
    Color  Value   new1
0     Red     11    Red
1     Red    150    Red
2    Blue     50   Blue
3     Red     30    Red
4  Violet     10  other
5    Blue     40   Blue

编程相关推荐

c#什么时候使用公共字段才有意义？
JavaRMI何时创建存根、启动注册表并指定代码库？
java强制子级使用自己定义的枚举
java安卓跨越Html。fromHtml（stringWithCDATA）仍然将标记显示为文本
java如何通过按键和释放使循环开始和结束？
java我可以使用什么工具从多个图像创建单个PNG？
java Sonarqube给了我删除代码的问题，无法过滤问题
java Firebase Firestore：如何在Android上将文档对象转换为POJO
JavaSwing:JTextArea列问题
java将数组对象及其变量列表到Main方法

相关问题更多 >

编程相关推荐

热门问题

热门文章

Pandas在value_counts（）表格中减少了分类变量的数量

相关问题 更多 >

编程相关推荐

热门问题

热门文章

相关问题更多 >