我有这个数据帧:
df = pd.DataFrame(columns=["App","Feature1", "Feature2","Feature3",
"Feature4","Feature5",
"Feature6","Feature7","Feature8"],
data=[["SHA",0,0,1,1,1,0,1,0],
["LHA",1,0,1,1,0,1,1,0],
["DRA",0,0,0,0,0,0,1,0],
["FRA",1,0,1,1,1,0,1,1],
["BRU",0,0,1,0,1,0,0,0],
["PAR",0,1,1,1,1,0,1,0],
["AER",0,0,1,1,0,1,1,0],
["SHE",0,0,0,1,0,0,1,0]])
更新:(对不起,我没有正确表述预期结果)
我想计算每个功能的值1
出现的时间:
Features Count
Feature1 6
Feature2 7
...
我试过这个:
df.groupBy("App").count()
但我没有得到预期的输出。你知道吗
另一种使用熔化的方法:
首先获取长格式数据:
然后按
Features
分组并计算1:用途:
与
1
类似:加上^{} :
或与列表理解:
相关问题 更多 >
编程相关推荐