我尝试按多个列进行分组,并按计数对它们进行排序,然后获得每个组的最高记录
df.groupby("_c21","y2_co","y2_r","y2_z","y2_org").count()\
.show(n=10)
我尝试过按不为null的单个列进行分组
df.groupby("_c21").count()\
.show(n=10)
AttributeError: 'NoneType' object has no attribute 'groupby'
样本行
+--------------------+--------------------+--------------------+-----+----+-----+--------------------+
| _c17| _c21| m|y2_co|y2_r| y2_z| y2_org|
+--------------------+--------------------+--------------------+-----+----+-----+--------------------+
|proc=;app=;cl=442...|tHO$SZPbABVo3A1X8...|[proc -> , app ->...| BR| PB|58397|Voax Provedor de ...|
|proc=;app=;cl=444...|tHO$SZPbABVo3A1X8...|[proc -> , app ->...| BR| PB|58397|Voax Provedor de ...|
|proc=;app=;cl=145...|Zu6zZxiekXnHfpNER...|[proc -> , app ->...| MX| NLE|66490| Totalplay|
|proc=;app=;cl=145...|Zu6zZxiekXnHfpNER...|[proc -> , app ->...| MX| NLE|66490| Totalplay|
|proc=;app=;cl=147...|Zu6zZxiekXnHfpNER...|[proc -> , app ->...| MX| NLE|66490| Totalplay|
+--------------------+--------------------+--------------------+-----+----+-----+--------------------+
我在上一次发言中有一个
.show(n=5)
。我把.show(n=5)
注释掉了,它就行了相关问题 更多 >
编程相关推荐