Pandas grouby和transform（'count'）在较小的数据上提供放置错误

newtest=pd.DataFrame([['010010201001000','001','0220','AL','0'],['010010201001001','001','0220','AL','0'],['010010201001002','001','0220','AL','0'],['010010201001003','001','0160','AL','0'],['010010201001004','001','0160','AL','0']],columns=['BlockID','CountyFP','District','state_x','HD']) newtest['blocks']=newtest.groupby(['CountyFP','District','state_x']).transform('count')

1条回答

网友

1楼 · 发布于 2024-10-02 18:23:38

您没有选择要对其执行聚合的任何列，因此它对其余的列（共2列）执行聚合，如果选择其中一列，则会获得所需的结果：

In [6]:
newtest['blocks'] = newtest.groupby(['CountyFP','District','state_x'])['BlockID'].transform('count')
newtest

Out[6]:
           BlockID CountyFP District state_x HD  blocks
0  010010201001000      001     0220      AL  0       3
1  010010201001001      001     0220      AL  0       3
2  010010201001002      001     0220      AL  0       3
3  010010201001003      001     0160      AL  0       2
4  010010201001004      001     0160      AL  0       2

尝试的输出：

^{pr2}$

您可以看到它生成了2列，因为这些列是剩余的列，因此出现了您观察到的错误消息。在

相关问题更多 >

编程相关推荐

热门问题

热门文章