如何分组并计算该组的总数

# Import pandas library import pandas as pd import numpy as np from sklearn.linear_model import LogisticRegression # initialize list of lists data = [['tom', 10,1], ['nick', 15,0], ['tom', 14,1], ['jason', 15,0], ['nick', 18,1], ['jason', 15,0], ['jason', 17,1] , ['tom', 14,0], ['nick',16 ,1], ['tom', 22,1]] # Create the pandas DataFrame df = pd.DataFrame(data, columns = ['Name', 'Attempts','Target']) # print dataframe. df Name Attempts Target 0 tom 10 1 1 nick 15 0 2 tom 14 1 3 jason 15 0 4 nick 18 1 5 jason 15 0 6 jason 17 1 7 tom 14 0 8 nick 16 1 9 tom 22 1

Name Attempts Target totalentries 0 tom 10 1 4 1 nick 15 0 3 2 tom 14 1 4 3 jason 15 0 3 4 nick 18 1 3 5 jason 15 0 3 6 jason 17 1 3 7 tom 14 0 4 8 nick 16 1 3 9 tom 22 1 4

2条回答

网友

1楼 · 编辑于 2024-10-02 02:32:38

你应该试试这个：

df["totalentries"] = [df.groupby("Name")["Name"].count()[i] for i in df["Name"].values]

这将为您提供所需的输出

网友

2楼 · 编辑于 2024-10-02 02:32:38

将^{}与groupby之后的指定列一起使用聚合函数：

df['totalentries'] = df.groupby('Name')['Target'].transform('nunique')

如果需要计算值：

df['totalentries'] = df.groupby('Name')['Target'].transform('size')

相关问题更多 >

编程相关推荐

热门问题

热门文章