如何在python中按多个列分组问题的回答

如何在python中按多个列分组

回答此问题可获得 20 贡献值，回答如果被采纳可获得 50 分。

0 条评论
分类：Python问答

默认排序时间排序

1 个回答

匿名 1天前

　擅长：python、mysql、java

资料 <pre><code>df = pd.DataFrame({'name ': {0: 'Settings', 1: 'Calendar', 2: 'Calendar', 3: 'Messages', 4: 'Instagram', 5: 'Drive'}, ' duration': {0: 3.4360000000000004, 1: 2.167, 2: 5.705, 3: 7.907, 4: 50.285, 5: 30.26}, ' UserId': {0: 1, 1: 1, 2: 1, 3: 1, 4: 9, 5: 9}, ' category': {0: ' System tool', 1: ' Calendar', 2: ' Calendar', 3: ' Phone_and_SMS', 4: ' Social', 5: ' Productivity'}, ' part_of_day': {0: ' evening', 1: ' night ', 2: ' night ', 3: 'night ', 4: ' night ', 5: ' night '}, ' Date': {0: ' 2020-09-10', 1: ' 2020-09-11', 2: ' 2020-09-11', 3: ' 2020-09-11', 4: ' 2020-09-28', 5: ' 2020-09-28'}}) df.columns = df.columns.str.strip() </code></pre> df: <pre><code> name duration UserId category part_of_day Date 0 Settings 3.436 1 System tool evening 2020-09-10 1 Calendar 2.167 1 Calendar night 2020-09-11 2 Calendar 5.705 1 Calendar night 2020-09-11 3 Messages 7.907 1 Phone_and_SMS night 2020-09-11 4 Instagram 50.285 9 Social night 2020-09-28 5 Drive 30.260 9 Productivity night 2020-09-28 </code></pre> <pre><code>grouping = df.groupby(["UserId", "Date","category"]).agg({"category": 'count', 'duration':max}).rename(columns={"duration" : "max-duration"}) </code></pre> 分组： <pre><code> category max-duration UserId Date category 1 2020-09-10 System tool 1 3.436 2020-09-11 Calendar 2 5.705 Phone_and_SMS 1 7.907 9 2020-09-28 Productivity 1 30.260 Social 1 50.285 </code></pre>

如何在python中按多个列分组

1 个回答

相关Python问题