如何统计Pandas分类时间序列数据

timestamp action nick message 2005-11-04 01:44:33 False hack-cclub lex, hey! 2005-11-04 01:44:43 False hack-cclub lol, yea thats broke 2005-11-04 01:44:56 False lex Slashdot - Updated 2005-11-04 00:23:00 | Micro... 2005-11-04 01:44:56 False hack-cclub lex slashdot 2005-11-04 01:45:12 False lex port 666 is doom - doom Id Software (or mdqs o.. 2005-11-04 01:45:12 False hack-cclub lex, port 666 2005-11-04 01:45:21 False hitokiri lex, port 23485 2005-11-04 01:45:45 False hitokiri lex, port 1024 2005-11-04 01:45:46 True hack-cclub slaps lex around with a wet fish

1条回答

网友

1楼 · 发布于 2024-06-03 04:04:40

你需要^{}和{a2}：

print df[['timestamp', 'nick']]
             timestamp        nick
0  2005-11-04 01:44:33  hack-cclub
1  2005-11-04 01:44:43  hack-cclub
2  2005-11-04 01:44:56         lex
3  2005-11-04 01:44:56  hack-cclub
4  2005-11-04 01:45:12         lex
5  2005-11-04 01:45:12  hack-cclub
6  2005-11-04 01:45:21    hitokiri
7  2005-11-04 01:45:45    hitokiri
8  2005-11-04 01:45:46  hack-cclub

df = pd.crosstab(df.timestamp, df.nick)
print df
nick                 hack-cclub  hitokiri  lex
timestamp                                     
2005-11-04 01:44:33           1         0    0
2005-11-04 01:44:43           1         0    0
2005-11-04 01:44:56           1         0    1
2005-11-04 01:45:12           1         0    1
2005-11-04 01:45:21           0         1    0
2005-11-04 01:45:45           0         1    0
2005-11-04 01:45:46           1         0    0

df = df.cumsum()
print df
nick                 hack-cclub  hitokiri  lex
timestamp                                     
2005-11-04 01:44:33           1         0    0
2005-11-04 01:44:43           2         0    0
2005-11-04 01:44:56           3         0    1
2005-11-04 01:45:12           4         0    2
2005-11-04 01:45:21           4         1    2
2005-11-04 01:45:45           4         2    2
2005-11-04 01:45:46           5         2    2

相关问题更多 >

编程相关推荐

热门问题

热门文章