计算datafram中非数字列的每日发生率

relative_humidity condition fid 2017-08-02 10:00:00 0.49 Chance of a Thunderstorm 1 2017-08-02 11:00:00 0.50 Chance of a Thunderstorm 1 2017-08-02 12:00:00 0.54 Partly Cloudy 1 2017-08-02 13:00:00 0.58 Partly Cloudy 2 2017-08-02 14:00:00 0.68 Partly Cloudy 2

2条回答

网友

1楼 · 编辑于 2024-09-27 19:25:29

df.groupby(['fid',pd.Grouper(freq='D'),'condition']).size().groupby(level=[0,1]).head(1)

输出：

fid              condition               
1    2017-08-02  Chance of a Thunderstorm    2
2    2017-08-02  Partly Cloudy               2
dtype: int64

网友

2楼 · 编辑于 2024-09-27 19:25:29

您需要^{}和index[0]，因为数据是经过排序的，第一个值是top：

d = {'level_1':'date'}
df1 = df.groupby(['fid', pd.Grouper(freq='D')])['condition'] \
       .apply(lambda x: x.value_counts().index[0]).reset_index().rename(columns=d)
print (df1)
   fid       date                 condition
0    1 2017-08-02  Chance of a Thunderstorm
1    2 2017-08-02             Partly Cloudy

相关问题更多 >

编程相关推荐

热门问题

热门文章

计算datafram中非数字列的每日发生率

相关问题 更多 >

编程相关推荐

热门问题

热门文章

相关问题更多 >