大Pandas从datetime索引从长到宽

val week 2015-01-02 16729 1 2015-01-09 16225 2 2015-01-16 15250 3 2015-01-23 15690 4 2015-01-30 16025 5 ... ... ... 2020-03-20 16417 12 2020-03-27 15481 13 2020-04-03 14216 14 2020-04-10 13113 15 2020-04-17 12825 16

2015 ... 2020 01-1 16729 ... ... 01-2 16225 ... ... 01-3 15250 ... ... 01-4 15690 ... ... 01-5 16025 ... ... ... ... ... ... 03-12 ... ... 16417 03-13 ... ... 15481 04-14 ... ... 14216 04-15 ... ... 13113 04-16 ... ... 12825

2015 ... 2020 01-02 16729 ... ... 01-09 16225 ... ... 01-16 15250 ... ... 01-23 15690 ... ... 01-30 16025 ... ... ... ... ... ... 03-20 ... ... 16417 03-27 ... ... 15481 04-03 ... ... 14216 04-10 ... ... 13113 04-17 ... ... 12825

1条回答

网友

1楼 · 发布于 2024-10-01 02:26:56

在所有的评论之后，似乎是时候编写一些代码了。有点不对劲，但也许这会对你有所帮助：

import numpy as np
import pandas as pd

# example df with some random values.
df = pd.DataFrame({'t': ['2015-01-02','2015-01-03','2015-01-16','2015-01-23','2015-01-30', '2020-01-01'],
                   'val': [16729, 16225, 15250, 15690, 16025, 999],
                   'week': [1, 2, 3, 4, 5, 1]})
df['t'] = pd.to_datetime(df['t'])

# pivot to get years as columns
df1 = pd.pivot_table(df, values='val', columns=df['t'].dt.year, index=df['t'])

# create a new column "date" for later on... cast to datetime object for now
df1['date'] = pd.to_datetime(df1.index.date)

# sum the values for every week and drop the original "t" (datetime) column
df2 = df1.groupby(df1.index.week).resample('W-Mon', on='date').sum().reset_index().sort_values(by='date').drop(columns=['t'])

# drop all rows that only hold zeros
df2 = df2.loc[~np.isclose(df2.loc[:, df2.columns != 'date'], 0)]

# finally, format the datetime column to string as desired
df2['month-week'] = df2['date'].dt.strftime('%m-%W')

相关问题更多 >

编程相关推荐

热门问题

热门文章