Pandas放假前几天问题的回答

Pandas放假前几天

回答此问题可获得 20 贡献值，回答如果被采纳可获得 50 分。

0 条评论
分类：Python问答

默认排序时间排序

1 个回答

匿名 1天前

　擅长：python、mysql、java

通过使用数据帧的索引，可以避免<code>groupby()</code>。你知道吗 <强>1。使用范围索引 假设以下数据帧： <pre><code>import pandas as pd df = pd.DataFrame({ 'SalesDate': ['2016-12-20', '2016-12-21', '2016-12-22', '2016-12-23', '2016-12-24', '2016-12-25', '2016-12-26', '2016-12-27'], 'holiday': [0, 0, 0, 0, 0, 1, 0, 0] }) df SalesDate holiday 0 2016-12-20 0 1 2016-12-21 0 2 2016-12-22 0 3 2016-12-23 0 4 2016-12-24 0 5 2016-12-25 1 6 2016-12-26 0 7 2016-12-27 0 </code></pre> 单线解决方案： 如果数据帧使用标准RangeIndex，则可以使用<code>df.index.where().bfill()</code>和<code>df.index</code>进行算术运算： <pre><code>df['next_holiday'] = pd.Series(df.index.where(df.holiday == 1), dtype='Int32').fillna(method='bfill') - df.index # Note: dtype'Int32' nullable int is new in pandas 0.24 </code></pre> 结果： <pre><code>df SalesDate holiday next_holiday 0 2016-12-20 0 5 1 2016-12-21 0 4 2 2016-12-22 0 3 3 2016-12-23 0 2 4 2016-12-24 0 1 5 2016-12-25 1 0 6 2016-12-26 0 NaN 7 2016-12-27 0 NaN </code></pre> <强>2。使用DateTimeIndex 如果<code>SalesDate</code>是索引列（<code>datetime64</code>类型），则解决方案类似： <pre><code>df = df.set_index(pd.to_datetime(df.SalesDate)).drop(columns=['SalesDate']) df holiday SalesDate 2016-12-20 0 2016-12-21 0 2016-12-22 0 2016-12-23 0 2016-12-24 0 2016-12-25 1 2016-12-26 0 2016-12-27 0 </code></pre> 用日期算法求解： <pre><code>df['next_holiday'] = ((pd.Series(df.index.where(df.holiday == 1)).fillna(method='bfill') - df.index) / np.timedelta64(1, 'D')) df['next_holiday'] = df['next_holiday'].astype('Int32') # pandas >= 0.24 for the nullable integer cast </code></pre> 结果： <pre><code>df holiday next_holiday SalesDate 2016-12-20 0 5 2016-12-21 0 4 2016-12-22 0 3 2016-12-23 0 2 2016-12-24 0 1 2016-12-25 1 0 2016-12-26 0 NaN 2016-12-27 0 NaN </code></pre>

Pandas放假前几天

1 个回答

相关Python问题