我有一个时间序列数据,需要根据Stage列上次更新的日期添加一行。我有重复的数据:例如
Id Date Stage
1 20-12-2013 Basic
1 20-10-2015 Basic
1 05-12-2018 Advanced
2 20-05-2019 Basic
2 15-12-2019 Advanced
3 20-01-2020 Advanced
4 20-10-2020 Basic
4 20-12-2020 Advanced
预期结果:
Id Date Stage Stage Changed Since
1 20-12-2013 Basic NaN
1 20-10-2015 Basic NaN
1 05-12-2018 Advanced 05-12-2018 - 20-10-2015
2 20-05-2019 Basic NaN
2 15-12-2019 Advanced 15-12-2019 - 20-05-2019
3 20-01-2020 Advanced NaN
4 20-10-2020 Basic NaN
4 20-12-2020 Advanced 20-12-2020 - 20-10-2020
所以,基本上,当阶段在同一Id内更改时,我需要获得阶段更改后的天数。而舞台也发生了变化,因为专栏应该显示这一点
签出https://pandas.pydata.org/pandas-docs/stable/reference/api/pandas.DataFrame.shift.html
您可以为
stage
和date
创建两个下移列,并进行比较你得到了什么
相关问题 更多 >
编程相关推荐