使用pandas填补间隙，而不是结束处的NaN值

def fillGaps(houseDF): """Fills up holes in the housing data""" def fillColumns(column): filled_col = column lastValue = None # Keeps track of if we are dealing with a gap in numbers gap = False i = 0 for currentValue in filled_col: # Loops over all the nans before the numbers begin if not isANumber(currentValue) and lastValue is None: pass # Keeps track of the last number we encountered before a gap elif isANumber(currentValue) and (gap is False): lastIndex = i lastValue = currentValue # Notes when we encounter a gap in numbers elif not isANumber(currentValue): gap = True # Fills in the gap elif isANumber(currentValue): gapIndicies = range(lastIndex + 1, i) for j in gapIndicies: filled_col[j] = lastValue gap = False i += 1 return filled_col filled_df = houseDF.apply(fillColumns, axis=0) return filled_df

3条回答

网友

1楼 · 编辑于 2024-09-27 01:29:48

另一种解决多列数据帧的方法

df.fillna(method='ffill') + (df.fillna(method='bfill') * 0)

它是如何工作的？在

第一个fillna执行值的前向填充。这几乎是我们想要的，除了在每个系列的末尾留下填充值的痕迹。在

第二个fillna对乘以0的值进行反向填充。结果是我们不需要的尾随值将为NaN，其他值都将为0。在

最后，我们利用x+0=x和x+NaN=NaN这一事实将两者相加。在

网友

2楼 · 编辑于 2024-09-27 01:29:48

您可以在本系列的某些部分使用fillna。根据您的描述，fillna应该只填充第一个non-NaN之后和最后一个non-NaN之前的NaN：

import numpy as np
import pandas as pd


def fill_column(house):
    house = house.copy()
    non_nans = house[~house.apply(np.isnan)]
    start, end = non_nans.index[0], non_nans.index[-1]
    house.ix[start:end] = house.ix[start:end].fillna(method='ffill')
    return house


house1 = pd.Series([np.nan, np.nan, np.nan, 200000, 200000, np.nan, np.nan, 200000, 190000, np.nan, np.nan, np.nan])
print fill_column(house1)

输出：

^{pr2}$

注意，这假设序列至少包含两个非nan，对应于第一天和最后一天的价格。在

网友

3楼 · 编辑于 2024-09-27 01:29:48

我在一年后找到了这个答案，但是需要它来处理一个包含多个列的数据帧，所以我想把我的解决方案留在这里，以防其他人也需要这个答案。我的功能只是YS-L的修改版

def fillna_downbet(df):
    df = df.copy()
    for col in df:
        non_nans = df[col][~df[col].apply(np.isnan)]
        start, end = non_nans.index[0], non_nans.index[-1]
        df[col].loc[start:end] = df[col].loc[start:end].fillna(method='ffill')
    return df

谢谢！在

相关问题更多 >

编程相关推荐

热门问题

热门文章