如何从我创建的函数创建循环和新数据集？

neighborhood type_property type_negotiation price Smallville house rent 2000 Oakville apartment for sale 100000 King Bay house for sale 250000 ...

def foo(string): a = df[(df.type_negotiation == 'forsale')&(df.type_property == 'house')&(df.neighborhood == string)] b = pd.DataFrame([[a.price.quantile(0.1), a.price.quantile(0.9), len(a.index)]], columns=('tenthpercentile', 'ninetiethpercentile', 'Quantity')) return b print(foo('KingBay')) tenthpercentile ninetiethpercentile Quantity 0 250000.0 250000.0 1

tenthpercentile ninetiethpercentile Quantity King Bay 250000.0 250000.0 1 Smallville 99000.0 120000.0 8 Oakville 45000.0 160000.0 6

1条回答

网友

1楼 · 发布于 2024-09-30 02:20:56

通常对于数据帧，如果可以，最好避免显式循环，并使用pandas提供的优化方法。在您的例子中，您可以通过使用groupby和^{}，将所需的百分位数传递给参数percentiles来消除循环。然后，只需选择所需的列并适当地重命名它们：

new_df = (df.groupby('neighborhood')
          .describe(percentiles=[0.1,0.9])
          ['price'][['10%','90%','count']]
          .rename(columns={'count':'Quantity',
                           '10%':'tenthpercentile',
                           '90%':'ninetiethpercentile'}))

在您的案例中（因为每个社区只有一个例子）：

>>> new_df
              tenthpercentile  ninetiethpercentile  Quantity
neighborhood                                                
King Bay             250000.0             250000.0       1.0
Oakville             100000.0             100000.0       1.0
Smallville             2000.0               2000.0       1.0

[编辑]：我刚刚看到在您的函数中，您只查看(df.type_negotiation == 'for sale') & (df.type_property == 'house')。为此，只需添加一个loc即可根据以下条件过滤数据帧：

new_df = (df.loc[(df.type_negotiation == 'for sale')
                 & (df.type_property == 'house')]
          .groupby('neighborhood')
              .describe(percentiles=[0.1,0.9])
              ['price'][['10%','90%','count']]
              .rename(columns={'count':'Quantity',
                               '10%':'tenthpercentile',
                               '90%':'ninetiethpercentile'}))

另外，如果您想使用函数和循环（我不建议这样做），您可以：

pd.concat([foo(i) for i in df.neighborhood.unique()])

相关问题更多 >

编程相关推荐

热门问题

热门文章