迭代Groupby对象上的列

Q1 = boroughs.quantile(0.25) Q3 = boroughs.quantile(0.75) IIQ = Q3 - Q1 inf_lim = Q1 - 1.5*IIQ sup_lim = Q3 + 1.5*IIQ rent = pd.DataFrame() for borough in boroughs.groups.keys(): which_borough = df["Borough"] == borough not_outlier = ((df["Rent_Price"] >= inf_lim[borough]) & (df["Rent_Price"] <= sup_lim[borough])) selection = which_borough & not_outlier df_selection = df[selection] aluguel = pd.concat([rent, df_selection])

1条回答

网友

1楼 · 发布于 2024-09-26 18:00:16

试试这个：

for col in ['Rent_Price', 'borough', 'number_of_bedrooms', 'parking_spaces', 'floor_area']:
    ans=df[(df[col]<=(df[col].quantile(0.75)+(1.5*(df[col].quantile(0.75)-df[col].quantile(0.25))))) & (df[col]>=(df[col].quantile(0.25)-(1.5*(df[col].quantile(0.75)-df[col].quantile(0.25)))))][col].describe()
    print(f'{col} Summary')
    print(ans)

按行政区划分：

for b in set(df.borough):
    df1=df[df.borough==b]
    for col in ['rent price', 'borough', 'nº of bedrooms', 'parking spaces', 'floor area']:
        ans=df1[(df1[col]<=(df1[col].quantile(0.75)+(1.5*(df1[col].quantile(0.75)-df1[col].quantile(0.25))))) & (df1[col]>=(df1[col].quantile(0.25)-(1.5*(df1[col].quantile(0.75)-df1[col].quantile(0.25)))))][col].describe()
        print(f'{col} Summary')
        print(ans)

相关问题更多 >

编程相关推荐

热门问题

热门文章

迭代Groupby对象上的列

相关问题 更多 >

编程相关推荐

热门问题

热门文章

相关问题更多 >