使用pandas选择多列和多列中的fillna（）的另一种方法

da1 = pd.read_csv('terror.csv', sep = ',', header=0 , encoding='latin' , na_values=['Missing', ' ']) da1.head() #Handling missing values da1['attacktype3'] = da1['attacktype3'].fillna(0) da1['attacktype2'] = da1['attacktype2'].fillna(0) da1['attacktype1'] = da1['attacktype1'].fillna(0) da1['total_attacks'] = da1['attacktype3'] + da1['attacktype2'] + da1['attacktype1'] #country_txt is a column which consists of different countries.Want to find "Total_atacks" only for India. Therefore, the condition applied is country_txt=='India'. a1 = da1.query("country_txt=='India'").agg({'total_attacks':np.sum}) print(a1)

da1 = pd.read_csv('terror.csv', sep = ',', header=0 , encoding='latin' , na_values=['Missing', ' ']) da1.head() #Handling missing values check1=Df.country_txt=="India" store=Df[["attacktype1","attacktype2","attacktype3"]].apply(lambda x:x.fillna(0)) Total_attack=Df.loc[check1,store].sum(axis=1) print(Total_attack)

I want to apply fillna(0) to multiple columns in a single line and also total those columns in an alternate and effective way. The error that I get when I use my second way is: ValueError: Cannot index with multidimensional key

1条回答

网友

1楼 · 发布于 2024-09-29 23:25:33

首先用^{}按^{}筛选，然后用^{}替换缺少的值：

check1 = Df.country_txt == "India"
cols = ["attacktype1","attacktype2","attacktype3"]

Df['Total_attack'] = Df.loc[check1, cols].fillna(0).sum(axis=1)

对于标量，一个数字输出addsum：

Total_attack = Df['Total_attack'].sum()
print (Total_attack)
35065.0

相关问题更多 >

编程相关推荐

热门问题

热门文章