If/else statements when looping over a DataFrame

depth   density  VSH
5517    2.126    0.8347083
5517.5  2.123    0.8310949
5518    2.124    0.8012414
5518.5  2.121    0.7838615
5519    2.116    0.7674243
5519.5  2.127    0.8405414

for index, row in data.iterrows():
    if data.loc[index, 'VSH'] < 0.2:
        data.loc[index, 'porosity'] = (data['density'] * 2.3)
    elif data.loc[index, 'VSH'] > 0.7:
        data.loc[index, 'porosity'] = (data['density'] * 1.7)

1 answer

Answer 1 · posted 2024-09-26 18:19:38

iterrows is a poor choice here, because it is slow and a vectorized solution exists; see "Does pandas iterrows have performance issues?"

So use numpy.select:

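import numpy as np

# Boolean masks for the two conditions
m1 = data['VSH'] < 0.2
m2 = data['VSH'] > 0.7
# Values to use where each mask is True
s1 = data['density']*2.3
s2 = data['density']*1.7

# Take s1 where m1 holds, s2 where m2 holds
data['porosity'] = np.select([m1, m2], [s1, s2])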

print(data)
    depth  density       VSH  porosity
0  5517.0    2.126  0.834708    3.6142
1  5517.5    2.123  0.831095    3.6091
2  5518.0    2.124  0.801241    3.6108
3  5518.5    2.121  0.783861    3.6057
4  5519.0    2.116  0.767424    3.5972
5  5519.5    2.127  0.840541    3.6159
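
All six sample rows have VSH above 0.7, so only the second condition ever matches in the output above. As a rough sketch of the other cases, the snippet below uses made-up density and VSH values (the `demo` frame is hypothetical, not the asker's data) that hit each range. Rows matching neither mask receive np.select's built-in default of 0.

```python
import numpy as np
import pandas as pd

# Hypothetical values chosen to hit each branch; not the asker's data.
demo = pd.DataFrame({
    'density': [2.0, 2.0, 2.0],
    'VSH': [0.1, 0.5, 0.9],
})

m1 = demo['VSH'] < 0.2
m2 = demo['VSH'] > 0.7

# No default is given, so rows matching neither mask fall back to 0.
demo['porosity'] = np.select(
    [m1, m2],
    [demo['density'] * 2.3, demo['density'] * 1.7],
)
print(demo)
```

The middle row (VSH = 0.5) ends up with a porosity of 0, which is rarely what is wanted.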

It is better to define what happens when VSH is between 0.2 and 0.7. For example, return the data['density'] column through the default parameter:

data['porosity'] = np.select([m1, m2], [s1, s2], default=data['density'])

print(data)
    depth  density       VSH  porosity
0  5517.0    2.126  0.834708    3.6142
1  5517.5    2.123  0.831095    3.6091
2  5518.0    2.124  0.801241    3.6108
3  5518.5    2.121  0.783861    3.6057
4  5519.0    2.116  0.767424    3.5972
5  5519.5    2.127  0.840541    3.6159
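
One way to check that the vectorized version matches the row-by-row logic is to compare it against a corrected loop. This is only a sketch: the `demo` values are made up, the helper `porosity_for_row` is hypothetical, and using the scalar `row['density']` (rather than the whole `data['density']` column, as in the question) is an assumption about what the loop was meant to do.

```python
import numpy as np
import pandas as pd

# Hypothetical values covering all three VSH ranges.
demo = pd.DataFrame({
    'density': [2.1, 2.2, 2.3],
    'VSH': [0.1, 0.5, 0.9],
})

def porosity_for_row(row):
    """Row-wise rule, using the scalar density of the current row."""
    if row['VSH'] < 0.2:
        return row['density'] * 2.3
    elif row['VSH'] > 0.7:
        return row['density'] * 1.7
    # Assumed fallback, mirroring default=data['density'].
    return row['density']

# Slow reference implementation with iterrows.
looped = pd.Series(
    [porosity_for_row(row) for _, row in demo.iterrows()],
    index=demo.index,
)

# Vectorized equivalent with an explicit default.
vectorized = np.select(
    [demo['VSH'] < 0.2, demo['VSH'] > 0.7],
    [demo['density'] * 2.3, demo['density'] * 1.7],
    default=demo['density'],
)

print(np.allclose(looped, vectorized))
```

Both approaches apply the same rule; the np.select form avoids the Python-level loop over rows.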
