Performing multiple comparisons (intervals) in a single pass over a NumPy array

import numpy as np
from sklearn.ensemble import GradientBoostingRegressor

n_samples = 2000
n_features = 10
rng = np.random.RandomState(0)
X = rng.normal(size=(n_samples, n_features))
w = rng.normal(size=n_features)

# simple linear function without noise
y = np.dot(X, w)

gbrt = GradientBoostingRegressor(loss='quantile', alpha=0.95)
gbrt.fit(X, y)

# Get upper interval
upper_interval = gbrt.predict(X)

# Get lower interval
gbrt.set_params(alpha=0.05)
gbrt.fit(X, y)
lower_interval = gbrt.predict(X)

intervals = np.concatenate((lower_interval[:, np.newaxis], upper_interval[:, np.newaxis]), axis=1)

# This is 4 passes:
perc_correct_intervals = ((y >= intervals[:, 0]) & (y <= intervals[:, 1])).sum() / y.shape[0]
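To make the "4 passes" comment concrete, here is a minimal sketch that splits the one-line expression into its separate array operations. The data is synthetic and the names `lower` and `upper` are hypothetical stand-ins for the two columns of `intervals`. No model is fitted here.

```python
import numpy as np

rng = np.random.RandomState(0)
y = rng.normal(size=1000)
# Hypothetical bounds, for illustration only (not produced by a model).
lower = y - rng.uniform(-0.5, 1.0, size=y.size)
upper = y + rng.uniform(-0.5, 1.0, size=y.size)

above_lower = y >= lower            # pass 1: temporary boolean array
below_upper = y <= upper            # pass 2: second temporary boolean array
inside = above_lower & below_upper  # pass 3: elementwise AND of the two
n_inside = inside.sum()             # pass 4: reduction over the result

fraction_inside = n_inside / y.shape[0]

# The one-line form from the question performs the same four operations.
one_liner = ((y >= lower) & (y <= upper)).sum() / y.shape[0]
```

Each step walks the full array once, which is why the question counts four passes even though the expression fits on one line.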

1 answer

User

#1 · Posted 2024-10-05 14:23:59

np.count_nonzero saves some time compared with .sum(), and you save even more if you don't need the intervals matrix for anything else.

%%timeit
intervals = np.concatenate((lower_interval[:, np.newaxis], upper_interval[:, np.newaxis]), axis=1)
perc_correct_intervals = ((y >= intervals[:, 0]) & (y <= intervals[:, 1])).sum() / y.shape[0]

15.7 µs ± 78.8 ns per loop (mean ± std. dev. of 7 runs, 100000 loops each)


%%timeit
np.count_nonzero(np.less(lower_interval, y)*np.less(y, upper_interval))/y.size

3.93 µs ± 28 ns per loop (mean ± std. dev. of 7 runs, 100000 loops each)
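One detail to keep in mind: np.less is a strict comparison, so the faster snippet counts only values strictly inside the interval, whereas the question uses >= and <=. For continuous predictions the two rarely differ, but they are not identical. The following is a minimal sketch of an inclusive variant. The function name `interval_coverage` and the small arrays are hypothetical and are not part of the original answer.

```python
import numpy as np

def interval_coverage(y, lower, upper):
    """Return the fraction of y values with lower <= y <= upper (inclusive)."""
    y = np.asarray(y)
    # Compare against the two bound arrays directly; no stacked matrix is built.
    inside = np.less_equal(lower, y) & np.less_equal(y, upper)
    # count_nonzero counts the True entries of the boolean mask.
    return np.count_nonzero(inside) / y.size

y = np.array([0.0, 1.0, 2.0, 3.0])
lower = np.array([-1.0, 1.0, 2.5, 2.0])
upper = np.array([1.0, 1.0, 3.0, 2.9])

coverage = interval_coverage(y, lower, upper)

# Strict version, as in the answer above: y == bound is not counted.
strict_coverage = np.count_nonzero(np.less(lower, y) * np.less(y, upper)) / y.size
```

With these arrays the second element sits exactly on both bounds, so the inclusive and strict versions give different fractions. Which one is appropriate depends on how the interval is defined.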
