numpy加法过程中隐式发生了什么？

def sqdist(X: np.ndarray) -> np.ndarray: # Organize input and output N, D = X.shape X = X.astype(np.float32) Y = np.zeros((N, N)).astype(np.float32) # Prepare memory pointers dataIn = X.ctypes.data_as(cdll.POINTER(cdll.c_float)) dataOut = Y.ctypes.data_as(cdll.POINTER(cdll.c_float)) # Call the sqdist dll cdll.load(_get_build_default()) cdll.computeSquaredEuclideanDistances(dataIn, N, D, dataOut) cdll.unload() # Return as numpy array return Y

scipydist = scipy.spatial.distance.cdist(a, a, metric='sqeuclidean') cudadist1 = cuda.sqdist(a) cudadist2 = cuda.sqdist(b) plt.figure(figsize=(20, 5)) plt.subplot(131) plt.imshow(scipydist, vmax=3000) plt.colorbar() plt.title("scipydist") plt.subplot(132) plt.imshow(cudadist1, vmax=3000) plt.colorbar() plt.title("cudadist1") plt.subplot(133) plt.imshow(cudadist2, vmax=3000) plt.colorbar() plt.title("cudadist2") plt.show()

1条回答

网友

1楼 · 发布于 2024-10-01 11:28:26

好吧。这似乎是由于某种内存布局。使用我的包装中的np.astype，它默认为order='K'：

K means as close to the order the array elements appear in memory as possible

而CUDA应用程序希望数据按顺序C排列。将包装器更新为以下内容修复了此问题：

X = X.astype(np.float32, order='C')
Y = np.zeros((N, N)).astype(np.float32, order='C')

因此，我猜numpy添加隐式地将底层数据重新排序为适合它的数据

相关问题更多 >

编程相关推荐

热门问题

热门文章