x、y坐标系numpy数组中最近点索引的求法问题的回答

x、y坐标系numpy数组中最近点索引的求法

回答此问题可获得 20 贡献值，回答如果被采纳可获得 50 分。

我有两个2d numpy数组：x_数组包含x方向的位置信息，y_数组包含y方向的位置。 然后我有一长串的x，y点。 对于列表中的每个点，我需要找到最接近该点的位置（在数组中指定）的数组索引。 基于这个问题，我天真地生成了一些有效的代码： <a href="https://stackoverflow.com/questions/2566412/find-nearest-value-in-numpy-array">Find nearest value in numpy array</a> 即 <pre><code>import time import numpy def find_index_of_nearest_xy(y_array, x_array, y_point, x_point): distance = (y_array-y_point)**2 + (x_array-x_point)**2 idy,idx = numpy.where(distance==distance.min()) return idy[0],idx[0] def do_all(y_array, x_array, points): store = [] for i in xrange(points.shape[1]): store.<a href="https://www.cnpython.com/list/append" class="inner-link">append</a>(find_index_of_nearest_xy(y_array,x_array,points[0,i],points[1,i])) return store # Create some dummy data y_array = numpy.random.random(10000).reshape(100,100) x_array = numpy.random.random(10000).reshape(100,100) points = numpy.random.random(10000).reshape(2,5000) # Time how long it takes to run start = time.time() results = do_all(y_array, x_array, points) end = time.time() print 'Completed in: ',end-start </code></pre> 我是在一个大数据集上做这个的，我真的想加快一点。有人能优化这个吗？ 谢谢。 <hr/> 更新：解决方案遵循@silvado和@justin的建议（如下） <pre><code># Shoe-horn existing data for entry into KDTree routines combined_x_y_arrays = numpy.dstack([y_array.ravel(),x_array.ravel()])[0] points_list = list(points.transpose()) def do_kdtree(combined_x_y_arrays,points): mytree = scipy.spatial.cKDTree(combined_x_y_arrays) dist, indexes = mytree.query(points) return indexes start = time.time() results2 = do_kdtree(combined_x_y_arrays,points_list) end = time.time() print 'Completed in: ',end-start </code></pre> 上面的代码将我的代码（在100x100矩阵中搜索5000个点）提高了100倍。有趣的是，使用scipy.spatial.KDTree（而不是scipy.spatial.cKDTree）提供了与我的原始解决方案相当的时间，因此使用cKDTree版本绝对值得。。。

0 条评论
分类：Python问答

默认排序时间排序

1 个回答

匿名 1天前

　擅长：python、mysql、java

x、y坐标系numpy数组中最近点索引的求法

1 个回答

相关Python问题