numpy中无时间项外积的非平凡和

N = 7 M = 5 # Create the unit vectors, just so we have some examples, # this is not meant to be elegant phi = np.random.rand(N)*2*np.pi ctheta = np.random.rand(N)*2 - 1 stheta = np.sqrt(1-ctheta**2) nhat = np.array([stheta*np.cos(phi), stheta*np.sin(phi), ctheta]).T # Create the other vectors m = np.random.rand(M,3) # Calculate the quantity we desire, here using broadcasting. S = np.average(np.abs(np.sum(nhat*m[:,np.newaxis,:], axis=-1)), axis=0)

3条回答

网友

1楼 · 编辑于 2024-07-03 06:03:24

这相当慢，但不会创建大的中间矩阵。在

vals = np.zeros(N)
for i in xrange(N):
    u = nhat[i]
    for v in m:
        vals[i]+=abs(np.dot(u,v))
    vals[i]=vals[i]/M

编辑：在for循环外移动除以M。在

第二版：新观念、旧观念及相关评论。在

^{pr2}$

这很快，但有时会给出不同的值，我正在研究原因，但同时可能会有所帮助。在

编辑3：啊，这是绝对值的东西。嗯

>>> S
array([ 0.28620962,  0.65337876,  0.37470707,  0.46500913,  0.49579837,
        0.29348924,  0.27444208,  0.74586928,  0.35789315,  0.3079964 ,
        0.298353  ,  0.42571445,  0.32535728,  0.87505053,  0.25547394,
        0.23964505,  0.44773271,  0.25235646,  0.4722281 ,  0.33003338])
>>> vals
array([ 0.2099343 ,  0.6532155 ,  0.33039334,  0.45366889,  0.48921527,
        0.20467291,  0.16585856,  0.74586928,  0.31234917,  0.22198642,
        0.21013519,  0.41422894,  0.26020981,  0.87505053,  0.1199069 ,
        0.06542492,  0.44145805,  0.08455833,  0.46824704,  0.28483342])

time to compute S: 0.000342130661011 seconds
time to compute vals: 7.29560852051e-05 seconds

编辑4：好吧，如果你的单位向量大部分都是正值，这应该运行得更快，假设m中的向量总是正数，就像它们在你的虚拟数据中一样。在

m2 = np.average(m,0)
vals = np.zeros(N)
for i in xrange(N):
    u=nhat[i]
    if u[0] >= 0 and u[1] >= 0 and u[2] >= 0:
        vals[i] = abs(np.dot(u,m2))
    else:
        for j in xrange(M):
            vals[i]+=abs(np.dot(u,m[j]))
        vals[i]/=M

网友

2楼 · 编辑于 2024-07-03 06:03:24

我不认为有任何简单的方法（除了Cython之类的）来加速你的精确操作。但是你可能需要考虑你是否真的需要计算你正在计算的东西。因为如果你可以用root mean square代替绝对值的平均值，你仍然会以某种方式平均内积的大小，但是你可以在一次射击中得到：

rms = np.sqrt(np.einsum('ij,il,kj,kl,k->i', nhat, nhat, m, m, np.ones(M)/M))

这与执行以下操作相同：

^{pr2}$

是的，这并不完全是你所要求的，但我担心这是最接近你将得到的矢量化方法。如果您决定沿着这条路走下去，请看np.einsum对大的N和M的表现：当传递太多参数和索引时，它有陷入困境的趋势。在

网友

3楼 · 编辑于 2024-07-03 06:03:24

根据一些评论，使用cython似乎是最好的方法。我愚蠢地从来没有考虑过使用赛顿。事实证明，生成工作代码相对容易。在

经过一番搜索，我把下面的cython代码放在一起。这不是最通用的代码，可能不是最好的编写方法，而且可能会变得更高效。即使如此，它只比原始问题中的einsum()代码慢25%，所以它不是太糟糕！它的目的是显式地处理在原始问题中创建的数组（因此是输入数组的假定模式）。
尽管有警告，但它确实为原始问题提供了一个合理有效的解决方案，并且可以作为类似情况下的起点。在

import numpy as np
cimport numpy as np
import cython
DTYPE = np.float64
ctypedef np.float64_t DTYPE_t
cdef inline double d_abs (double a) : return a if a >= 0 else -a

@cython.boundscheck(False)
@cython.wraparound(False)
def process_vectors (np.ndarray[DTYPE_t, ndim=2, mode="fortran"] nhat not None,
                     np.ndarray[DTYPE_t, ndim=2, mode="c"] m not None) :
    if nhat.shape[1] != m.shape[1] :
        raise ValueError ("Arrays must contain vectors of the same dimension")
    cdef Py_ssize_t imax = nhat.shape[0]
    cdef Py_ssize_t jmax = m.shape[0]
    cdef Py_ssize_t kmax = nhat.shape[1] # same as m.shape[1]
    cdef np.ndarray[DTYPE_t, ndim=1] S = np.zeros(imax, dtype=DTYPE)
    cdef Py_ssize_t i, j, k
    cdef DTYPE_t val, tmp
    for i in range(imax) :
        val = 0
        for j in range(jmax) :
            tmp = 0
            for k in range(kmax) :
                tmp += nhat[i,k] * m[j,k]
            val += d_abs(tmp)
        S[i] = val / jmax
    return S

相关问题更多 >

编程相关推荐

热门问题

热门文章