相互依赖矩阵的矢量化计算问题的回答

相互依赖矩阵的矢量化计算

回答此问题可获得 20 贡献值，回答如果被采纳可获得 50 分。

我在多个时间分辨率下跟踪多个离散时间序列，得到一个SxRxB矩阵，其中S是时间序列的数目，R是不同分辨率的数目，B是缓冲区，即每个序列记住多少值。每个序列都是离散的，并且使用有限范围的自然数来表示其值。我把这些叫做“符号”。在 对于每个系列，我想计算在所有测量中，任何先前测量的符号直接位于当前测量的符号之前的频率。我已经用如下所示的for循环解决了这个问题，但出于明显的原因，我想将其矢量化。在 我不确定我构建数据结构的方式是否有效，因此我愿意听取建议。尤其是比率矩阵，我认为可以用不同的方法。在 提前谢谢！在 <pre><code>def supports_loop(data, num_series, resolutions, buffer_size, vocab_size): # For small test matrices we can calculate the complete matrix without problems indices = [] indices.<a href="https://www.cnpython.com/list/append" class="inner-link">append</a>(xrange(num_series)) indices.append(xrange(vocab_size)) indices.append(xrange(num_series)) indices.append(xrange(vocab_size)) indices.append(xrange(resolutions)) # This is huge! :/ # dimensions: # series and value for which we calculate, # series and value which precedes that measurement, # resolution ratios = np.full((num_series, vocab_size, num_series, vocab_size, resolutions), 0.0) for idx in itertools.product(*indices): s0, v0 = idx[0],idx[1] # the series and symbol for which we calculate s1, v1 = idx[2],idx[3] # the series and symbol which should precede the we're calculating for res = idx[4] # Find the positions where s0==v0 found0 = np.where(data[s0, res, :] == v0)[0] if found0.size == 0: continue #print('found {}={} at {}'.format(s0, v0, found0)) # Check how often s1==v1 right before s0==v0 candidates = (s1, res, (found0 - 1 + buffer_size) % buffer_size) found01 = np.count_nonzero(data[candidates] == v1) if found01 == 0: continue print('found {}={} following {}={} at {}'.format(s0, v0, s1, v1, found01)) # total01 = number of positions where either s0 or s1 is defined (i.e. >=0) total01 = len(np.argwhere((data[s0, res, :] >= 0) & (data[s1, res, :] >= 0))) ratio = (float(found01) / total01) if total01 > 0 else 0.0 ratios[idx] = ratio return ratios def stackoverflow_example(fnc): data = np.array([ [[0, 0, 1], # series 0, resolution 0 [1, 3, 2]], # series 0, resolution 1 [[2, 1, 2], # series 1, resolution 0 [3, 3, 3]], # series 1, resoltuion 1 ]) num_series = data.shape[0] resolutions = data.shape[1] buffer_size = data.shape[2] vocab_size = np.max(data)+1 ratios = fnc(data, num_series, resolutions, buffer_size, vocab_size) coordinates = np.argwhere(ratios > 0.0) nz_values = ratios[ratios > 0.0] print(np.hstack((coordinates, nz_values[:,None]))) print('0/0 precedes 0/0 in 1 out of 3 cases: {}'.format(np.isclose(ratios[0,0,0,0,0], 1.0/3.0))) print('1/2 precedes 0/0 in 2 out of 3 cases: {}'.format(np.isclose(ratios[0,0,1,2,0], 2.0/3.0))) </code></pre> 预期输出（21对，5列为坐标，后跟found count）： ^{pr2}$ 在上面的例子中，序列0中的0在三分之二的情况下跟随序列1中的2（因为缓冲区是圆形的），因此[0，0，1，2，0]处的比率将为~0.6666。同样是序列0，值0在三种情况中有一种会跟随它自己，所以在[0，0，0，0]处的比率将是~0.3333。还有一些是&gt；0.0的。在 <hr/> 我在两个数据集上测试每个答案：一个很小的（如上图所示）和一个更真实的（100系列，5个分辨率，每个系列10个值，50个符号）。在 <h2>结果</h2> <pre><code>Answer Time (tiny) Time (huge) All pairs found (tiny=21) ----------------------------------------------------------------------- Baseline ~1ms ~675s (!) Yes Saedeas ~0.13ms ~1.4ms No (!) Saedeas2 ~0.20ms ~4.0ms Yes, +cross resolutions Elliot_1 ~0.70ms ~100s (!) Yes Elliot_2 ~1ms ~21s (!) Yes Kuppern_1 ~0.39ms ~2.4s (!) Yes Kuppern_2 ~0.18ms ~28ms Yes Kuppern_3 ~0.19ms ~24ms Yes David ~0.21ms ~27ms Yes </code></pre> 赛德亚斯第二种方法是明显的赢家！非常感谢大家：）

0 条评论
分类：Python问答

默认排序时间排序

1 个回答

匿名 1天前

　擅长：python、mysql、java

首先，不显式地嵌套for循环会给自己带来一点伤害。结果你重复了很多努力，却没有保存任何记忆。当循环嵌套时，可以将一些计算从一个级别移动到另一个级别，并确定哪些内部循环可以矢量化。在 <pre><code>def supports_5_loop(data, num_series, resolutions, buffer_size, vocab_size): ratios = np.full((num_series, vocab_size, num_series, vocab_size, resolutions), 0.0) for res in xrange(resolutions): for s0 in xrange(num_series): # Find the positions where s0==v0 for v0 in np.unique(data[s0, res]): # only need to find indices once for each series and value found0 = np.where(data[s0, res, :] == v0)[0] for s1 in xrange(num_series): # Check how often s1==v1 right before s0==v0 candidates = (s1, res, (found0 - 1 + buffer_size) % buffer_size) total01 = np.logical_or(data[s0, res, :] >= 0, data[s1, res, :] >= 0).sum() # can skip inner loops if there are no candidates if total01 == 0: continue for v1 in xrange(vocab_size): found01 = np.count_nonzero(data[candidates] == v1) if found01 == 0: continue ratio = (float(found01) / total01) ratios[(s0, v0, s1, v1, res)] = ratio return ratios </code></pre> 你会在计时中看到，大部分的速度提升来自于不重复的努力。在 一旦创建了嵌套结构，就可以开始研究向量化和其他优化。在 ^{pr2}$ 不幸的是，我只能对最里面的循环进行矢量化，这只会额外获得10%的加速。在最里面的循环之外，你不能保证所有的向量都是相同的大小，所以你不能构建一个数组。在 <pre><code>In [121]: (np.all(supports_loop(data, num_series, resolutions, buffer_size, vocab_size) == supports_5_loop(data, num_series, resolutions, buffer_size, vocab_size))) Out[121]: True In [122]: (np.all(supports_loop(data, num_series, resolutions, buffer_size, vocab_size) == supports_4_loop(data, num_series, resolutions, buffer_size, vocab_size))) Out[122]: True In [123]: %timeit(supports_loop(data, num_series, resolutions, buffer_size, vocab_size)) 2.29 ms ± 73.9 µs per loop (mean ± std. dev. of 7 runs, 100 loops each) In [124]: %timeit(supports_5_loop(data, num_series, resolutions, buffer_size, vocab_size)) 949 µs ± 5.37 µs per loop (mean ± std. dev. of 7 runs, 1000 loops each) In [125]: %timeit(supports_4_loop(data, num_series, resolutions, buffer_size, vocab_size)) 843 µs ± 3.21 µs per loop (mean ± std. dev. of 7 runs, 1000 loops each) </code></pre>

相互依赖矩阵的矢量化计算

1 个回答

相关Python问题