如何在Python中对齐两个不同长度的数组（在没有匹配元素的情况下使用NaNs）

import numpy as np import scipy.interpolate as sp x = [2, 5, 7, 11, 13, 16, 19, 23, 25, 30] y = [11, 10, 12, 14, 16, 19, 17, 14, 18, 17] xd = np.linspace(0, max(x), int(max(x))+1) # create the new x axis ipo = sp.splrep(x, y, k=3) # cubic spline yd = sp.splev(xd, ipo) # interpolated y values newY = np.zeros((1, len(yd)), dtype=float) # preallocate for the filled y values for i in x: if(i in xd): idx, = np.where(xd == i) # find where the original x value is in the new x axis idx2, = np.where(np.array(x) == i) newY[0, int(idx)] = y[int(idx2)] # replace the y value of the new vector with the y value from original set

def A(): for i in x: if(i in xd): idx, = np.where(xd == i) # find where the original x value is in the new x axis idx2, = np.where(np.array(x) == i) newY[int(idx)] = y[int(idx2)] # replace the y value of the new vector with the y value from original set def B(): for i, date in enumerate(xd): if date in x: new_y[i] = date def C(): known_values = dict(zip(x, y)) for i,u in enumerate(xd): if u in known_values: newY[i] = known_values[u]

2条回答

网友

1楼 · 编辑于 2024-10-06 06:45:54

抱歉，如果我完全误解了您的代码，但是np.linspace(0, max(x), int(max(x))+1)不是一种简单的np.array(range(1+max(x)))的迂回方式吗？看起来好像您只是在0和max(x)之间的范围内（包括1+max(x)）进行线性间隔采样，这与只获取0和max（x）之间的整数相同。在

在这种情况下，有必要这样做吗？在

if(i in xd): 
    idx, = np.where(xd == i) # find where the original x value is in the new x axis

如果xd真的只是一个从0到max（x）的整数列表，那么x中的所有元素都将在xd中，并且{}应该始终等于i。在

（当然，这假设x只包含非负整数值。）

^{pr2}$

编辑：在更一般的情况下，新的轴不仅仅是整数范围0..max（x），我建议在将已知值转换为字典之后，在数组上迭代。这将更有效，因为线性搜索被字典查找所取代。在

known_values = dict(zip(x, y))

xd = [... your new axis ...]
newY = np.zeros(len(xd))

for i,x in enumerate(xd):
    if x in known_values:
        newY[i] = known_values[x]

编辑：有趣的是，性能要差得多——如果已知值太少（那么在大数组中循环开销要大得多），显然会发生这种情况，但我认为这在实践中不会是一个问题。在

还有另一种循环方式，它利用了这两种顺序，但它替代了np.哪里如果不是显式的，那就取决于MPI的显式循环有多高效：

^{4}$

网友

2楼 · 编辑于 2024-10-06 06:45:54

我明白这一切的目的是在同一个图上绘制y值，为什么不直接做呢？轴可以轻松处理同一绘图上的不同x轴，如下所示：

import numpy as np
import scipy.interpolate as sp
import matplotlib.pyplot as plt

x = [2, 5, 7, 11, 13, 16, 19, 23, 25, 30]
y = [11, 10, 12, 14, 16, 19, 17, 14, 18, 17]

xd = np.linspace(0, max(x), int(max(x)) + 1)  # create the new x axis
ipo = sp.splrep(x, y, k=3)  # cubic spline
yd = sp.splev(xd, ipo)  # interpolated y values

fig = plt.figure()
ax = fig.add_subplot(111)
ax.plot(x, y, label='Original')
ax.plot(xd, yd, label='Interpolated')
plt.legend()
plt.grid()

plt.show()

如您所愿，每个“y”数据都与它自己的x轴对齐，而无需进行任何预处理。这里所做的唯一插值是Matplotlib用于显示的插值。在

由于您确实需要用Nan填充数组，下面是一种有效的方法：

^{pr2}$

也许可以用一些华丽的单行代码来减少

相关问题更多 >

编程相关推荐

热门问题

热门文章