如何在python中为dataframe添加另一个labellike列？

2条回答

网友

1楼 · 编辑于 2024-09-26 17:46:00

在pandas中实现快速性能的关键是使用向量化操作，即避免（正如您所注意的）缓慢的Python循环的内置操作。你知道吗

我比较喜欢的方法是对这样的更改进行标记，即对差异调用np.sign（当然，首先要做import numpy as np）：

>>> df
   id  openPrice  closePrice
0   1         10          13
1   2         20          15
>>> df["movement"] = np.sign(df["closePrice"] - df["openPrice"])
>>> df
   id  openPrice  closePrice  movement
0   1         10          13         1
1   2         20          15        -1

这样做的一个好处是，如果openPrice == closePrice，您会自动获得movement == 0，这非常方便。你知道吗

如果你更喜欢手工操作，你可以做向量运算，比如

>>> df["closePrice"] > df["openPrice"]
0     True
1    False
dtype: bool
>>> (df["closePrice"] > df["openPrice"]) * 2 - 1
0    1
1   -1
dtype: int64

因为这里有False == 0和True == 1，但是你必须有特殊情况closePrice == openPrice。你知道吗

网友

2楼 · 编辑于 2024-09-26 17:46:00

可以使用where设置设置值的条件，最后一个参数是条件为False时的值：

In [6]:

df['movement'] = np.where(df['openPrice'] < df['closePrice'], 1, -1 )
df
Out[6]:
   id  openPrice  closePrice  movement
0   1         10          13         1
1   2         20          15        -1

[2 rows x 4 columns]

相关问题更多 >

编程相关推荐

热门问题

热门文章

如何在python中为dataframe添加另一个labellike列？

相关问题 更多 >

编程相关推荐

热门问题

热门文章

相关问题更多 >