逐行Pandas数据帧，有条件地用最后一列的值替换多个列值

fNum 1 2 3 4 5 6 7 labelx Index 1 1 0 1 1 1 0 0 0 2 2 1 0 0 1 1 0 0 0 2 4 1 0 0 0 0 0 1 0 3 5 1 0 0 0 0 0 0 0 0 6 1 0 0 1 0 0 0 0 3 7 1 0 0 0 1 0 0 0 3 1 2 0 1 0 0 0 0 0 2 2 2 1 1 1 0 0 0 0 2 3 2 1 1 1 0 0 0 0 2 4 2 1 1 0 0 0 0 0 2 5 2 0 0 0 0 1 0 0 0 6 2 0 0 0 0 1 1 1 3 7 2 0 0 0 0 1 1 1 3

dfIX = Intitial_dataframe.ix[:, 2:8] #<--The "body" of the data labelx_frame = Intitial_dataframe.ix[:, 8:9] #<-- The labelx column dfIX[dfIX>0] = labelx_frame #<-- Attempt to replace values, nan instead

2条回答

网友

1楼 · 编辑于 2024-09-30 08:29:08

您也可以使用numpy来完成此操作。在

df = pd.DataFrame({1: [0,0,0,1,1,0,0,1,0,1], 2: [1,1,1,1,0,0,0,0,1,0], 3: [1,1,0,1,0,0,0,1,1,0], 'label_x': [2,2,3,0,0,2,3,2,2,2]})

1  2  3  label_x
0  0  1  1        2
1  0  1  1        2
2  0  1  0        3
3  1  1  1        0
4  1  0  0        0
5  0  0  0        2
6  0  0  0        3
7  1  0  1        2
8  0  1  1        2
9  1  0  0        2

还有，这个

^{pr2}$

收益率

 1  2  3  label_x
0  0  2  2        2
1  0  2  2        2
2  0  3  0        3
3  0  0  0        0
4  0  0  0        0
5  0  0  0        2
6  0  0  0        3
7  2  0  2        2
8  0  2  2        2
9  2  0  0        2

网友

2楼 · 编辑于 2024-09-30 08:29:08

我重新创建了部分数据，因为输入数据最初是作为图片发布的，而不是可复制文本。我将让您根据您的具体数据调整此方法。在

下面是使用^{}最简单、最易读的方法：

>>> df = pd.DataFrame({1: [0,0,0,1,1,0,0,1,0,1], 2: [1,1,1,1,0,0,0,0,1,0], 3: [1,1,0,1,0,0,0,1,1,0], 'label_x': [2,2,3,0,0,2,3,2,2,2]})
>>> df
   1  2  3  label_x
0  0  1  1        2
1  0  1  1        2
2  0  1  0        3
3  1  1  1        0
4  1  0  0        0
5  0  0  0        2
6  0  0  0        3
7  1  0  1        2
8  0  1  1        2
9  1  0  0        2
>>> for c in df:
...     if c != 'label_x':
...         df[c] = np.where(df[c] == 1, df['label_x'], df[c])
... 
>>> df
   1  2  3  label_x
0  0  2  2        2
1  0  2  2        2
2  0  3  0        3
3  0  0  0        0
4  0  0  0        0
5  0  0  0        2
6  0  0  0        3
7  2  0  2        2
8  0  2  2        2
9  2  0  0        2

这里有另一种方法，但我只是将其作为Python“强大”的一个例子（我不知道这个词是否正确）。这实际上是我最初解决你的问题的方法，但我认为只提供这个就有点过分了。如果我是你，我更喜欢numpy.where。但这只是为了演示：

^{pr2}$

还有，看看那个！我们得到同样的答案。在

有关所有这些**的详细信息，请参见Unpacking generalizations in Python 3。它是合并词典的有效语法。在

您还可以考虑这样做，基本上遍历new_values中每个列的对应列表：

for c in [1,2,3]:
    df[c] = new_values[c]

有很多方法可以剥这只猫的皮！在

相关问题更多 >

编程相关推荐

热门问题

热门文章