我如何首先将两个特定列的数据帧索引为一个？

data1 data2 a b c a c d a1 b1 c1 1a c1 1d b2 c2 2a c2 2d a3 c3 3a c3 3d 4a c4 4d

3条回答

网友

1楼 · 编辑于 2024-05-27 11:18:13

使用@IdoS安装程序：

import pandas as pd
data1 = pd.DataFrame({'a': ['a1', None, 'a3'],
                      'b': ['b1', 'b2', None],
                      'c': ['c1', 'c2', 'c3']})

data2 = pd.DataFrame({'a': ['1a', '2a', '3a', '4a'],
                      'c': ['c1', 'c2', 'c3', 'c4'],
                      'd': ['1d', '2d', '3d', '4d']})

您可以使用set_index，combine_first，然后重新编制索引：

^{pr2}$

输出：

    c   a     b   d
0  c1  a1    b1  1d
1  c2  2a    b2  2d
2  c3  a3  None  3d

网友

2楼 · 编辑于 2024-05-27 11:18:13

你也可以试试这个方法：

# set indexes
data1 = data1.set_index('c')
data2 = data2.set_index('c')

# join data on indexes
datax = data1.join(data2.drop('d', axis=1), rsuffix='_rr').reset_index()

# fill missing value in column a
datax['a'] = datax['a'].fillna(datax['a_rr'])

# drop unwanted columns
datax.drop('a_rr', axis=1, inplace=True)

# fill missing values with blank spaces
datax.fillna('', inplace=True)

# output
    a   b   c
0   a1  b1  c1
1   2a  b2  c2
2   a3      c3

^{pr2}$

网友

3楼 · 编辑于 2024-05-27 11:18:13

我不是百分之百的清楚你如何索引你的数据帧（data1和data2），但是如果你在列'c'上对它们进行索引，那就可以了。在

我是这样创建你的数据的：

import pandas as pd
data1 = pd.DataFrame({'a': ['a1', None, 'a3'],
                      'b': ['b1', 'b2', None],
                      'c': ['c1', 'c2', 'c3']})

data2 = pd.DataFrame({'a': ['1a', '2a', '3a', '4a'],
                      'c': ['c1', 'c2', 'c3', 'c4'],
                      'd': ['1d', '2d', '3d', '4d']})

然后我将两者的索引设置为列'c'：

^{pr2}$

然后我像您一样使用combine_first：

data_combined = data1.combine_first(data_2)

我明白了：

    a   b   d
c           
c1  a1  b1  1d
c2  2a  b2  2d
c3  a3  None    3d
c4  4a  NaN 4d

不知道为什么不需要索引为'c4'的行或列'd'，但删除它们很容易：

data_combined = data_combined.drop('d', axis=1)
data_combined = data_combined.loc[data_combined.index != 'c4']

然后我重新排序以得到你想要的结果：

data_combined = data_combined.reset_index()
data_combined = data_combined[['a', 'b', 'c']]
data_combined = data_combined.fillna('')


    a   b   c
0   a1  b1  c1
1   2a  b2  c2
2   a3      c3

相关问题更多 >

编程相关推荐

热门问题

热门文章

我如何首先将两个特定列的数据帧索引为一个？

相关问题 更多 >

编程相关推荐

热门问题

热门文章

相关问题更多 >