Finding left-table records with no match in a pandas left join

import pandas as pd

Students = pd.DataFrame({
    'Class': [7, 7, 8],
    'Section': ['A', 'B', 'B'],
    'RollNo': [2, 3, 4],
    'Student': ['Ram', 'Rahim', 'Robert']
})
Fee = pd.DataFrame({
    'Class': [7, 7, 8],
    'Section': ['A', 'B', 'B'],
    'RollNo': [2, 2, 3],
    'Fee': [10, 20, 30]
})

2 Answers

User

Answer #1 · edited 2024-09-28 05:18:00

If the Fee column of the Fee DataFrame contains no NaN values, use merge and then filter with boolean indexing and isna:

df = pd.merge(Students, Fee, how='left')
print (df)
   Class  RollNo Section Student   Fee
0      7       2       A     Ram  10.0
1      7       3       B   Rahim   NaN
2      8       4       B  Robert   NaN

df1 = df[df['Fee'].isna()].drop('Fee', axis=1)
# for older versions of pandas
#df1 = df[df['Fee'].isnull()].drop('Fee', axis=1)
print (df1)
   Class  RollNo Section Student
1      7       3       B   Rahim
2      8       4       B  Robert

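A more general solution, which also works when the Fee column itself contains NaN values, is to pass the indicator parameter to merge and keep only the rows marked left_only.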

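A minimal sketch of the indicator approach, assuming the Students and Fee frames from the question. The names keys, merged and unmatched are illustrative, and the join columns are spelled out explicitly here rather than inferred by merge:

```python
import pandas as pd

Students = pd.DataFrame({
    'Class': [7, 7, 8],
    'Section': ['A', 'B', 'B'],
    'RollNo': [2, 3, 4],
    'Student': ['Ram', 'Rahim', 'Robert'],
})
Fee = pd.DataFrame({
    'Class': [7, 7, 8],
    'Section': ['A', 'B', 'B'],
    'RollNo': [2, 2, 3],
    'Fee': [10, 20, 30],
})

# Columns shared by both frames (assumed to be the join keys).
keys = ['Class', 'Section', 'RollNo']

# indicator=True adds a '_merge' column whose value is
# 'left_only', 'right_only' or 'both' for each row.
merged = pd.merge(Students, Fee, on=keys, how='left', indicator=True)

# Keep rows found only in Students, restricted to the Students columns.
unmatched = merged.loc[merged['_merge'] == 'left_only', Students.columns]
print(unmatched)
```

Because the filter looks at the _merge column instead of the Fee values, a matched row whose Fee happens to be NaN is not mistaken for an unmatched one.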

User

Answer #2 · edited 2024-09-28 05:18:00

I was intrigued by this concept.

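Option 1

Use pandas.concat with the keys parameter,
so that the Students part of the resulting MultiIndex has 'stu' as its first-level value.
Use pandas.DataFrame.drop_duplicates with keep=False to drop every row whose key combination appears more than once.
Use loc to focus on the 'stu' part, that is, the rows that came from Students.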

catted = pd.concat([Students, Fee], keys=['stu', 'fee'])
dropped = catted.drop_duplicates(['Class', 'RollNo', 'Section'], keep=False)
index = dropped.loc['stu'].index

Students.loc[index]

   Class  RollNo Section Student
1      7       3       B   Rahim
2      8       4       B  Robert

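Option 2

Build sets from the lists of key tuples, take their difference, and merge the result, as a constructed DataFrame, back with Students.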

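A minimal sketch of the set-difference idea, assuming the Students and Fee frames from the question. The names keys, student_keys, fee_keys, missing, helper and result are illustrative and not taken from the original answer:

```python
import pandas as pd

Students = pd.DataFrame({
    'Class': [7, 7, 8],
    'Section': ['A', 'B', 'B'],
    'RollNo': [2, 3, 4],
    'Student': ['Ram', 'Rahim', 'Robert'],
})
Fee = pd.DataFrame({
    'Class': [7, 7, 8],
    'Section': ['A', 'B', 'B'],
    'RollNo': [2, 2, 3],
    'Fee': [10, 20, 30],
})

# Columns shared by both frames (assumed to be the join keys).
keys = ['Class', 'Section', 'RollNo']

# One plain tuple per row of key values, collected into sets.
student_keys = set(Students[keys].itertuples(index=False, name=None))
fee_keys = set(Fee[keys].itertuples(index=False, name=None))

# Key combinations present in Students but absent from Fee.
missing = sorted(student_keys - fee_keys)

# Constructed helper frame holding only the missing keys; the dtypes are
# aligned with Students so the merge also works when nothing is missing.
helper = pd.DataFrame(missing, columns=keys).astype(Students[keys].dtypes.to_dict())

# An inner merge keeps exactly the Students rows whose keys are missing in Fee.
result = Students.merge(helper, on=keys)
print(result)
```

Unlike Option 1, the merge returns a fresh 0-based index, so the original row labels of Students are not preserved.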
