Given two DataFrames of grouped items and a third DataFrame of equivalent groups, can I build a DataFrame of all unit pairs from matching groups?

   df1_unit  df2_unit
0  x         a
1  x         b         (shouldn't be included)
2  y         a
3  y         b
4  z         d
5  z         e         (shouldn't be included)
6  t         d
7  t         e
8  u         d
9  u         e

temp = dfeq.rename(columns={'df2u':'base_unit'}).merge(df2, on='base_unit', how='left')
temp = temp[['df1u', 'df2u']]
out = temp.rename(columns={'df1u':'base_unit'}).merge(df1, on='base_unit', how='left')
out = out[['df1u', 'df2u']]

1 answer

User
#1 · Posted 2024-10-02 10:33:36

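You can use the following approach:
Using the data in df_eq, we first map the base_unit column of df1 onto the equivalent base units of df2. To do that, we build a dictionary with DataFrame.to_dict, which is then passed to the Series.map method:
map_base_unit_df2 = dict(df_eq.to_dict(orient='split')['data'])
df1['base_unit'] = df1['base_unit'].map(map_base_unit_df2)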
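To make the dictionary step concrete, here is a minimal sketch. The contents of df_eq and the sample Series are assumptions made for illustration (the original data is not shown on this page); only the column names df1_unit and df2_unit come from the answer's output.

```python
import pandas as pd

# Hypothetical equivalence table: each row pairs a df1 base unit with a df2 base unit.
df_eq = pd.DataFrame({'df1_unit': ['x', 'z'], 'df2_unit': ['b', 'e']})

# orient='split' returns a dict with the keys 'index', 'columns' and 'data';
# 'data' is a list of rows. With exactly two columns, each row is a
# [key, value] pair, which dict() accepts directly.
rows = df_eq.to_dict(orient='split')['data']
mapping = dict(rows)
print(mapping)  # {'x': 'b', 'z': 'e'}

# Series.map looks each value up in the dictionary;
# values without an entry (here 'q') become NaN.
s = pd.Series(['x', 'x', 'z', 'q'])
mapped = s.map(mapping)
print(mapped)
```

Note that this only works as written when df_eq has exactly two columns; with more columns, dict() would raise an error because the rows would no longer be key/value pairs.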
Next, we build df_unrestricted with pd.merge, keeping only the columns we care about:
df_unrestricted = pd.merge(df1, df2, on='base_unit')[['df1_unit', 'df2_unit']]
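Finally, we apply the last restriction: the records that are present in df_eq must be removed. We do this with pd.concat + drop_duplicates + reset_index. Because keep=False drops every row that occurs more than once, any pair that appears in both df_unrestricted and df_eq disappears from the result: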
df_output = pd.concat([df_unrestricted, df_eq]).drop_duplicates(keep=False).reset_index(drop=True)
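One caveat of the concat + drop_duplicates(keep=False) idiom is that rows of df_eq with no counterpart in df_unrestricted end up in the result, and genuine duplicates inside df_unrestricted are dropped as well. If that matters for your data, an anti-join via merge with indicator=True is a possible alternative. This is a swapped-in technique, not the method used in this answer, and the data below is hypothetical.

```python
import pandas as pd

# Hypothetical candidate pairs and pairs to exclude.
df_unrestricted = pd.DataFrame({
    'df1_unit': ['x', 'x', 'y', 'y'],
    'df2_unit': ['a', 'b', 'a', 'b'],
})
# ('q', 'r') does not occur in df_unrestricted at all.
df_eq = pd.DataFrame({'df1_unit': ['x', 'q'], 'df2_unit': ['b', 'r']})

# The idiom from the answer: ('x', 'b') is removed, but ('q', 'r') leaks in.
via_concat = (
    pd.concat([df_unrestricted, df_eq])
    .drop_duplicates(keep=False)
    .reset_index(drop=True)
)

# Anti-join: a left merge with indicator=True adds a '_merge' column that
# marks each left row as 'both' (also in df_eq) or 'left_only' (not in df_eq).
merged = df_unrestricted.merge(
    df_eq, on=['df1_unit', 'df2_unit'], how='left', indicator=True
)
via_antijoin = (
    merged[merged['_merge'] == 'left_only']
    .drop(columns='_merge')
    .reset_index(drop=True)
)

print(len(via_concat), len(via_antijoin))  # 4 3
```

When every row of df_eq is guaranteed to occur exactly once in df_unrestricted, both approaches give the same pairs.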
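Full code:
import pandas as pd

# Map each df1 base unit to its equivalent df2 base unit
map_base_unit_df2 = dict(df_eq.to_dict(orient='split')['data'])
df1['base_unit'] = df1['base_unit'].map(map_base_unit_df2)
# Pair every df1 unit with every df2 unit that shares the same base unit
df_unrestricted = pd.merge(df1, df2, on='base_unit')[['df1_unit', 'df2_unit']]
# Drop the pairs that are listed in df_eq
df_output = pd.concat([df_unrestricted, df_eq]).drop_duplicates(keep=False).reset_index(drop=True)
print(df_output)
Output:
   df1_unit  df2_unit
0  x         a
1  y         a
2  y         b
3  z         d
4  t         d
5  t         e
6  u         d
7  u         e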
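Since the input frames are not shown on this page, here is a self-contained sketch with made-up inputs. The contents of df1, df2 and df_eq below are assumptions, chosen so that the steps yield the same eight pairs as the output listed above; the real data may differ.

```python
import pandas as pd

# Hypothetical inputs (not from the original post).
df1 = pd.DataFrame({
    'df1_unit': ['x', 'y', 'z', 't', 'u'],
    'base_unit': ['x', 'x', 'z', 'z', 'z'],
})
df2 = pd.DataFrame({
    'df2_unit': ['a', 'b', 'd', 'e'],
    'base_unit': ['b', 'b', 'e', 'e'],
})
# Equivalent base units: x (df1) ~ b (df2), z (df1) ~ e (df2).
df_eq = pd.DataFrame({'df1_unit': ['x', 'z'], 'df2_unit': ['b', 'e']})

# Step 1: translate df1's base units into df2's base units.
map_base_unit_df2 = dict(df_eq.to_dict(orient='split')['data'])
df1['base_unit'] = df1['base_unit'].map(map_base_unit_df2)

# Step 2: pair units that share a base unit (inner join on base_unit).
df_unrestricted = pd.merge(df1, df2, on='base_unit')[['df1_unit', 'df2_unit']]

# Step 3: remove the base-unit pairs themselves, i.e. the rows of df_eq.
df_output = (
    pd.concat([df_unrestricted, df_eq])
    .drop_duplicates(keep=False)
    .reset_index(drop=True)
)
print(df_output)
```

With these assumed inputs, df_unrestricted holds ten pairs, and the two pairs listed in df_eq, (x, b) and (z, e), are the ones removed in the last step.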
