How to optimize merging two large DataFrames

                               time  brand
user_id sku_id
27630   37957   2016-02-01 07:43:14      8
489     37957   2016-02-01 07:43:04      8
        37957   2016-02-01 07:43:02      8
661     21546   2016-02-01 07:43:02      6
……

                               time  brand
user_id sku_id
27630   37957   2016-02-01 07:43:14      8
489     37957   2016-02-01 07:43:04      8
764     37957   2016-02-01 07:43:02      8
667     2156    2016-02-01 07:43:02      3
……

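2 Answers

User

#1 · Edited 2024-09-28 17:29:59

In this case, I would use Index.intersection to keep only the rows of A whose (user_id, sku_id) index also appears in B:

Solution: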

In [159]: A.loc[A.index.intersection(B.index)]
Out[159]:
                               time  brand
user_id sku_id
489     37957   2016-02-01 07:43:04      8
        37957   2016-02-01 07:43:02      8
27630   37957   2016-02-01 07:43:14      8
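
The session above can be reproduced with a minimal sketch like the one below. The sample values are copied from the rows shown in the question. Two things are assumptions for illustration: the names A and B, and that both frames are indexed by (user_id, sku_id). The row order of the result may vary between pandas versions, so the sketch does not rely on it.

```python
import pandas as pd

# Assumed sample data, modelled on the rows shown in the question.
A = pd.DataFrame(
    {
        "user_id": [27630, 489, 489, 661],
        "sku_id": [37957, 37957, 37957, 21546],
        "time": pd.to_datetime(
            [
                "2016-02-01 07:43:14",
                "2016-02-01 07:43:04",
                "2016-02-01 07:43:02",
                "2016-02-01 07:43:02",
            ]
        ),
        "brand": [8, 8, 8, 6],
    }
).set_index(["user_id", "sku_id"])

B = pd.DataFrame(
    {
        "user_id": [27630, 489, 764, 667],
        "sku_id": [37957, 37957, 37957, 2156],
        "time": pd.to_datetime(
            [
                "2016-02-01 07:43:14",
                "2016-02-01 07:43:04",
                "2016-02-01 07:43:02",
                "2016-02-01 07:43:02",
            ]
        ),
        "brand": [8, 8, 8, 3],
    }
).set_index(["user_id", "sku_id"])

# Index entries present in both frames (duplicates are collapsed).
common = A.index.intersection(B.index)

# Select every row of A whose index entry is in the intersection.
result = A.loc[common]
print(result)
```

Because the selection works on index labels only, no columns from B are brought in. If columns from both frames are needed, a merge or join is the more natural tool.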

Pandas documentation: Merge, join, and concatenate

User

#2 · Edited 2024-09-28 17:29:59

Have you tried the merge operation?

df = df1.merge(df2, how='outer', on='your required column or index')

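Parameters:

how: {'left', 'right', 'outer', 'inner'}, default 'inner'

left: use only keys from the left frame, similar to a SQL left outer join; preserve key order

right: use only keys from the right frame, similar to a SQL right outer join; preserve key order

outer: use the union of keys from both frames, similar to a SQL full outer join; sort keys lexicographically

inner: use the intersection of keys from both frames, similar to a SQL inner join; preserve the order of the left keys

on: label or list. Column or index level names to join on. These must be found in both DataFrames. If on is None and the merge is not on indexes, it defaults to the intersection of the columns in both DataFrames.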
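
To make the effect of how and on concrete, here is a small sketch. The frames df1 and df2 and their contents are hypothetical, chosen so that two keys are shared and each frame has one key the other lacks.

```python
import pandas as pd

# Hypothetical frames with (user_id, sku_id) as ordinary columns.
df1 = pd.DataFrame(
    {
        "user_id": [27630, 489, 661],
        "sku_id": [37957, 37957, 21546],
        "time": ["07:43:14", "07:43:04", "07:43:02"],
    }
)
df2 = pd.DataFrame(
    {
        "user_id": [27630, 489, 764],
        "sku_id": [37957, 37957, 37957],
        "brand": [8, 8, 8],
    }
)

keys = ["user_id", "sku_id"]

inner = df1.merge(df2, how="inner", on=keys)  # keys present in both frames
left = df1.merge(df2, how="left", on=keys)    # every key from df1
right = df1.merge(df2, how="right", on=keys)  # every key from df2
outer = df1.merge(df2, how="outer", on=keys)  # union of keys

# With on omitted, merge joins on the columns the frames share,
# which here are user_id and sku_id, and uses how="inner".
default_on = df1.merge(df2)

for name, frame in [("inner", inner), ("left", left),
                    ("right", right), ("outer", outer)]:
    print(name, len(frame))
```

Rows that have no match on the other side are kept by left, right, and outer, with the missing columns filled with NaN. For two large frames, inner typically produces the smallest result, since only keys present in both survive.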
