我在下面有两个数据框,df_purchase(1)和df_login(2)
+--------+-----+--------+------------+--------------------+-------------+--------------------------+
| | age | gender | ttp | count | sum(amount) | region |
+--------+-----+--------+------------+--------------------+-------------+--------------------------+
| 49427 | 63 | M | 824.731412 | 2 | 25.00 | Omaha, Nebraska |
| 28433 | 49 | M | 1.166250 | 2 | 41.94 | Catasauqua, Pennsylvania |
| 4162 | 29 | M | 5.620949 | 2 | 51.78 | Eagle Center, Iowa |
| 18747 | 43 | M | 153.502072 | 2 | 23.84 | Pacific, Washington |
| 45173 | 59 | M | 0.027257 | 2 | 13.98 | De Soto, Missouri |
+--------+-----+--------+------------+--------------------+-------------+--------------------------+
+--------+-----+--------+------------+--------------------+-------------+--------------------------+
| | age | gender | count | region | | |
| 671766 | 84 | M | 13900 | New York, New York | | |
| 671166 | 84 | F | 7619 | New York, New York | | |
| 672209 | 85 | F | 6483 | New York, New York | | |
| 672671 | 85 | M | 5808 | New York, New York | | |
| 195201 | 34 | M | 3817 | New York, New York | | |
+--------+-----+--------+------------+--------------------+-------------+--------------------------+
我正在尝试加入dfòu登录,以dfòu购买年龄、性别和地区,熊猫代码如下:
df = pd.merge(df_purchase, df_login[['count']],
how='left', on=['age', 'gender', 'region'])
但是,我一直得到这个错误:KeyError: 'age'
有什么想法吗?
关键错误源于:
您只从
df_login
中选择了一列,您需要:我假设这不是您的完整数据,因为您在
df_login
中的age和region列中没有公共值。相关问题 更多 >
编程相关推荐