我正在尝试连接两个数据帧:
df_games
:
jogo_id rodada_id time_id time_nome adversario_id ... variacao_num media_num jogos_num status ano
0 232423 1 293.0 Athletico-PR 267 ... 0.00 0.00 0 Nulo 2019
1 232423 1 293.0 Athletico-PR 267 ... 0.00 0.00 0 Nulo 2019
2 232423 1 293.0 Athletico-PR 267 ... 0.00 0.00 0 Nulo 2019
3 232423 1 293.0 Athletico-PR 267 ... 0.00 0.00 0 Nulo 2019
4 232423 1 293.0 Athletico-PR 267 ... -2.17 0.40 1 Nulo 2019
... ... ... ... ... ... ... ... ... ... ... ...
30825 232429 38 267.0 Vasco 315 ... 0.00 0.70 1 Nulo 2019
30826 232429 38 267.0 Vasco 315 ... -0.30 2.93 25 Dúvida 2019
30827 232429 38 267.0 Vasco 315 ... 0.00 2.39 17 Suspenso 2019
30828 232429 38 267.0 Vasco 315 ... 0.00 0.00 0 Nulo 2019
30829 232429 38 267.0 Vasco 315 ... 0.00 0.32 8 Dúvida 2019
shape: (30830, 19)
和{
rank atleta_nome posicao
player_id
82730 39 Marcelo Rangel Goleiro
100651 136 Rodrygo Atacante
87863 140 Arrascaeta Meio-Campista
shape: (733, 4)
当我尝试加入他们时,就像这样:
df_joined = df_games.set_index('player_id').join(players.set_index('player_id'))
我得到一个错误:
ValueError: columns overlap but no suffix specified: Index(['atleta_nome', 'posicao'],dtype='object')
但是如果我尝试将相同的df_players
与另一个df_games_2连接起来,如下所示:
Unnamed: 0 atletas.nome atletas.slug atletas.apelido ... PP GC DP round
0 1 Glaybson Yago Souza Lisboa yago-pikachu Yago Pikachu ... NaN NaN NaN 1
1 2 Juan Ramón Cazares Sevillano cazares Cazares ... NaN NaN NaN 1
2 3 Alessandro Vinícius Gonçalves da Silva alessandro-vinicius Alessandro Vinícius ... NaN NaN NaN 1
3 4 Diego Alves Carreira diego-alves Diego Alves ... NaN NaN NaN 1
4 5 Réver Humberto Alves Araújo rever Réver ... NaN NaN NaN 1
.. ... ... ... ... ... .. .. .. ...
843 844 Lucas Piton Crivellaro lucas-piton Lucas Piton ... NaN NaN NaN 38
844 845 Alan Carlos de Paula Dias Filho alan Alan ... NaN NaN NaN 38
845 846 Roni Medeiros de Moura roni Roni ... NaN NaN NaN 38
846 847 Alexander Silva de Lucena alexander Alexander ... NaN NaN NaN 38
847 848 Riquelme Carvalho Araújo Viana riquelme Riquelme ... NaN NaN NaN 38
[30581 rows x 34 columns]
连接起作用
为什么在第一种情况下我可以通过player_id
加入,而在第二种情况下我不能
我该如何解决这个问题?我需要加入df_games
您可以使用
merge
代替join
,也可以在调用中传递后缀:相关问题 更多 >
编程相关推荐