Columns overlap but no suffix specified

Posted 2024-06-26 14:39:59


I am trying to join two DataFrames:

df_games

       jogo_id  rodada_id  time_id     time_nome  adversario_id  ... variacao_num  media_num  jogos_num    status   ano
0       232423          1    293.0  Athletico-PR            267  ...         0.00       0.00          0      Nulo  2019
1       232423          1    293.0  Athletico-PR            267  ...         0.00       0.00          0      Nulo  2019
2       232423          1    293.0  Athletico-PR            267  ...         0.00       0.00          0      Nulo  2019
3       232423          1    293.0  Athletico-PR            267  ...         0.00       0.00          0      Nulo  2019
4       232423          1    293.0  Athletico-PR            267  ...        -2.17       0.40          1      Nulo  2019
...        ...        ...      ...           ...            ...  ...          ...        ...        ...       ...   ...
30825   232429         38    267.0         Vasco            315  ...         0.00       0.70          1      Nulo  2019
30826   232429         38    267.0         Vasco            315  ...        -0.30       2.93         25    Dúvida  2019
30827   232429         38    267.0         Vasco            315  ...         0.00       2.39         17  Suspenso  2019
30828   232429         38    267.0         Vasco            315  ...         0.00       0.00          0      Nulo  2019
30829   232429         38    267.0         Vasco            315  ...         0.00       0.32          8    Dúvida  2019

shape: (30830, 19)

and df_players:

           rank     atleta_nome        posicao
player_id                                     
82730        39  Marcelo Rangel        Goleiro
100651      136         Rodrygo       Atacante
87863       140      Arrascaeta  Meio-Campista

shape: (733, 4)

When I try to join them like this:

df_joined = df_games.set_index('player_id').join(players.set_index('player_id'))

I get an error:

ValueError: columns overlap but no suffix specified: Index(['atleta_nome', 'posicao'],dtype='object')
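
For reference, here is a minimal sketch that seems to reproduce the same message. The two small frames are hypothetical stand-ins (the names games and players and all values are made up for illustration); the only assumption carried over from the real data is that atleta_nome and posicao exist on both sides, which is what the error text reports.

```python
import pandas as pd

# Hypothetical miniature frames; the values are illustrative, not the real data.
games = pd.DataFrame(
    {
        "player_id": [82730, 100651],
        "atleta_nome": ["Marcelo Rangel", "Rodrygo"],
        "posicao": ["Goleiro", "Atacante"],
        "media_num": [0.0, 0.4],
    }
)
players = pd.DataFrame(
    {
        "player_id": [82730, 100651],
        "rank": [39, 136],
        "atleta_nome": ["Marcelo Rangel", "Rodrygo"],
        "posicao": ["Goleiro", "Atacante"],
    }
)

left = games.set_index("player_id")
right = players.set_index("player_id")

# Non-index columns that exist on both sides are what DataFrame.join rejects.
overlap = left.columns.intersection(right.columns)
print(list(overlap))

try:
    left.join(right)
    error_message = None
except ValueError as exc:
    error_message = str(exc)
print(error_message)
```

Running the same columns.intersection check against the real frames should show which column names collide.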

But if I try to join the same df_players with another DataFrame, df_games_2, which looks like this:

     Unnamed: 0                            atletas.nome         atletas.slug      atletas.apelido  ...  PP  GC  DP  round
0             1              Glaybson Yago Souza Lisboa         yago-pikachu         Yago Pikachu  ... NaN NaN NaN      1
1             2            Juan Ramón Cazares Sevillano              cazares              Cazares  ... NaN NaN NaN      1
2             3  Alessandro Vinícius Gonçalves da Silva  alessandro-vinicius  Alessandro Vinícius  ... NaN NaN NaN      1
3             4                    Diego Alves Carreira          diego-alves          Diego Alves  ... NaN NaN NaN      1
4             5             Réver Humberto Alves Araújo                rever                Réver  ... NaN NaN NaN      1
..          ...                                     ...                  ...                  ...  ...  ..  ..  ..    ...
843         844                  Lucas Piton Crivellaro          lucas-piton          Lucas Piton  ... NaN NaN NaN     38
844         845         Alan Carlos de Paula Dias Filho                 alan                 Alan  ... NaN NaN NaN     38
845         846                  Roni Medeiros de Moura                 roni                 Roni  ... NaN NaN NaN     38
846         847               Alexander Silva de Lucena            alexander            Alexander  ... NaN NaN NaN     38
847         848          Riquelme Carvalho Araújo Viana             riquelme             Riquelme  ... NaN NaN NaN     38

[30581 rows x 34 columns]

the join works.

Why does joining on player_id work in the second case but fail in the first?

How can I fix this? I need to join df_games.
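
Two possible ways around the error, sketched on hypothetical miniature frames (the names games and players, the "_player" suffix, and all values are assumptions for illustration, not taken from the real data): either pass a suffix so both copies of the overlapping columns are kept, or drop the duplicated columns from one side before joining.

```python
import pandas as pd

# Hypothetical miniature frames; the values are illustrative, not the real data.
games = pd.DataFrame(
    {
        "player_id": [82730, 100651],
        "atleta_nome": ["Marcelo Rangel", "Rodrygo"],
        "posicao": ["Goleiro", "Atacante"],
        "media_num": [0.0, 0.4],
    }
)
players = pd.DataFrame(
    {
        "player_id": [82730, 100651],
        "rank": [39, 136],
        "atleta_nome": ["Marcelo Rangel", "Rodrygo"],
        "posicao": ["Goleiro", "Atacante"],
    }
)

left = games.set_index("player_id")
right = players.set_index("player_id")

# Option 1: keep both copies; right-hand duplicates get the "_player" suffix.
joined_suffix = left.join(right, rsuffix="_player")

# Option 2: drop the duplicated columns from the right side, then join.
overlap = left.columns.intersection(right.columns)
joined_dedup = left.join(right.drop(columns=overlap))

print(joined_suffix.columns.tolist())
print(joined_dedup.columns.tolist())
```

Option 2 is probably the tidier choice when the duplicated columns hold the same values on both sides; Option 1 is safer when they might differ and both versions need to be compared.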

