从列中删除数据

Year From country To country Points 0 2016 Albania Armenia 0 1 2016 Albania Armenia 2 2 2016 Albania Australia 12 Year From country To country Points 2129 2016 United Kingdom The Netherlands 0 2130 2016 United Kingdom Ukraine 10 2131 2016 United Kingdom Ukraine 5 [2132 rows x 4 columns]

Year From country To country Points 0 2016 Albania Armenia 0 2 2016 Albania Australia 12 4 2016 Albania Austria 0 Year From country To country Points 46 2016 Albania The Netherlands 0 48 2016 Albania Ukraine 0 50 2016 Albania United Kingdom 5 [50 rows x 4 columns]

2条回答

网友

1楼 · 编辑于 2024-09-29 21:35:52

最简单的解决方案是按“to country”名称分组，并从每个组中选取第一行（或最后一行，如果您愿意）：

df.groupby('To country').first().reset_index()
#        To country  Year    From country  Points
#0          Armenia  2016         Albania       0
#1        Australia  2016         Albania      12
#2  The Netherlands  2016  United Kingdom       0
#3          Ukraine  2016  United Kingdom      10

与aryamccarthy的解决方案相比，这个解决方案可以让您更好地控制要保留哪些副本。你知道吗

网友

2楼 · 编辑于 2024-09-29 21:35:52

不，这种行为是正确的假设每一支球队都和另一支球队比赛，它在寻找第一，而且所有这些第一都是“来自”阿尔巴尼亚。你知道吗

根据您下面所说的，您希望保留第0行，而不是第1行，因为它同时重复To和From国家。消除这些问题的方法是：

df.drop_duplicates(subset=['To country', 'From country'], inplace=True)

相关问题更多 >

编程相关推荐

热门问题

热门文章