<p><strong><em>Get a mask of the duplicates</em></strong></p>
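<pre><code># Columns that define a duplicate: two stops with the same coordinates
cols = ['stop_lat', 'stop_lon']
# True for every row whose coordinates already appeared in an earlier row
dups = df.duplicated(subset=cols)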
</code></pre>
<p><strong><em>Subset df with the mask</em></strong></p>
^{pr2}$
<p><strong><em>The duplicates can have duplicates of their own</em></strong></p>
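<pre><code># Keep only the first duplicate row for each coordinate pair
first_dup = df[dups].drop_duplicates(subset=cols)
# Index by the coordinates and keep stop_id as a Series
first_dup = first_dup.set_index(cols).stop_id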
</code></pre>
<p><strong><em>Assign accordingly</em></strong></p>
<pre><code># Write each duplicate's stop_id into a new column, aligned on first_dup's index
nodups.loc[first_dup.index, 'stop_id2'] = first_dup
nodups
</code></pre>
<p><a href="https://i.stack.imgur.com/3gta6.png" rel="nofollow noreferrer"><img src="https://i.stack.imgur.com/3gta6.png" alt="enter image description here"/></a></p>
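<p>Putting the steps together, here is a minimal end-to-end sketch. The sample data is made up for illustration. The definition of <code>nodups</code> is an assumption: the non-duplicate rows indexed by <code>cols</code>, which is what the later <code>.loc</code> assignment with <code>first_dup.index</code> appears to require.</p>

```python
import pandas as pd

# Hypothetical sample data: stops 2 and 3 repeat the coordinates of stop 1,
# and stop 5 repeats the coordinates of stop 4.
df = pd.DataFrame({
    "stop_id": [1, 2, 3, 4, 5, 6],
    "stop_lat": [10.0, 10.0, 10.0, 20.0, 20.0, 30.0],
    "stop_lon": [1.0, 1.0, 1.0, 2.0, 2.0, 3.0],
})

cols = ["stop_lat", "stop_lon"]

# True for every row whose coordinates already appeared in an earlier row.
dups = df.duplicated(subset=cols)

# Assumption: keep the non-duplicate rows, indexed by the coordinate pair.
nodups = df[~dups].set_index(cols)

# Duplicates can repeat too, so keep only the first duplicate per pair.
first_dup = df[dups].drop_duplicates(subset=cols)
first_dup = first_dup.set_index(cols).stop_id

# Align on the (stop_lat, stop_lon) index; pairs with no duplicate stay NaN.
nodups.loc[first_dup.index, "stop_id2"] = first_dup
print(nodups)
```

<p>With this sample data, the rows for stops 1 and 4 get <code>stop_id2</code> values of 2 and 5, and stop 6 keeps <code>NaN</code> because its coordinates are unique. Stop 3 is dropped because only the first duplicate per coordinate pair is kept.</p>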