擅长:python、mysql、java
<h3><code>duplicated</code></h3>
<pre><code>mask = df.duplicated(['id', 'sample'], keep=False)
df.assign(mutation=df.mutation.mask(mask, 'multi')).drop_duplicates()
id mutation sample
0 MYC multi s1
2 MYCL nonsens s1
3 MYCL missense s2
4 MYCN missense s3
5 MYCN multi s1
</code></pre>
<hr/>
<h3><code>groupby</code></h3>
^{pr2}$