<p>Use pandas <a href="https://pandas.pydata.org/pandas-docs/stable/reference/api/pandas.Series.str.extract.html" rel="nofollow noreferrer">str.extract</a> to search for a number followed by "ml":</p>
<pre><code>df['(column2)'] = df.iloc[:, 0].str.extract(r'(\d+ml)')

        (column1)  (column2)
0  Coke Can 300ml      300ml
1    Bottle 800ml      800ml
2             Cup        NaN
3   Bucket 2000ml     2000ml
</code></pre>
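<p>If you also want to remove the "ml" that follows the number, you can use a regex lookbehind assertion. It matches "ml" only when it comes directly after a digit, and replaces it with an empty string:</p>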
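<pre><code># regex=True is needed in pandas 2.0+, where str.replace treats the pattern as a literal by default
df.iloc[:, 0] = df.iloc[:, 0].str.replace(r'(?<=\d)ml', '', regex=True)

      (column1)  (column2)
0  Coke Can 300      300ml
1    Bottle 800      800ml
2           Cup        NaN
3   Bucket 2000     2000ml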
</code></pre>
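<p>For reference, here is a minimal self-contained sketch of both steps together. The sample DataFrame is rebuilt from the output shown above, so its construction is an assumption about what the original data looks like; <code>expand=False</code> is an optional choice that makes <code>str.extract</code> return a Series rather than a one-column DataFrame.</p>

```python
import pandas as pd

# Sample data reconstructed from the printed output above (an assumption,
# not the asker's actual DataFrame).
df = pd.DataFrame(
    {"(column1)": ["Coke Can 300ml", "Bottle 800ml", "Cup", "Bucket 2000ml"]}
)

# Step 1: capture "<digits>ml" into a new column. Rows without a match get NaN.
# expand=False returns a Series, which assigns cleanly to a single column.
df["(column2)"] = df["(column1)"].str.extract(r"(\d+ml)", expand=False)

# Step 2: strip "ml" only where it directly follows a digit (lookbehind).
# regex=True is passed explicitly because pandas 2.0+ defaults to literal matching.
df["(column1)"] = df["(column1)"].str.replace(r"(?<=\d)ml", "", regex=True)

print(df)
```

<p>Selecting the column by name instead of <code>iloc[:, 0]</code> is only a readability choice here; either form should behave the same on this data.</p>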