在Pandas中使用Regex提取特定单词

2条回答

网友

1楼 · 编辑于 2024-05-17 10:17:49

您可以使用extract：

df['country'] = df['country'].str.extract(r'Country:\s*(\w+)')

熊猫测试：

import pandas as pd
import numpy as np
df = pd.DataFrame({'country' : [np.nan, 'Country: America', 'Country France ... More countries...']})
df['country'].str.extract(r'Country:\s*(\w+)')
#          0
# 0      NaN
# 1  America
# 2      NaN

网友

2楼 · 编辑于 2024-05-17 10:17:49

您还可以避免regex并使用^{}：

In [86]: df = pd.DataFrame({'country' : [np.nan, 'Country: America', 'Country: France ... More countries...', np.nan, 'Country: India']})

In [87]: df
Out[87]: 
                                 country
0                                    NaN
1                       Country: America
2  Country: France ... More countries...
3                                    NaN
4                         Country: India

In [94]: df.country.str.split(':').str[1].str.split().str[0]
Out[94]: 
0        NaN
1    America
2     France
3        NaN
4      India
Name: country, dtype: object

相关问题更多 >

编程相关推荐

热门问题

热门文章

在Pandas中使用Regex提取特定单词

相关问题 更多 >

编程相关推荐

热门问题

热门文章

相关问题更多 >