如何修剪列中的字符串和字符串列表

0 ['AU06_threshold_h', 'AU12_threshold_h'] 1 AU14_threshold_h 2 AU26_threshold_h 3 NaN 4 AU01_threshold_h

2条回答

网友

1楼 · 编辑于 2024-10-02 04:22:04

使用自定义函数（基于regex替换）：

In [98]: pat = re.compile(r'[^\d]+')                                                                        

In [99]: def trim_non_num(s): 
    ...:     if isinstance(s, str): 
    ...:         return int(pat.sub('', s)) 
    ...:     elif isinstance(s, list): 
    ...:         return [int(pat.sub('', i)) for i in s] 
    ...:     return s 
    ...:                                                                                                    

In [100]: df['col'].apply(trim_non_num)                                                                     
Out[100]: 
0    [6, 12]
1         14
2         26
3        NaN
4          1
Name: col, dtype: object

网友

2楼 · 编辑于 2024-10-02 04:22:04

使用explode

df.col.explode().str.extract('(\d+)')[0]\
      .groupby(level=0).agg(lambda s: list(s) if len(s)>1 else s.iat[0])

0    [06, 12]
1          14
2          26
3         NaN
4          01
Name: 0, dtype: object

我只能说这不是一个好的设计。避免将列表和数字放在同一列中

相关问题更多 >

编程相关推荐

热门问题

热门文章

如何修剪列中的字符串和字符串列表

相关问题 更多 >

编程相关推荐

热门问题

热门文章

相关问题更多 >