Regular expression to extract all characters between a special character and a word

3 Answers

User

#1 · edited 2024-09-27 23:28:48

$ grep -Po '(?<=>)[^<$]+' <<EOF
123abc >I want this1.myword
123<>I want this2.myword<>
EOF

I want this1.myword
I want this2.myword

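(?<=) is a positive lookbehind: (?<=>) requires a > immediately before the match without including it in the result.
[^] is a negated character class: [^<$]+ consumes one or more characters that are neither < nor a literal $.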
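
For anyone working in Python rather than the shell, here is a minimal sketch of the same pattern with the standard `re` module. The helper name `extract_after_gt` is made up for illustration. The text is processed line by line because, unlike grep, `[^<$]+` would otherwise run across newlines.

```python
import re

# Same pattern as the grep command: lookbehind for ">", then a run of
# characters that are neither "<" nor a literal "$".
PATTERN = re.compile(r'(?<=>)[^<$]+')


def extract_after_gt(text):
    """Collect all matches, one line at a time (hypothetical helper)."""
    results = []
    for line in text.splitlines():
        results.extend(PATTERN.findall(line))
    return results


sample = "123abc >I want this1.myword\n123<>I want this2.myword<>"
print(extract_after_gt(sample))
# ['I want this1.myword', 'I want this2.myword']
```

Note that the match still includes `.myword`, just like the grep output above.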

User

#2 · edited 2024-09-27 23:28:48

First, a plain dot . matches any character, so you need to escape it in the regex: \. Otherwise the regex will also find a match in a string such as:
123>Iwantthis!myword # extracts Iwantthis!myword

Second, the captured group must be allowed to contain whitespace characters: \s

I think this should work for you: r'([\w\s]+\.myword)'
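
To make the difference concrete, here is a small sketch comparing the unescaped and escaped dot with Python's `re` module. The names `UNESCAPED`, `ESCAPED` and `first_match` are only illustrative and not part of the answer.

```python
import re

# Unescaped dot: "." matches any character, including "!".
UNESCAPED = re.compile(r'([\w\s]+.myword)')
# Escaped dot: only a literal "." is accepted before "myword".
ESCAPED = re.compile(r'([\w\s]+\.myword)')


def first_match(pattern, text):
    """Return the first captured group, or None if nothing matches."""
    m = pattern.search(text)
    return m.group(1) if m else None


print(first_match(UNESCAPED, '123>Iwantthis!myword'))       # Iwantthis!myword
print(first_match(ESCAPED, '123>Iwantthis!myword'))         # None
print(first_match(ESCAPED, '123abc >I want this1.myword'))  # I want this1.myword
```

One caveat: `[\w\s]+` only covers word characters and whitespace, so any other punctuation inside the wanted text would cut the match short.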

User

#3 · edited 2024-09-27 23:28:48

Instead of using a regex, I would define a dedicated function to extract the substring:

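def substring(original_string):
    # Position of the first ">" and of the first ".myword" that follows it.
    start = original_string.find(">")
    end = original_string.find(".myword", start + 1)

    # Return the text between the two markers, or None if either is missing.
    if start > -1 and end > -1:
        return original_string[start + 1:end]
    return None


# df is assumed to be an existing pandas DataFrame with a 'text' column.
df['my_column'] = df['text'].apply(substring)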
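
The same find-based idea can be tried without pandas. Below is a rough, standalone sketch with the two markers turned into parameters. The name `extract_between` and its parameters are hypothetical, and the end marker is only searched for after the start marker.

```python
def extract_between(text, start_marker=">", end_marker=".myword"):
    """Return the text between the two markers, or None if either is missing."""
    start = text.find(start_marker)
    if start == -1:
        return None
    begin = start + len(start_marker)
    # Only look for the end marker after the start marker.
    end = text.find(end_marker, begin)
    if end == -1:
        return None
    return text[begin:end]


for line in ["123abc >I want this1.myword",
             "123<>I want this2.myword<>",
             "no markers here"]:
    print(repr(extract_between(line)))
# 'I want this1'
# 'I want this2'
# None
```

Unlike the regex answers, this returns only the text between `>` and `.myword`, without the trailing word.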
