Can a regular expression match some patterns while excluding others?

>>> import re
>>> text = 'Wow.. :smiley_face: this is delicious!'  # A string containing emoji
>>> cleaned_text = re.sub('[^a-zA-Z0-9]+', ' ', text)  # regex to keep only alphanumerics
>>> print(cleaned_text)
Wow smiley face this is delicious

1 Answer

User
#1 · Posted on 2024-10-03 13:18:35

If you want to remove all special characters except the colons and underscores that appear in a colon-word(s)-colon context, you can use
re.sub(r'(:[a-z_]+:)|[^\w\s]|_', r'\1', text)
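See the regex demo. Details:
(:[a-z_]+:) - Capturing group 1 (\1): a :, then one or more lowercase ASCII letters or _, then a :
| - or
[^\w\s]|_ - any character other than a word or whitespace character, or a _ (the underscore is a word character, so it has to be added as a separate alternative)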
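To make the breakdown above concrete, here is a minimal sketch that lists every match the pattern finds in the sample string and reports whether capturing group 1 took part in it. The variable names are only illustrative. A match where group 1 participated is put back by \1; every other match is replaced with nothing.

```python
import re

pattern = re.compile(r'(:[a-z_]+:)|[^\w\s]|_')
text = 'Wow.. :smiley_face: this is delicious!'

# For each match, record the matched text and whether group 1 participated.
matches = [(m.group(0), m.group(1) is not None) for m in pattern.finditer(text)]

for matched, kept in matches:
    print(repr(matched), 'kept' if kept else 'removed')
```

Only the :smiley_face: token goes through group 1, so it is the only match that survives the substitution.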
See the Python demo:
import re
text = 'Wow.. :smiley_face: this is delicious!'  # A string containing emoji
print(re.sub(r'(:[a-z_]+:)|[^\w\s]|_', r'\1', text))
# => Wow :smiley_face: this is delicious
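If the same cleanup is needed in several places, it could be wrapped in a small helper. This is only a sketch: the names EMOJI_SAFE and strip_special_keep_emoji are made up for illustration, and the extra sample strings are assumptions chosen to show how the pattern behaves outside the original example.

```python
import re

# Same pattern as above, compiled once. The names here are hypothetical.
EMOJI_SAFE = re.compile(r'(:[a-z_]+:)|[^\w\s]|_')

def strip_special_keep_emoji(text: str) -> str:
    """Remove punctuation and underscores, but keep :emoji_name: tokens."""
    # \1 restores the emoji token when group 1 matched; on Python 3.5+
    # an unmatched group in the replacement becomes an empty string.
    return EMOJI_SAFE.sub(r'\1', text)

samples = [
    'Wow.. :smiley_face: this is delicious!',
    'snake_case, really?',   # underscore outside colons is removed
    ':Smile: ok',            # uppercase letter, so not treated as an emoji token
]

results = [strip_special_keep_emoji(s) for s in samples]
for r in results:
    print(r)
```

Note that [a-z_] only accepts lowercase ASCII letters and underscores, so a token such as :Smile: loses its colons. If emoji names may contain digits or uppercase letters, the character class would need to be widened accordingly. On Python versions older than 3.5, a replacement function such as lambda m: m.group(1) or '' should avoid the error raised for an unmatched group.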
