Using regex in Python to combine characters (if the result exists in a list)

import re

emoticon = [':)',':-)',':-(',':D']

def emoticonNormalize(text,loop=2):
    text = re.sub(r'\s(\S)\s(\S)\s(\S)\s', r' \1\2\3 ', text)
    text = re.sub(r'\s(\S)\s(\S)\s', r' \1\2 ', text)
    text = re.sub(r'\s(\S)\s(\S)', r' \1\2', text)
    print(text)

texta = 'I dont like politic : - ( but still read about it : - ) _ because its funny . : D and unpredictable : )'
print(texta)
texted = emoticonNormalize(texta,1)

3 Answers

User
Answer #1 · edited 2024-09-20 22:54:13

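Here I pass re.sub() a regular expression that matches the emoticons to be tightened, along with a function, tighten_emoticon, that removes the spaces between the characters of each match object.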
import re

def tighten_emoticon(matchobj):
    # Remove every space inside the matched emoticon.
    return matchobj.group(0).replace(" ", "")

original = 'I dont like politic : - ( but still read about it : - ) _ because its funny . : D and unpredictable : )'
# Literal parentheses must be escaped inside the pattern.
tightened = re.sub(r'(: - \(|: - \)|: D|: \))', tighten_emoticon, original)
Edit
Alternatively, the regular expression can be generated dynamically from the emoticon list:
[code not preserved in the source]
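A minimal sketch of that dynamic approach, since the answer's own code is not available here. The helper names build_emoticon_pattern and tighten_emoticons are hypothetical, and allowing any amount of whitespace (\s*) between the characters is an assumption rather than something the answer states.

```python
import re

emoticon = [':)', ':-)', ':-(', ':D']

def build_emoticon_pattern(emoticons):
    # Hypothetical helper: for each emoticon, escape its characters and
    # allow optional whitespace between them, e.g. ':-)' -> ':\s*\-\s*\)'.
    # Longer emoticons are listed first so they are tried before shorter ones.
    alternatives = [
        r'\s*'.join(re.escape(ch) for ch in emo)
        for emo in sorted(emoticons, key=len, reverse=True)
    ]
    return re.compile('|'.join(alternatives))

def tighten_emoticons(text, emoticons):
    # Hypothetical helper: strip the whitespace inside every match.
    pattern = build_emoticon_pattern(emoticons)
    return pattern.sub(lambda m: re.sub(r'\s+', '', m.group(0)), text)

original = 'I dont like politic : - ( but still read about it : - ) _ because its funny . : D and unpredictable : )'
tightened = tighten_emoticons(original, emoticon)
print(tightened)
```

Because the pattern is built only from the entries in emoticon, spaced-out characters are joined only when the result is one of the listed emoticons. Note that a pattern such as `:\s*D` would also join text like ": Do", so stricter boundaries may be needed for real input.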

User
Answer #2 · edited 2024-09-20 22:54:13

Hopefully this works better.
import re

emoticon = [':)',':-)',':-(',':D']

def emoticonNormalize(text,loop=2):
    # Each pattern allows optional whitespace between the emoticon's characters.
    text = re.sub(r'\:\s*D', ':D', text)
    text = re.sub(r':\s*\-\s*\)', ':-)', text)
    text = re.sub(r'\:\s*\-\s*\(', ':-(', text)
    text = re.sub(r'\:\s*\)', ':)', text)
    print(text)
    return text  # return the result so that texted is not None

texta = 'I dont like politic : - ( but still read about it : - ) _ because its funny . : D and unpredictable : )'
print(texta)
texted = emoticonNormalize(texta,1)
Output:
[output not preserved in the source]

User
Answer #3 · edited 2024-09-20 22:54:13

re.sub(r'(?<=\:)( )','',texta)
Out[72]: 'I dont like politic :- ( but still read about it :- ) _ because its funny . :D and unpredictable :)'
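As the output shows, the lookbehind removes only the space directly after a colon, so ':- (' and ':- )' keep their second space. One possible extension, sketched here as an assumption and not part of the original answer, is a second pass that removes the space following ':-'. Two passes are used because Python's re module only accepts fixed-width lookbehinds, and the second lookbehind relies on the first pass having already produced ':-'.

```python
import re

texta = 'I dont like politic : - ( but still read about it : - ) _ because its funny . : D and unpredictable : )'

# Pass 1: remove a space that directly follows ':' (the original answer's idea).
step1 = re.sub(r'(?<=:) ', '', texta)
# Pass 2 (assumed extension): remove a space that directly follows ':-'.
step2 = re.sub(r'(?<=:-) ', '', step1)
print(step2)
```

This variant does not consult the emoticon list, so it also removes the space after any other colon, such as in "note: hello".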
