如何获取以“#”开头的所有术语？

3条回答

网友

1楼 · 编辑于 2024-09-29 21:57:47

像Python这样的好编程语言不需要正则表达式：

  hashed = [ word for word in line.split() if word.startswith("#") ]

网友

2楼 · 编辑于 2024-09-29 21:57:47

你可以用

compiled = re.compile(r'#\w*')
compiled.findall(line)

输出：

^{pr2}$

但是有一个问题。如果搜索类似'blahblahblah #Syrup #nshit #thebluntislit beg#end'的字符串，输出将是['#Syrup', '#nshit', '#thebluntislit', '#end']。在

这个问题可以通过使用正向回溯来解决：

compiled = re.compile(r'(?<=\s)#\w*')

（在这里不可能使用\b（单词边界），因为#不在\w符号[0-9a-zA-Z_]之间，这可能构成正在搜索的边界的单词）。在

网友

3楼 · 编辑于 2024-09-29 21:57:47

似乎re.findall()将执行您想要的操作。在

matches = re.findall(r'#\w*', line)