如何使用python提取提到的内容？

>>>extract_mentions('@AndreaTantaros- You are a true journalistic\ professional. I so agree with what you say. Keep up the great\ work!@RepJohnLewis ') ['AndreaTantaros','RepJohnLewis'] >>>extract_mentions('@CPAC For all the closet #libertarians attending \ #CPAC2016 , I'll be there Thurs/Fri -- speaking Thurs. a.m. on the main\ stage. Look me up! @CPAC') ['CPAC','CPAC']

2条回答

网友

1楼 · 编辑于 2024-09-30 08:38:04

使用regex：

import re
input_string = '@AndreaTantaros- You are a true journalistic professional. I so agree with what you say. Keep up the great work!@RepJohnLewis '
result = re.findall("@([a-zA-Z0-9]{1,15})", input_string)

输出：['AndreaTantaros', 'RepJohnLewis']

如果要先删除电子邮件地址，只需执行以下操作：

^{pr2}$

网友

2楼 · 编辑于 2024-09-30 08:38:04

您可以使用以下正则表达式，因为它忽略电子邮件地址。在

(^|[^@\w])@(\w{1,15})

示例代码

^{pr2}$

这将返回：

[('', 'RayFranco'), (' ', 'jjconti'), ("'", 'username83'), (' ', 'probablyfaketwi')]

注意，twitter允许最多15个字符作为twitter用户名。基于Twitter specs：

Your username cannot be longer than 15 characters. Your real name can be longer (20 characters), but usernames are kept shorter for the sake of ease. A username can only contain alphanumeric characters (letters A-Z, numbers 0-9) with the exception of underscores, as noted above. Check to make sure your desired username doesn't contain any symbols, dashes, or spaces.

相关问题更多 >

编程相关推荐

热门问题

热门文章