找不出正则表达式与lis的匹配

&nbsp 1 Clemson A = &nbsp 5 Ohio State A = &nbsp155 Tennessee-Martin AA = &nbsp152 Louisiana-Monroe A = &nbsp104 Hawai'i A = &nbsp193 VMI AA = &nbsp202 Stephen F. Austin AA =

3条回答

网友

1楼 · 编辑于 2024-10-03 09:09:00

这相对容易：

import re

raw = """
&nbsp  1  Clemson              A  =
&nbsp  5  Ohio State           A  =
&nbsp155  Tennessee-Martin     AA =
&nbsp152  Louisiana-Monroe     A  =
&nbsp104  Hawai'i              A  =
&nbsp193  VMI                  AA =
&nbsp202  Stephen F. Austin    AA =
"""

teams = re.findall(r"&nbsp\s*\d+\s+(.*?)\s+A+\s+=", raw)

for team in teams:
    print(team)

# Clemson
# Ohio State
# Tennessee-Martin
# Louisiana-Monroe
# Hawai'i
# VMI
# Stephen F. Austin

网友

2楼 · 编辑于 2024-10-03 09:09:00

尝试使用以下正则表达式：

\d\s+(.*?)\s+=

    - \d match digit
    - \s+ followed by one or more space
    - (.*) anything
    - \s+ followed by one or more spaces
    - = followed by  `=`

被抓获的小组会给你一个小组的名字

Regex Demo

编辑如果A/AA不是团队名称的一部分，请执行以下操作：

\d\s+(.*?)\s+[A]+\s+=

Updated Regex

网友

3楼 · 编辑于 2024-10-03 09:09:00

像这样的怎么样？不需要正则表达式

lines是字符串列表，其中每个字符串都是数据中的一行

for line in lines:
    splits = line.split(" ")
    teamName = splits[1]
    if hasNumbers(teamName):
        teamName = splits[2]

    print(teamName)


def hasNumbers(inputString):
    return any(char.isdigit() for char in inputString)

相关问题更多 >

编程相关推荐

热门问题

热门文章

找不出正则表达式与lis的匹配

相关问题 更多 >

编程相关推荐

热门问题

热门文章

相关问题更多 >