Regex pattern to find n non-space characters of length x after a certain substring

2 answers

User

#1 · edited 2024-10-04 05:29:59

How about:

# Remove all spaces with replace(), then take the 10 characters after "CIG"

x = 'CIG7826328A2B FORNITURA ENERGIA ELETTRICA U'.replace(' ', '')
x = x.split("CIG")[1][:10]
# x = '7826328A2B'

x = '/BENEF/FORNITURA GAS FEB-20 CIG Z9F 27D2198 01762-0000031'.replace(' ', '')
x = x.split("CIG")[1][:10]
# x = 'Z9F27D2198'

This works as long as the string contains only one "CIG".
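
The same idea can be wrapped in a small helper. This is only a sketch: the function name `code_after_marker` and its parameters are made up for illustration, and it swaps `split()` for `str.partition()`, which cuts at the first "CIG" only and makes a missing marker easy to detect.

```python
def code_after_marker(text, marker="CIG", length=10):
    """Return the first `length` non-space characters after the first `marker`.

    Hypothetical helper: returns None when the marker is missing or when
    fewer than `length` characters follow it.
    """
    compact = text.replace(" ", "")            # drop spaces, as in the answer
    _, found, rest = compact.partition(marker)  # split at the first marker only
    if not found or len(rest) < length:
        return None
    return rest[:length]


print(code_after_marker('CIG7826328A2B FORNITURA ENERGIA ELETTRICA U'))
# 7826328A2B
print(code_after_marker('/BENEF/FORNITURA GAS FEB-20 CIG Z9F 27D2198 01762-0000031'))
# Z9F27D2198
print(code_after_marker('no marker here'))
# None
```

Like the original snippet, this is case-sensitive and only strips the space character, not tabs or newlines.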

User

#2 · edited 2024-10-04 05:29:59

You can use

r'(?i)cig[\s:.]*(\S(?:\s*\S){9})(?!\S)'

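See the regex demo. Details:

(?i) - turns on case-insensitive matching
cig - the literal string cig
[\s:.]* - zero or more whitespace characters, : or .
(\S(?:\s*\S){9}) - Group 1: one non-whitespace character, then nine repetitions of zero or more whitespace characters followed by one non-whitespace character (10 non-whitespace characters in total)
(?!\S) - the match must be followed by whitespace or the end of the string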
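
For readability, the breakdown above can be written directly into the pattern with `re.VERBOSE`. This is a sketch of the same regex, not a different one; the names `CIG_PATTERN` and `samples` are chosen here for illustration.

```python
import re

# Same pattern as above, spelled out with re.VERBOSE so each piece carries a comment.
CIG_PATTERN = re.compile(
    r"""
    cig             # the literal marker (case-insensitive via re.IGNORECASE)
    [\s:.]*         # optional separators: whitespace, ':' or '.'
    (               # group 1: the code itself
      \S            #   first non-whitespace character
      (?:\s*\S){9}  #   nine more, each optionally preceded by whitespace
    )
    (?!\S)          # must be followed by whitespace or end of string
    """,
    re.IGNORECASE | re.VERBOSE,
)

samples = [
    "CIG7826328A2B FORNITURA ENERGIA ELETTRICA U",
    "/BENEF/FORNITURA GAS FEB-20 CIG Z9F 27D2198 01762-0000031",
]
for sample in samples:
    match = CIG_PATTERN.search(sample)
    # Group 1 may still contain inner whitespace, so strip it out.
    print(re.sub(r"\s+", "", match.group(1)))
# 7826328A2B
# Z9F27D2198
```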

In Python, you can use

import re
text = "/BENEF/FORNITURA GAS FEB-20 CIG Z9F               27D2198 01762-0000031"
pattern = r'cig[\s:.]*(\S(?:\s*\S){9})(?!\S)'
matches = re.finditer(pattern, text, re.I)
for match in matches:
  print(re.sub(r'\s+', '', match.group(1)), ' found at ', match.span(1))

# => Z9F27D2198  found at  (32, 57)

See the Python demo.
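
Since the question asks about n characters in general, the count of 10 can be made a parameter. The following is a minimal sketch under that assumption; `find_codes`, `marker`, and `n` are hypothetical names, not part of the answer above.

```python
import re


def find_codes(text, marker="cig", n=10):
    """Return every run of exactly `n` non-whitespace characters after `marker`.

    Hypothetical generalization of the pattern above: the marker is escaped
    and the repetition count {9} becomes {n - 1}.
    """
    if n < 1:
        raise ValueError("n must be at least 1")
    pattern = rf"{re.escape(marker)}[\s:.]*(\S(?:\s*\S){{{n - 1}}})(?!\S)"
    # Inner whitespace is removed from each captured group before returning.
    return [
        re.sub(r"\s+", "", m.group(1))
        for m in re.finditer(pattern, text, re.I)
    ]


text = "/BENEF/FORNITURA GAS FEB-20 CIG Z9F               27D2198 01762-0000031"
print(find_codes(text))
# ['Z9F27D2198']
print(find_codes("CIG 12345678901"))
# []
```

The second call returns an empty list because the (?!\S) lookahead rejects a run that continues past the tenth non-whitespace character.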
