正则表达式，用于在一个词之后和一个特殊字符之前提取文本，并排除所有其他数字

Section 2.1. 1.1.14. Minimum Rent Schedule (subiect to adjustment, if applicable):less than or greater than twelve (12) full calendar months (and such proration or adjustment being based upon the actual number of days in such Lease Year)

2条回答

网友

1楼 · 编辑于 2024-09-22 20:33:25

这是一种模式。你知道吗

例如：

import re

s = "Section 2.1. 1.1.14. Minimum Rent Schedule (subiect to adjustment, if applicable):less than or greater than twelve (12) full calendar months (and such proration or adjustment being based upon the actual number of days in such Lease Year)"
print(re.match(r"Section[\d.\s]+(.*?):", s).group(1))

输出：

Minimum Rent Schedule (subiect to adjustment, if applicable)

如果有多个元素，请使用re.findall

例如：

print(re.findall(r"Section[\d.\s]+(.*?):", your_text))

网友

2楼 · 编辑于 2024-09-22 20:33:25

您尝试的模式使用character class，它将匹配列出的任何字符1+次。你知道吗

要不匹配任何在Section之后包含数字的字符，可以重复0多次匹配空格，后跟至少包含一个数字的非空格字符。你知道吗

捕获组中不包含数字的内容。你知道吗

Section (?:[^\s\d]*\d\S* )*([^:]+):

解释

Section 匹配节和空格
(?:非捕获组
- [^\s\d]*使用negated character class匹配除空白字符和数字0+以外的任何字符
- \d\S* 然后匹配一个数字，后跟匹配0+乘以一个非空白字符
)*关闭组并重复0+次
([^:]+):在组1中捕获匹配1+倍除:之外的任何字符，然后匹配:

Regex demo

例如

import re

regex = r"Section (?:[^\s\d]*\d\S* )*([^:]+):"
s = "Section 2.1. 1.1.14. Minimum Rent Schedule (subiect to adjustment, if applicable):less than or greater than twelve (12) full calendar months (and such proration or adjustment being based upon the actual number of days in such Lease Year)"
print(re.match(regex, s).group(1))

结果

Minimum Rent Schedule (subiect to adjustment, if applicable)

要找到多个，可以使用关于芬德尔地址：

print(re.findall(regex, s))

Demo using re.findall

相关问题更多 >

编程相关推荐

热门问题

热门文章