消极的前瞻性Regex贪婪（为什么是*？太贪婪了）

Python实现

import re def test_re(arg, INSULTSTR): mm = re.search(r''' (?: # No grouping (?![A-Z].*?%s.*?\.)) # Negative zero-width # assertion: arg, followed by a period ([A-Z].*?\.) # Match a capital letter followed by a period ''' % arg, INSULTSTR, re.VERBOSE) if mm is not None: print "neg-lookahead(%s) MATCHED: '%s'" % (arg, mm.group(1)) else: print "Unable to match: neg-lookahead(%s) in '%s'" % (arg, INSULTSTR) INSULT = 'Yomama is ugly. And, she smells like a wet dog.' test_re('ugly', INSULT) test_re('looks', INSULT) test_re('smells', INSULT)

neg-lookahead(ugly) MATCHED: 'And, she smells like a wet dog.' neg-lookahead(looks) MATCHED: 'Yomama is ugly.' Unable to match: neg-lookahead(smells) in 'Yomama is ugly. And, she smells like a wet dog.'

3条回答

网友

1楼 · 编辑于 2024-09-28 05:23:37

您的问题是regex引擎将尽可能努力匹配(?![A-Z].*?$arg.*?\.)，因此对于“气味”大小写，它最终匹配整个字符串。（中间的句点被包含在.*?构造之一中）您应该限制负的lookahead大小写以尽可能多地匹配另一个case：

而不是：

(?:(?![A-Z].*?$arg.*?\.))([A-Z].*?\.)

使用：

^{pr2}$

现在，负lookahead不能比其他部分匹配更多的字符串，因为它必须在第一个句点处停止。在

网友

2楼 · 编辑于 2024-09-28 05:23:37

如果您想知道Perl在regex中做什么，可以使用regex调试器运行：

perl -Dr -e '"A two. A one." =~ /(?![A-Z][^\.]*(?:two)[^\.]*\.)([A-Z][^\.]+\.)/; print ">$1<\n"'

你要思考的产出。您需要一个用-DDEBUGGING构建的Perl。在

网友

3楼 · 编辑于 2024-09-28 05:23:37

#!/usr/bin/perl

sub test_re {
    $arg    = $_[0];
    $INSULTSTR = $_[1];
    $INSULTSTR =~ /(?:^|\.\s*)(?:(?![^.]*?$arg[^.]*\.))([^.]*\.)/;
    if ($1) {
        print "neg-lookahead($arg) MATCHED: '$1'\n";
    } else {
        print "Unable to match: neg-lookahead($arg) in '$INSULTSTR'\n";
    }
}

$INSULT = 'Yomama is ugly.  And, she smells like an wet dog.';
test_re('Yomama', $INSULT);
test_re('ugly', $INSULT);
test_re('looks', $INSULT);
test_re('And', $INSULT);
test_re('And,', $INSULT);
test_re('smells', $INSULT);
test_re('dog', $INSULT);

结果：

^{pr2}$

问题

编辑

Python实现

Perl实现

输出

相关问题更多 >

编程相关推荐

热门问题

热门文章