从Perl到Regex的转换

00000001;Root;; 00000002; Documents;; 00000003; oracle-advanced_plsql.zip;file; 00000004; Public;; 00000005; backup;; 00000006; 20110323-JM-F.7z.001;file; 00000007; 20110426-JM-F.7z.001;file; 00000008; 20110603-JM-F.7z.001;file; 00000009; 20110701-JM-F-via-summer_school;; 00000010; 20110701-JM-F-yyy.7z.001;file;

#filename: perl_regex.pl #/usr/bin/perl -w while(<>) { s/^(.*?;.*?)(\w)/$1;$2/; print $_; }

s/ # start search-and-replace regexp ^ # start at the beginning of this line ( # save the matched characters until ')' in $1 .*?; # go forward until finding the first semicolon .*? # go forward until finding... (to be continued below) ) ( # save the matched characters until ')' in $2 \w # ... the next alphanumeric character. ) / # continue with the replace part $1;$2 # write all characters found above, but insert a ; before $2 / # finish the search-and-replace regexp.

2条回答

网友

1楼 · 编辑于 2024-09-27 17:34:37

Python正则表达式与Perl的非常相似，除了：

Python中没有正则表达式。它应该用字符串表示。我在下面的代码中使用了r'raw string literal'。在
反向引用表示为\1，\2。。或者\g<1>，\g<2>。。在
。。。在

使用^{}替换。在

import re
import sys

for line in sys.stdin: # Explicitly iterate standard input line by line
    # `line` contains trailing newline!
    line = re.sub(r'^(.*?;.*?)(\w)', r'\1;\2', line)
    #print(line) # This print trailing newline
    sys.stdout.write(line) # Print the replaced string back.

网友

2楼 · 编辑于 2024-09-27 17:34:37

python正则表达式中s/pattern/replace/的replace指令是re.sub公司（pattern，replace，string）函数，或重新编译（模式）.sub（替换，字符串）。在您的情况下，您将这样做：

_re_pattern = re.compile(r"^(.*?;.*?)(\w)")
result = _re_pattern.sub(r"\1;\2", line)

注意，$1变成了{}。对于perl，您需要按照您想要的方式迭代行（open、inputfile、splitlines…）。在

相关问题更多 >

编程相关推荐

热门问题

热门文章