改进regex搜索模式

''' MFTF2LH_LSetC1_D-10_hot50_fa00_bpmax MFTF2LH_LSetC1_D-11_hot50_fa00_bpmax MFTF2LH_LSetC1_D-01_hot56_fa00_bpmax MFTF2LH_LSetC1_D-02_hot56_fa00_bpmax MFTF2LH_LSetC1_D-03_hot56_fa00_bpmax MFTF2LH_LSetC1_D-04_hot50_fa00_bpmax MFTF2LH_LSetC1_D-07_hot43_fa00_bpmax MFTF2LH_LSetC1_D-10_hot56_fa00_bpmax '''

2条回答

网友
1楼 · 编辑于 2024-09-30 04:39:40

我将grep与-v（还原匹配项）一起使用：
grep -Ev "D-[0][1-7]_hot(?:43|50)|D-(?:08|09|10|11)_hot56" raw.txt > filtered.txt
它完全匹配您不需要的内容，然后还原匹配项

网友
2楼 · 编辑于 2024-09-30 04:39:40

您可以通过使备选方案仅在字符串中的不同位置匹配来改进您的模式
使用
rx = re.compile(r'_D-(?:1[01]_hot56|0(?:[89]_hot56|[1-7]_hot(?:43|50)))') # .... Read the file line by line ... if not rx.search(line): # Ok, process
参见regex demo
图案细节：
_D--文字子串
(?:-非捕获组的开始（与捕获组不同，没有为组创建内存缓冲区）匹配：
1[01]_hot56-1，然后0或1，然后_hot56
|-或
0-a0字符，然后
(?:-第二个非捕获组
[89]_hot56-8或9然后_hot56
|或
[1-7]_hot(?:43|50)—从1到7的数字，然后是_hot，然后是43或50
)-第二个非捕获组的结尾
)-第一个非捕获组的结尾

相关问题更多 >

编程相关推荐

热门问题

热门文章