如何使我的正则表达式匹配在前瞻后停止？问题的回答

如何使我的正则表达式匹配在前瞻后停止？

回答此问题可获得 20 贡献值，回答如果被采纳可获得 50 分。

我有一些pdf文件中的文本，我想把它分成一个字符串，这样我就有了一个列表，其中每个字符串都以一个数字和一个句点开头，然后在下一个数字之前停止 例如，我想将此转换为： <pre><code>'3.1 First liens 15,209,670,396 0 15,209,670,396 14,216,703,858 3.2 Other than first liens 0 0 4. Real estate: 4.1 Properties occupied by the company (less $ 43,332,898 encumbrances) 68,122,291 0 68,122,291 64,237,046 4.2 Properties held for the production of income (less $ encumbrances) 0 0 4.3 Properties held for sale (less $ encumbrances) 0 0 5. Cash ($ (101,130,138)), cash equivalents ($ 850,185,973 ) and short-term investments ($ 0 ) 749,055,835 0 749,055,835 1,867,997,055 6. Contract loans (including $ premium notes) 253,533,676 0 253,533,676 233,680,271 7. Derivatives 3,194,189,871 0 3,194,189,871 2,390,781,023 8. Other invested assets 749,074,191 11,899,360 737,174,831 692,916,503' </code></pre> 为此： <pre><code>['3.1 First liens 15,209,670,396 0 15,209,670,396 14,216,703,858 ', '3.2 Other than first liens 0 0 ', '4. Real estate:', '4.1 Properties occupied by the company (less $ 43,332,898 encumbrances) 68,122,291 0 68,122,291 64,237,046', '4.2 Properties held for the production of income (less $ encumbrances) 0 0' '4.3 Properties held for sale (less $ encumbrances) 0 0', '5. Cash ($ (101,130,138)), cash equivalents ($ 850,185,973 ) and short-term investments ($ 0 ) 749,055,835 0 749,055,835 1,867,997,055', '6. Contract loans (including $ premium notes) 253,533,676 0 253,533,676 233,680,271', '7. Derivatives 3,194,189,871 0 3,194,189,871 2,390,781,023', '8. Other invested assets 749,074,191 11,899,360 737,174,831 692,916,503'] </code></pre> 问题是原始字符串在名称的中间散布“\n”（例如，在4.1个单词中，在单词后缀之前有一个\n）。 <pre><code>(\d+\.[\s\S]*(?!\d+\.)) </code></pre> 这是我一直尝试使用的正则表达式，但它匹配整个字符串而不是每个数字行。我的正则表达式有没有办法在下一个数字行之前停止匹配

0 条评论
分类：Python问答

默认排序时间排序

1 个回答

匿名 1天前

　擅长：python、mysql、java

如何使我的正则表达式匹配在前瞻后停止？

1 个回答

相关Python问题