正则表达式以获取Python中多个换行符之间的文本

2条回答

网友

1楼 · 编辑于 2024-10-03 21:29:24

鉴于：

txt='''\
\n\nMy take on fruits.\n\nHealthy Fruits\nAn apple is a fruit and it\'s very good.\n\nPears are good as well. Bananas are very good too and healthy.\n\nSour Fruits\nOranges are on the sour side and contains a lot of vitamin C.\n\nGrapefruits are even more sour, if you can believe it.'''

desired=[('Healthy Fruits',   "An apple is a fruit and it's very good.", 'Pears are good as well. Bananas are very good too and healthy.'),  ('Sour Fruits',   'Oranges are on the sour side and contains a lot of vitamin C.', 'Grapefruits are even more sour, if you can believe it.')]

您可以使用正则表达式：

r'\n\n([\s\S]*?)(?=(?:\n\n.*\n[^\n])|\Z)'

Demo

Python演示：

>>> sp=[tuple(re.split('\n+',l)) for l in re.findall(r'\n\n([\s\S]*?)(?=(?:\n\n.*\n[^\n])|\Z)',txt) if '\n' in l]

>>> sp
[('Healthy Fruits', "An apple is a fruit and it's very good.", 'Pears are good as well. Bananas are very good too and healthy.'), ('Sour Fruits', 'Oranges are on the sour side and contains a lot of vitamin C.', 'Grapefruits are even more sour, if you can believe it.')]

>>> sp==desired
True

网友

2楼 · 编辑于 2024-10-03 21:29:24

这不是正则表达式，但它可以工作：

text="\n\nMy take on fruits.\n\nHealthy Fruits\nAn apple is a fruit and it\'s very good. Bananas are very good too and healthy.\n\nSour Fruits\nOranges are on the sour side and contains a lot of vitamin C.\n\nGrapefruits are even more sour, if you can believe it."
    NewList=[]
    Newtext=text.split("\n\n")
    for line in Newtext:
        if line.find("\n")>=0:
            NewList.extend(line.split('\n'))
    
    NewList[len(NewList)-1]=str(NewList[len(NewList)-1])+str(Newtext[len(Newtext)-1])

相关问题更多 >

编程相关推荐

热门问题

热门文章

正则表达式以获取Python中多个换行符之间的文本

相关问题 更多 >

编程相关推荐

热门问题

热门文章

相关问题更多 >