对检索到的文件中的数据进行迭代以提取各种值

EXAMPLEFILE.TXT: 1. [**] random text group A [**] 2. random number of lines of text 3. findMe 4. stufff...100-012 qwerty...499-217 5. [**] random text group B [**] 6. random lines of text 7. findMe 8. [**] random text group C [**] 9. random number of lines of text 10. findMe 11. stufff...223-300 qwerty...888-888 zzzz...333-444 12. [**] continues......

import re def patternMatching(group, line): section = re.findall(group, line) for i in section: randText = re.search('\]\s(.*?)\[', i) result1 = randText.group(1) print result1 findMe = re.search('findMe', line) result2 = findMe.group() print result2 numbers = re.findall('(\d{3}\-\d{3})',line) print numbers randomTextgroup = re.compile(r'\*{2}\].*\[\*{2}\].*\[\*{2}\]', re.DOTALL|re.S) with open ("C:/Location/test.txt", 'r') as txt: data=txt.read().replace('\n','\r') a = randomTextgroup.findall(data) for i in a: patternMatching(randomTextgroup, i)

1条回答

网友

1楼 · 发布于 2024-10-04 07:31:58

我终于有了！：）感谢@AdamSmith和@sln的帮助和建议。事实上，主要因素是regex（@randomTextgroup）很贪婪。然后patternMatching（）中的额外for循环导致没有数据…一个for循环太多。不管怎样，谢谢你：）

randomTextgroup = re.compile(r'\*{2}\].*?\[\*{2}\].*?(?=\[\*{2}\])')


with open ("C:Location/test.txt", 'r') as txt:


    data=txt.read().replace('\n','\r')


    section = randomTextgroup.findall(data)


    for i in section:

        randText = re.search('\]\s(.*?)\[', i)
        test = randText.group(1)
        print test

        findMe = re.search('findMe', i)
        result2 = findMe.group()
        print result2

        numbers = re.findall('(\d{3}\-\d{3})', i)
        print numbers, '\n'

结果如下：

random text-A 
findMe
['100-012', '499-217'] 

random text-B 
findMe
[] 

random text-C 
findMe
['223-300', '888-888', '333-444']

注意：对于其他读取器，输入文件保持不变（如原文章所述）

相关问题更多 >

编程相关推荐

热门问题

热门文章