使用python处理文本文件作为一个大字符串，并迭代每个{}在文件中

hermione [ { keyword.adsf keyword.bsdf }, { keyword.1sdf keyword.bsd keyword.bsd } ] ron [ { keyword.adsf keyword.bsdf }, { keyword.1sdf keyword.bsd keyword.bsd } ]

2条回答

网友

1楼 · 编辑于 2024-05-21 18:58:56

下面是一种使用纯Python的方法。如果您需要更复杂的处理而不是仅仅计算事物，这可能很方便。你知道吗

import sys

def prn(s):
    sys.stdout.write(str(s))

def _parse_list(f, line):
    if line.strip() != '{':
        raise ValueError("list part must start with '{'")
    prn(line)

    count = 0
    found_list_end = False
    for line in f:
        if line.strip().startswith('}'):
            found_list_end = True
            break
        count += 1
    if not found_list_end:
        raise ValueError("list part must end with '}'")
    prn("    {}\n".format(count))
    prn(line)


def parse_section(f):
    found_section_start = False
    for line in f:
        prn(line)
        words = line.split()
        if len(words) == 2 and words[1] == '[':
            found_section_start = True
            break
    if not found_section_start:
        return False  

    for line in f:
        if line.strip() == ']':
            prn(line)
            return True
        _parse_list(f, line)
    return True

with open("data.txt", "rt") as f:
    while parse_section(f):
        pass

网友

2楼 · 编辑于 2024-05-21 18:58:56

使用正则表达式，可以执行以下操作：

import re

contents_of_each = re.findall('{([^}]+?)}', txt, re.DOTALL)

这将为您提供一个包含{}之间每个字符串的列表

工作原理：它先搜索一个开放卷曲，然后搜索一个或多个非封闭卷曲的字符序列，然后搜索封闭卷曲，但只返回括号内的内容。你知道吗

re.DOTALL将换行符视为常规字符，匹配跨多行的卷曲对。你知道吗

相关问题更多 >

编程相关推荐

热门问题

热门文章

使用python处理文本文件作为一个大字符串，并迭代每个{}在文件中

相关问题 更多 >

编程相关推荐

热门问题

热门文章

相关问题更多 >