Replacing characters in a JSON file with Python: editing fails because the file is too large (over 1 GB)

2 answers

User

#1 · edited 2024-10-16 20:42:14

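To cope with $ characters that may appear inside strings in the JSON objects, you can split the input string data1 on $ into fragments and append the fragments one by one to a candidate string until it parses as JSON. At that point, output the candidate and clear it before moving on to the next fragment: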

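import json

candidate = ''
for fragment in data1.split('$'):
    candidate += fragment
    try:
        # A successful parse means candidate is one complete JSON object
        json.loads(candidate)
        print(candidate)
        candidate = ''
    except json.JSONDecodeError:
        # Incomplete: the '$' we split on was inside a string, so restore it
        candidate += '$'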

For example, given data1 = '''{}${"a":"$"}${"b":{"c":2}}''', this outputs:

{}
{"a":"$"}
{"b":{"c":2}}

User

#2 · edited 2024-10-16 20:42:14

The problem is probably a.readlines(), because it loads the entire file into memory. For large files it is better to read line by line, like this:

with open(fname) as f:
    for line in f:
        # Do your magic here, on this loop
        pass
# No need to close it, since the `with` will take care of that.

If your goal is to replace every $ with \n, it would look like this:

with open(fname, "r+") as f: 
    for line in f:
        line.replace("$", "\n")
