Iterating over a .txt file in Python and splitting it into strings

==========
ID: 10001 Found:(4)
==========
MSG: ERR_ID - ***ERROR*** _errortexthere_
==========
ID: 10002 Found:(26)
==========
MSG: ERR_ID - ***ERROR*** _errortexthere_
line2
line3
line4
line5
==========
ID: 10003 Found:(15039)
==========
MSG: ERR_ID - ***ERROR*** _errortexthere_
etc1
etc2
etc3

2 Answers

User

Answer 1 · edited 2024-09-30 01:21:36

You should read the file line by line and check whether the first element of each line is "ID:".

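# 'with' closes the file automatically when the block ends
with open('workfile', 'r') as f:
    for line in f:
        arr = line.split()  # split on any run of whitespace
        # guard against blank lines, where arr is empty
        if arr and arr[0] == "ID:":
            pass  # do what you need to with this ID line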
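
As a rough illustration of that check, here is a minimal sketch that collects the ID from every matching line. The function name `collect_ids` and the in-memory sample are assumptions made for illustration; they are not part of the original answer.

```python
from io import StringIO


def collect_ids(lines):
    """Return the ID token from every line whose first field is 'ID:'."""
    ids = []
    for line in lines:
        fields = line.split()  # split on any run of whitespace
        # require at least two fields so fields[1] is safe to read
        if len(fields) >= 2 and fields[0] == "ID:":
            ids.append(fields[1])
    return ids


# Hypothetical sample in the same layout as the question's file.
sample = StringIO(
    "==========\n"
    "ID: 10001      Found:(4)\n"
    "==========\n"
    "MSG: ERR_ID  - ***ERROR*** _errortexthere_\n"
)
print(collect_ids(sample))
```

Because the function accepts any iterable of lines, an open file object can be passed in place of the `StringIO` sample.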

User

Answer 2 · edited 2024-09-30 01:21:36

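Below is a very simple implementation. It only detects lines that start with the exact string "ID: ", and it ignores blank lines and lines that consist solely of ==========.

It saves the lines that follow each ID: line into a dictionary whose keys are the ID strings.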

from io import StringIO
from pprint import pprint

# In-memory stand-in for the real file. StringIO is used because the
# sample is text (str); BytesIO would reject a str in Python 3.
infile = StringIO("""
==========
ID: 10001      Found:(4)
==========
MSG: ERR_ID  - ***ERROR*** _errortexthere_

==========
ID: 10002      Found:(26)
==========
MSG: ERR_ID  - ***ERROR*** _errortexthere_
line2
line3
line4
line5
""")


buffer = ""
d = {}
current_id = None  # named current_id to avoid shadowing the built-in id()

for line in infile:
    if line.rstrip() in ("==========", ""):
        # skip blank lines or delimiting lines
        pass
    elif line.startswith("ID: "):
        # save the buffer we've been collecting to the dictionary...
        if current_id is not None:
            d[current_id] = buffer

        # ... and start collecting new lines
        current_id = line.split()[1]
        buffer = ""
    else:
        buffer += line

# save whatever lines are left over after the last `ID:`
if current_id is not None:
    d[current_id] = buffer

# a wider width keeps each value on a single line under Python 3
pprint(d, width=100)

Output:

{'10001': 'MSG: ERR_ID  - ***ERROR*** _errortexthere_\n',
 '10002': 'MSG: ERR_ID  - ***ERROR*** _errortexthere_\nline2\nline3\nline4\nline5\n'}
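
If the Found count on each ID line is also needed, one possible variation is to match the header with a regular expression. This is only a sketch: the names `HEADER` and `parse_records` are hypothetical, and the pattern assumes every header looks like `ID: <id>  Found:(<number>)`, as in the sample. Unlike the code above, it stores each record's lines as a list with surrounding whitespace stripped.

```python
import re
from io import StringIO

# Matches header lines such as "ID: 10002      Found:(26)".
HEADER = re.compile(r"^ID:\s+(\S+)\s+Found:\((\d+)\)")


def parse_records(lines):
    """Group the text after each 'ID:' header, keeping the Found count too."""
    records = {}
    current = None
    for line in lines:
        stripped = line.strip()
        if not stripped or stripped == "==========":
            continue  # skip blank lines and delimiter lines
        match = HEADER.match(line)
        if match:
            # start a new record keyed by the ID string
            current = {"found": int(match.group(2)), "lines": []}
            records[match.group(1)] = current
        elif current is not None:
            # any other line belongs to the most recent record
            current["lines"].append(stripped)
    return records


# Hypothetical sample in the same layout as the question's file.
sample = StringIO("""\
==========
ID: 10003      Found:(15039)
==========
MSG: ERR_ID  - ***ERROR*** _errortexthere_
etc1
etc2
""")
for record_id, record in parse_records(sample).items():
    print(record_id, record["found"], record["lines"])
```

Lines that appear before the first ID header are dropped, which matches how the dictionary version above discards text collected before any ID is seen.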
