How do I process information from multiple lines in Python?

>WB02	F27C8.1
IV	B-9641
>WB03	F07C3.7
>WB04	F52H2.2
>WB04	F52H2.2
>WB05	T13A10.10
IV	B-15643
IV	B-11650
IV	B-13649

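3 Answers

User

Answer 1 · edited 2024-10-01 13:34:10

It is enough to keep track of the last line that starts with '>'. You can adjust the script depending on how robust it needs to be against invalid input: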

#!/usr/bin/env python
import fileinput

last = None
for line in fileinput.input():
    mark, sep, value = line.partition('\t')
    if not sep:
        continue  # skip lines without a tab
    if mark.startswith('>WB'):
        last = value.strip()
    elif mark.strip() == 'IV':
        print('%s\t%s' % (last, value.strip()))

Output

F27C8.1 B-9641
T13A10.10   B-15643
T13A10.10   B-11650
T13A10.10   B-13649
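
One way to make the loop above easier to test is to wrap it in a generator that accepts any iterable of lines. The following is a minimal sketch, not part of the original answer: the function name `pair_with_last_header` is hypothetical, and the extra `last is not None` check is an assumed robustness tweak that skips `IV` lines appearing before any header.

```python
def pair_with_last_header(lines):
    """Yield (header_value, child_value) pairs from tab-separated lines."""
    last = None  # value of the most recent '>' header line
    for line in lines:
        mark, sep, value = line.partition('\t')
        if not sep:
            continue  # skip lines without a tab
        if mark.startswith('>'):
            last = value.strip()
        elif mark.strip() == 'IV' and last is not None:
            yield last, value.strip()


# The sample input from the question, one record per line.
sample = [
    ">WB02\tF27C8.1\n",
    "IV\tB-9641\n",
    ">WB03\tF07C3.7\n",
    ">WB04\tF52H2.2\n",
    ">WB04\tF52H2.2\n",
    ">WB05\tT13A10.10\n",
    "IV\tB-15643\n",
    "IV\tB-11650\n",
    "IV\tB-13649\n",
]

pairs = list(pair_with_last_header(sample))
for header, child in pairs:
    print('%s\t%s' % (header, child))
```

Because the function only needs an iterable, it should work equally with an open file object or with `fileinput.input()`.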

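User

Answer 2 · edited 2024-10-01 13:34:10

Sure, this can be done by reading the file incrementally. You only need one variable that holds the last marked line you have seen. So, something like this: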

with open("input.txt") as f:
    lastmarkedline = None  # most recent line that started with '>'
    for line in f:
        if line.startswith('>'):
            lastmarkedline = line
        elif lastmarkedline is not None and line.strip():
            # take the second whitespace-separated field of each line
            field1 = lastmarkedline.split()[1]
            field2 = line.split()[1]
            print("{0}\t{1}".format(field1, field2))
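
If the child values need to be collected per header instead of printed as they are read, the same single-variable idea can feed a dictionary. This is a sketch under assumptions rather than part of the answer: the name `group_children` is hypothetical, headers are keyed by their second field, and repeated headers (such as the two `>WB04` lines) are merged into one entry.

```python
def group_children(lines):
    """Map each header's second field to the list of child values that follow it."""
    groups = {}
    current = None  # second field of the most recent '>' line
    for line in lines:
        fields = line.split()
        if len(fields) < 2:
            continue  # skip blank or single-field lines
        if line.startswith('>'):
            current = fields[1]
            groups.setdefault(current, [])  # keep headers that have no children
        elif current is not None:
            groups[current].append(fields[1])
    return groups


sample = [
    ">WB02\tF27C8.1\n",
    "IV\tB-9641\n",
    ">WB03\tF07C3.7\n",
    ">WB04\tF52H2.2\n",
    ">WB04\tF52H2.2\n",
    ">WB05\tT13A10.10\n",
    "IV\tB-15643\n",
    "IV\tB-11650\n",
    "IV\tB-13649\n",
]

groups = group_children(sample)
for header, children in groups.items():
    print("{0}\t{1}".format(header, ",".join(children)))
```

Unlike the streaming version, this keeps headers with no children, which makes it possible to see which records were empty.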

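User

Answer 3 · edited 2024-10-01 13:34:10

You can process the file line by line, checking whether each line starts with '>'. When you reach a line that starts with '>', capture the value in its second column. For lines that do not start with '>', output the last captured value together with the associated sub-value.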

with open('data.txt', 'r') as f:
    lastHeader = ''
    for line in f:
        pieces = line.split('\t')
        if len(pieces) < 2:
            continue  # skip blank lines and lines without a tab
        if line.startswith('>'):
            lastHeader = pieces[1].strip()
        else:
            print("%s  \t  %s" % (lastHeader, pieces[1].strip()))
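
Note that this answer splits on tabs with `split('\t')`, while the previous answer uses a bare `split()`, which splits on any whitespace. The two agree on the question's data but can differ when a field contains a space. The short sketch below illustrates this with a made-up line; the value `T13A10.10 variant` is hypothetical and does not come from the question.

```python
# Hypothetical header line whose value contains a space (not from the question's data).
line = ">WB05\tT13A10.10 variant\n"

by_whitespace = line.split()[1]        # splits on any run of whitespace
by_tab = line.split('\t')[1].strip()   # splits on tabs only, then trims the newline

print(repr(by_whitespace))
print(repr(by_tab))
```

If the fields are guaranteed to be tab-separated, splitting on `'\t'` is probably the safer choice, since it keeps embedded spaces intact.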
