如何在python中解析文件并写入输出文件

EGW05759 Pld5 I79_005987 GO_function: GO:0003824 - catalytic activity [Evidence IEA]; GO_process: GO:0008152 - metabolic process [Evidence IEA] EGW05760 Exo1 I79_005988 GO_function: GO:0003677 - DNA binding [Evidence IEA]; GO_function: GO:0003824 - catalytic activity [Evidence IEA]; GO_function: GO:0004518 - nuclease activity [Evidence IEA]; GO_process: GO:0006281 - DNA repair [Evidence IEA]

2条回答

网友

1楼 · 编辑于 2024-10-02 18:21:08

使用csv模块：

import csv, re

with open('test_parsing.txt', 'rU') as infile, open('test_parsing_out.txt', 'a') as outfile:
    reader = csv.reader(infile, delimiter="\t")
    for line in reader:
        result = line[1] + " " + ':'.join(re.findall("GO:\d{6}", line[3]))
        outfile.write(result + "\n")

# OUTPUT
Pld5 GO:000382:GO:000815
Exo1 GO:000367:GO:000382:GO:000451:GO:000628

网友

2楼 · 编辑于 2024-10-02 18:21:08

f = open('test_parsing.txt', 'rU')
f1 = open('test_parsing_out.txt', 'a')
for line in f:
    match = re.search('\w+\s+(\w+)\s+\w+\s+\w+\:', line)
    match1 = re.findall('GO:\d+', line)
    f1.write('%s %s \n'%(match.group(1), ''.join(match1)))
f1.close()

相关问题更多 >

编程相关推荐

热门问题

热门文章

如何在python中解析文件并写入输出文件

相关问题 更多 >

编程相关推荐

热门问题

热门文章

相关问题更多 >