嵌套文件读取不会在所有主循环中循环

with open(db_file, 'r') as db, open(sortIn, 'r') as inF, open(out_file, 'w') as outF: for line in inF: for dbline in db: if len(dbline) > 1: dbline = dbline.split('\t') if line.rstrip('\n') == dbline[db_specifications[0]]: outF.write(dbline[db_specifications[0]] + ': ' + dbline[db_specifications[1]] + '\n') break

2条回答

网友

1楼 · 编辑于 2024-10-02 08:20:28

我可以通过插入以下行来解决问题：
line = next(inF) 在break语句之前

网友

2楼 · 编辑于 2024-10-02 08:20:28

由于db_文件中的数据按第二列排序，因此可以使用此代码读取该文件

with open("xyz.txt", "r") as db_file, open("abc.txt", "r") as sortIn, open("out.txt", 'w') as outF:

    #first read the sortIn file as a list
    i_list = [line.strip() for line in sortIn.readlines()]

    #for each record read from the file, split the values into key and value
    for line in db_file:
        t_key,t_val = line.strip().split(' ')

        #if value is in i_list file, then write to output file
        if t_val in i_list: outF.write(t_val + ': ' + t_key + '\n')

        #if value has reached the max value in sort list
        #then you don't need to read the db_file anymore
        if t_val == i_list[-1]: break

输出文件将包含以下项：

NG_002527: RPL26P4
NG_006062: VN2R13P

在上面的代码中，我们必须先阅读排序列表。然后读取db_文件中的每一行。i_list[-1]将具有sortIn file的最大值，因为sortIn文件也按升序排序

与下面的代码相比，上面的代码具有更少的i/o

=========== 以前提交的答复：

根据数据在db_文件中的存储方式，我们似乎必须读取整个文件，以对照sortIn文件进行检查。如果db_文件中的值按第二列排序，那么一旦找到sortIn中的最后一项，我们就可以停止读取该文件

假设我们需要从文件中读取所有记录，看看下面的代码是否适合您

with open("xyz.txt", "r") as db_file, open("abc.txt", "r") as sortIn, open("out.txt", 'w') as outF:

    #read the db_file and convert it into a dictionary
    d_list = dict([line.strip().split(' ') for line in db_file.readlines()])

    #read the sortIn file as a list
    i_list = [line.strip() for line in sortIn.readlines()]

    #check if the value of each value in d_list is one of the items in i_list
    out_list = [v + ': '+ k for k,v in d_list.items() if v in i_list]

    #out_list is your final list that needs to be written into a file
    #now read out_list and write each item into the file
    for i in out_list:
        outF.write(i + '\n')

输出文件将包含以下项：

NG_002527: RPL26P4
NG_006062: VN2R13P

为了帮助您，我还打印了d_列表、i_列表和out_列表中的内容

d_列表中的内容如下所示：

{'RPL24P3': 'NG_002525', 'RPLP1P1': 'NG_002526', 'RPL26P4': 'NG_002527', 'VN2R11P': 'NG_006060', 'VN2R12P': 'NG_006061', 'VN2R13P': 'NG_006062', 'VN2R14P': 'NG_006063'}

i_列表中的内容如下所示：

['NG_002527', 'NG_006062']

从out_列表写入out文件的内容如下所示：

['NG_002527: RPL26P4', 'NG_006062: VN2R13P']

相关问题更多 >

编程相关推荐

热门问题

热门文章