缩短Python鳕鱼

2024-09-28 21:57:35 发布

您现在位置:Python中文网/ 问答频道 /正文

我觉得这段Python代码可以大大缩短,但我几乎总是倾向于回到编写C风格的布局。你认为缩短时间的最好方法是什么?可读性是一种奖励,而不是要求。在

def compfiles(file1, file2):
    linecnt = 0
    for line1 in open(file1):
        line1 = line1.strip()
        hit = False
        for line2 in open(file2):
            line2 = line2.strip()
            if line2 == line1:
                hit = True
                break
        if not hit:
            print("Miss: file %s contains '%s', but file %s does not!" % (file1, line1, file2))
        linecnt += 1
    print("%i lines compared between %s and %s." % (linecnt, file1, file2))

fn = ["file1.txt", "file2.txt"]
compfiles(fn[0], fn[1])
compfiles(fn[1], fn[0])

Tags: inforifnotopenfile1file2fn
2条回答
def compfiles(file1, file2):
    with open(file1) as fin:
        set1 = set(fin)
    with open(file2) as fin:
        set2 = set(fin)
    ... # do some set operations

如果文件的顺序是重复的,则迭代1

^{pr2}$

您的代码效率极低,因为您open循环中的第二个文件迭代第一个文件。只需将第二个文件读入一个列表(或者更好的是,一个set,它提供了平均O(1)查找时间)并使用in运算符。另外,您的linecnt变量只计算file1中的行数-您可以将这些行读入一个列表中,然后调用该列表上的len来获得相同的行数:

def compfiles(file1, file2):
    lines1 = [l.strip() for l in open(file1).read().split("\n")]
    lines2 = set([l.strip() for l in open(file2).read().split("\n")])
    for line in lines1:
        if not line in lines2:
            print("Miss: file %s contains '%s', but file %s does not!" % (file1, line, file2))
    print("%i lines compared between %s and %s." % (len(lines1), file1, file2))

相关问题 更多 >