如何比较python中的两个HTML文件并只打印它们之间的差异？

import difflib file1 = open('sonarlint-report.html', 'r').readlines() file2 = open('sonarlint-report_latest.html', 'r').readlines() htmlDiffer = difflib.HtmlDiff() htmldiffs = htmlDiffer.make_file(file1, file2) with open('comparison.html', 'w') as outfile: outfile.write(htmldiffs)

2条回答

网友

1楼 · 编辑于 2024-10-08 20:20:15

如果使用difflib.Differ，则只能保留差异行，并使用每行上写入的两个字母代码进行过滤。从docs：

class difflib.Differ
This is a class for comparing sequences of lines of text, and producing human-readable differences or deltas. Differ uses SequenceMatcher both to compare sequences of lines, and to compare sequences of characters within similar (near-matching) lines.
Each line of a Differ delta begins with a two-letter code:
Code Meaning
'- ' line unique to sequence 1
'+ ' line unique to sequence 2
' ' line common to both sequences
'? ' line not present in either inputsequence
Lines beginning with ‘?’ attempt to guide the eye to intraline differences, and were not present in either input sequence. These lines can be confusing if the sequences contain tab characters

通过保持这些行以“-”和“+”开头，只是区别。在

网友

2楼 · 编辑于 2024-10-08 20:20:15

我将首先尝试逐行遍历每个html文件，并检查这些行是否相同。在

with open('file1.html') as file1, open('file2.html') as file2:
    for file1Line, file2Line in zip(file1, file2):
        if file1Line != file2Line:
            print(file1Line.strip('\n'))
            print(file2Line.strip('\n'))

您将不得不在一行中处理换行符和多行差异，但这可能是一个好的开始：）

相关问题更多 >

编程相关推荐

热门问题

热门文章

如何比较python中的两个HTML文件并只打印它们之间的差异？

相关问题 更多 >

编程相关推荐

热门问题

热门文章

相关问题更多 >