<p>You can unescape the HTML escape sequences in the file line by line, using the method shown <a href="https://stackoverflow.com/a/2087433/1258041">here</a>.</p>
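<pre><code>import html
import urllib.request

# `link` is the URL of the page to download; it is assumed to be defined elsewhere.
with urllib.request.urlopen(link) as fin, open(
        "file.html", "w", encoding="utf-8") as fout:
    for line in fin:
        # html.unescape replaces HTMLParser().unescape, which was removed in Python 3.9.
        fout.write(html.unescape(line.decode("utf-8")))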
</code></pre>
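<p>To see what the unescaping step does without touching the network, here is a minimal sketch that runs the standard library's <code>html.unescape</code> over a couple of in-memory lines. The helper name <code>unescape_lines</code> and the sample strings are made up for illustration and are not part of the linked answer.</p>

```python
import html


def unescape_lines(lines):
    """Yield each line with HTML character references replaced by the characters they stand for."""
    for line in lines:
        yield html.unescape(line)


# Hypothetical sample input standing in for lines read from a downloaded page.
sample = [
    "&lt;p&gt;Caf&eacute; &amp; bar&lt;/p&gt;\n",
    "&quot;quoted&quot; &#39;text&#39;\n",
]

result = list(unescape_lines(sample))
for line in result:
    print(line, end="")
```

<p>Named references (<code>&amp;eacute;</code>), numeric references (<code>&amp;#39;</code>) and the basic markup escapes are all handled by the same call. Unescaping is a single pass, so text that was escaped twice would need a second call.</p>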