<p>假设html是python字符串(在代码中<code>html_string</code>是<code>str(htmlresponse.text)</code>):</p>
<pre class="lang-py prettyprint-override"><code>html_string = '''<html>
<head>
<! <link rel="icon" href="/sort/this/later.jpg" type="image/x-icon" /> >
<title>foo</title>
<meta name="description" content="bar" />
'''
</code></pre>
<p>按换行符<code>html_string.split('\n')</code>拆分它将输出:</p>
<pre class="lang-py prettyprint-override"><code>['<html>',
'',
' <head>',
'',
' <! <link rel="icon" href="/sort/this/later.jpg" type="image/x-icon" /> >',
'',
' <title>foo</title>',
'',
' <meta name="description" content="bar" />',
'']
</code></pre>
<p>这段代码将提取列表中的每个字符串,如果字符串的长度为<code>> 0</code>,则将其保留</p>
<pre class="lang-py prettyprint-override"><code>list1 = [line for line in html_string.split('\n') if len(line) > 0]
</code></pre>
<p>或更紧凑:</p>
<pre class="lang-py prettyprint-override"><code>list1 = [line for line in html_string.split('\n') if line]
</code></pre>
<p>这将给你:</p>
<pre class="lang-py prettyprint-override"><code>['<html>',
' <head>',
' <! <link rel="icon" href="/sort/this/later.jpg" type="image/x-icon" /> >',
' <title>foo</title>',
' <meta name="description" content="bar" />']
</code></pre>
<p>但是<code>list1</code>是一个列表。要将其转换回字符串,您需要:</p>
<pre class="lang-py prettyprint-override"><code>new_html_string = '\n'.join(list1)
</code></pre>
<p>打印<code>new_html_string</code>将为您提供:</p>
<pre class="lang-py prettyprint-override"><code><html>
<head>
<! <link rel="icon" href="/sort/this/later.jpg" type="image/x-icon" /> >
<title>foo</title>
<meta name="description" content="bar" />
</code></pre>
<p>总而言之:</p>
<pre class="lang-py prettyprint-override"><code>html_string = '''<html>
<head>
<! <link rel="icon" href="/sort/this/later.jpg" type="image/x-icon" /> >
<title>foo</title>
<meta name="description" content="bar" />
'''
list1 = [line for line in html_string.split('\n') if line]
new_html_string = '\n'.join(list1)
</code></pre>