擅长:python、mysql、java
<p>这个问题是一年前提出的,但有人可能会通过谷歌找到这个问题。在</p>
<p>你可以用“a.article_html”获取文章文本中的图像和其他html。在</p>
<pre><code>from newspaper import Article
a = Article('https://www.nytimes.com/2019/04/25/us/politics/joe-biden-anita-hill.html',
keep_article_html=True,
language='en')
a.download()
a.parse()
print(a.html) # This article's unchanged and raw HTML
print(a.article_html) # The HTML of this article's main node
</code></pre>
<p>记住参数“keep_article_html=True”</p>