擅长:python、mysql、java
<p>我将向您推荐一种相当简单的方法</p>
<pre><code>import requests
from bs4 import BeautifulSoup as bs
page = requests.get('https://www.todayonline.com/googlenews.xml').content
soup = bs(page)
news = [i.text for i in soup.find_all('news:title')]
print(news)
</code></pre>
<p>输出</p>
<pre><code>['DBS named world’s best bank by New York-based financial publication',
'Russia has very serious questions to answer on Navalny - UK',
"Exclusive: 90% of China's Sinovac employees, families took coronavirus vaccine - CEO",
'Three militants killed after fatal attack on policeman in Tunisia',
.....]
</code></pre>
<p>此外,如果需要,还可以查看XML页面以获取更多信息</p>
<p>p.S.在清理<strong>任何网站之前,始终检查合规性:)</p>