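<p>The data you need is easy to find in the <code>td</code> tags whose class name is <code>titleColumn</code>. You can extract the movie title and link from there.</p>
<p>Here I show the first 10 movies. You can modify this code to suit your requirements.</p>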
<pre><code>import requests
import bs4 as bs

url = "https://www.imdb.com/chart/top"
response = requests.get(url)
response.raise_for_status()  # stop early if the request failed
html = response.text

soup = bs.BeautifulSoup(html, 'lxml')
# Each movie sits in a td cell with class "titleColumn"
t = soup.find_all('td', class_='titleColumn')

# Show the first 10 movies
for cell in t[:10]:
    a_tag = cell.find('a')
    # href already starts with "/", so the base URL has no trailing slash
    link = 'https://www.imdb.com' + a_tag['href']
    title = a_tag.text
    print(f'Link: {link}\nMovie: {title}\n')
</code></pre>
<pre><code>Sample Output:
Link: https://www.imdb.com/title/tt0111161/
Movie: The Shawshank Redemption

Link: https://www.imdb.com/title/tt0068646/
Movie: The Godfather

Link: https://www.imdb.com/title/tt0071562/
Movie: The Godfather: Part II

Link: https://www.imdb.com/title/tt0468569/
Movie: The Dark Knight
</code></pre>
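<p>If you want to try the extraction logic without hitting the network, here is a minimal offline sketch. Note the assumptions: it uses the standard library's <code>html.parser</code> instead of BeautifulSoup, <code>SAMPLE_HTML</code> is made-up markup shaped like the <code>titleColumn</code> cells described above (not copied from IMDb), and <code>TitleColumnParser</code> is a hypothetical helper name. Because the <code>href</code> values start with a slash, it joins them to the base URL with <code>urljoin</code>, which avoids a doubled slash in the result.</p>

```python
from html.parser import HTMLParser
from urllib.parse import urljoin

BASE_URL = "https://www.imdb.com/"

# Hypothetical markup that mimics the structure described above.
SAMPLE_HTML = """
<table>
  <tr><td class="titleColumn"><a href="/title/tt0111161/">The Shawshank Redemption</a></td></tr>
  <tr><td class="titleColumn"><a href="/title/tt0068646/">The Godfather</a></td></tr>
  <tr><td class="ratingColumn"><a href="/ignored/">not a title</a></td></tr>
</table>
"""


class TitleColumnParser(HTMLParser):
    """Collect (title, absolute link) pairs from anchors inside td.titleColumn."""

    def __init__(self):
        super().__init__()
        self.movies = []
        self._in_title_cell = False
        self._href = None  # set while we are inside a matching anchor
        self._text = []

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        if tag == "td":
            classes = (attrs.get("class") or "").split()
            self._in_title_cell = "titleColumn" in classes
        elif tag == "a" and self._in_title_cell and self._href is None:
            # Anchors without an href leave _href as None and are skipped.
            self._href = attrs.get("href")
            self._text = []

    def handle_data(self, data):
        if self._href is not None:
            self._text.append(data)

    def handle_endtag(self, tag):
        if tag == "a" and self._href is not None:
            title = "".join(self._text).strip()
            # urljoin resolves the root-relative href against the site root.
            self.movies.append((title, urljoin(BASE_URL, self._href)))
            self._href = None
        elif tag == "td":
            self._in_title_cell = False


parser = TitleColumnParser()
parser.feed(SAMPLE_HTML)
for title, link in parser.movies:
    print(f"Link: {link}\nMovie: {title}\n")
```

<p>The same <code>urljoin</code> call can replace the string concatenation in the BeautifulSoup version if you prefer not to think about where the slash goes.</p>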