使用BeautifulSoup对IMDb页面进行web浏览问题的回答

使用BeautifulSoup对IMDb页面进行web浏览

回答此问题可获得 20 贡献值，回答如果被采纳可获得 50 分。

我对WebScraping/Python和beauthoulsoup还不熟悉，我的代码很难正常工作。在 我想通过url:<a href="http://m.imdb.com/feature/bornondate" rel="nofollow">http://m.imdb.com/feature/bornondate</a>“获取： <ul> <li>名人的名字</li> <li>名人形象</li> <li>专业</li> <li>最好的作品</li> </ul> 为那一页上的十位名人。我不知道我做错了什么。在 这是我的代码： <pre><code>import urllib2 from bs4 import BeautifulSoup url = 'http://m.imdb.com/feature/bornondate' test_url = urllib2.urlopen(url) readHtml = test_url.read() test_url.close() soup = BeautifulSoup(readHtml) # Using it track the number of Actor count = 0 # Fetching the value present within tag results person = soup.findChildren('section', 'posters list') # Changing the person into an iterator iterperson = iter(person[0].findChildren('a')) # Finding 'a' in iterperson. Every 'a' tag contains information of a person for a in iterperson: imgSource = a.find('img')['src'].split('._V1.')[0] + '._V1_SX214_AL_.jpg' person = a.findChildren('div', 'label') title = person[0].find('span', 'title').contents[0] ##profession = person[0].find('div', 'detail').contents[0].split(,) ##bestWork = person[0].find('div', 'detail').contents[1].split(,) print '*******************************IMDB People Born Today***********************************' # Printing the S.No of the person print 'S.No. --> ', count += 1 print count # Printing the title/name of the person print 'Title --> ' + title # Printing the Image Source of the person print 'Image Source --> ', imgSource # Printing the Profession of the person ##print 'Profession --> ', profession # Printing the Best work of the person ##print 'Best Work --> ', bestWork </code></pre> 目前没有打印出来。还有，如果这是模糊的，你能解释一下如何做名人的名字，例如？在 下面是第一位名人的html代码，如果有帮助的话： ^{pr2}$

0 条评论
分类：Python问答

默认排序时间排序

1 个回答

匿名 1天前

　擅长：python、mysql、java

使用BeautifulSoup对IMDb页面进行web浏览

1 个回答

相关Python问题