Python:从u打开一些引用

2024-09-29 20:20:31 发布

您现在位置:Python中文网/ 问答频道 /正文

我有urlhttps://cars.mail.ru/reviews/renault/?year=2010-2016 我应该从那里开门

https://cars.mail.ru/reviews/renault/sandero_stepway/2015/143355/
https://cars.mail.ru/reviews/renault/sandero/2015/147850/
https://cars.mail.ru/reviews/renault/sandero/2012/147529/
https://cars.mail.ru/reviews/renault/duster/2014/147433/
https://cars.mail.ru/reviews/renault/logan/2011/146991/
https://cars.mail.ru/reviews/renault/duster/2015/146645/

我需要打开所有这些链接和旁边的下一页,并打开那里的所有链接。 我怎么能快点? 如果我使用

models = ['11', '12', '14', '15', '16', '17', '18', '19', '20', '21', '25', '30', '4', '5', '6', '9',
     'avantime', 'clio', 'clio_rs', 'duster', 'espace', 'estafette', 'express', 'fluence',
     'fuego', 'grand_espace', 'grand_scenic', 'kangoo', 'kaptur', 'koleos', 'laguna', 'latitude',
     'logan', 'mascott', 'master', 'megane', 'megane_rs', 'modus', 'safrane', 'sandero', 'sandero_stepway',
     'scenic', 'symbol', 'trafic', 'twingo', 'vel_satis']
years = ['2010', '2011', '2012', '2013', '2014', '2015', '2016']
pattern = 'https://cars.mail.ru/reviews/renault/'

for model in models:
    for year in years:
        for i in range(143350, 143360):
            res = pattern + model + '/' + year + '/' + str(i)
            try:
                page = urllib2.urlopen(res).read()
                print page
                soup = BeautifulSoup(page, 'html.parser')
            except:
                continue

太费时了


Tags: inhttpsfor链接rupagemailcars
1条回答
网友
1楼 · 发布于 2024-09-29 20:20:31

你在做什么

len(models) * len(years) * (143360 - 143350) # 3220

HTTP请求。如果每一次只需要一秒钟,你就忙了将近一个小时

你可以试试^{}

相关问题 更多 >

    热门问题