Unshortening flic.kr URLs

import urlparse
import httplib


def unshorten_url(url, max_tries=10):
    return __unshorten_url(url, [], max_tries)


def __unshorten_url(url, check_urls, max_tries):
    if max_tries == 0:
        if len(check_urls) > 0:
            return check_urls[0]
        return url
    if url in check_urls:
        return url
    unshortended = ''
    try:
        parsed = urlparse.urlparse(url)
        h = httplib.HTTPConnection(parsed.netloc)
        h.request('HEAD', url)
    except:
        return None
    try:
        response = h.getresponse()
    except:
        return url
    if response.status/100 == 3 and response.getheader('Location'):
        unshortended = response.getheader('Location')
    else:
        return url
    #print max_tries, unshortended
    if unshortended != url:
        if 'http' not in unshortended:
            return url
        check_urls.append(url)
        return __unshorten_url(unshortended, check_urls, (max_tries-1))
    else:
        return unshortended


print unshorten_url('http://t.co/5skmePb7gp')

1 Answer

User

#1 · Posted on 2024-09-26 22:52:53

I use Requests [0] instead of httplib, as shown here. It handles URLs such as https://flic.kr/p/qf3mGd well:

>>> import requests
>>> requests.head("https://flic.kr/p/qf3mGd", allow_redirects=True, verify=False).url
u'https://www.flickr.com/photos/106783633@N02/15911453212/'
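
The same idea can be wrapped in a small helper that also reports the intermediate hops. This is only a minimal sketch, written for Python 3 rather than the Python 2 used above. The function name `unshorten`, its `timeout` default, and the optional `session` parameter are assumptions made for illustration and are not part of the original answer. It leaves certificate verification at the Requests default instead of passing `verify=False`.

```python
import requests


def unshorten(url, timeout=10, session=None):
    """Follow redirects with a HEAD request; return (final_url, hops).

    `unshorten`, `timeout` and `session` are illustrative names,
    not something defined by the original answer.
    """
    if session is None:
        session = requests.Session()
    # HEAD requests do not follow redirects by default in Requests,
    # so allow_redirects=True has to be passed explicitly.
    response = session.head(url, allow_redirects=True, timeout=timeout)
    # response.history lists the intermediate redirect responses,
    # oldest first; response.url is the URL of the final response.
    hops = [r.url for r in response.history]
    return response.url, hops
```

Calling `unshorten("https://flic.kr/p/qf3mGd")` should return the final URL together with the list of URLs that redirected along the way. Requests raises `requests.exceptions.TooManyRedirects` once its redirect limit is exceeded, which covers the role of `max_tries` in the question's code.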

[0] http://docs.python-requests.org/en/latest/
