擅长:python、mysql、java
<p>您可以使用<a href="http://code.google.com/p/httplib2/" rel="noreferrer">HttpLib2</a>获取URL的实际位置:</p>
<pre><code>import httplib2
def getContentLocation(link):
h = httplib2.Http(".cache_httplib")
h.follow_all_redirects = True
resp = h.request(link, "GET")[0]
contentLocation = resp['content-location']
return contentLocation
if __name__ == '__main__':
link = 'http://podcast.at/podcast_url344476.html'
print getContentLocation(link)
</code></pre>
<p>执行过程如下:</p>
<pre><code>$ python2.7 getContentLocation.py
http://keyinvest.podcaster.de/8uhr30.rss
</code></pre>
<p>注意这个例子也使用缓存(urllib和httplib都不支持缓存)。因此,这将反复明显加快运行速度。这可能对爬行/抓取很有意思。如果不需要缓存,请将<code>h = httplib2.Http(".cache_httplib")</code>替换为<code>h = httplib2.Http()</code>。</p>