擅长:python、mysql、java
<p>要提取动态生成(使用javascript)的内容,可以使用<a href="http://code.google.com/p/selenium/" rel="nofollow">selenium</a>:</p>
<pre><code>#!/usr/bin/env python
from contextlib import closing
from selenium.webdriver import Firefox # pip install selenium
url = "http://busymovies.appspot.com/News.html?id=2965032"
# use firefox to get page with javascript generated content
with closing(Firefox()) as browser:
browser.get(url)
link = browser.find_element_by_link_text("Direct Link")
print link.get_attribute("href")
</code></pre>
<h3>输出</h3>
^{pr2}$