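<p>Just to add to @mhawke's answer: rather than hard-coding the offset of the script tag, you can loop over all the script tags and pick the one whose contents match your pattern.</p>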
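<pre><code>import json
import re
from urllib.request import urlopen  # Python 3; Python 2 used urllib.urlopen
from bs4 import BeautifulSoup

web = urlopen("http://www.nasdaq.com/quotes/nasdaq-financial-100-stocks.aspx")
pattern = re.compile(r'var table_body = (.*?);')
soup = BeautifulSoup(web.read(), "lxml")

for script in soup.find_all('script'):
    # script.string is None for empty or external scripts; search() also
    # finds the assignment when it is not at the very start of the script.
    data = pattern.search(script.string or '')
    if data:
        stock = json.loads(data.group(1))
        print(stock)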
</code></pre>
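<p>If you want to try the matching step without hitting the network, here is a minimal offline sketch. The helper name <code>extract_table_body</code> and the sample script strings are made up for illustration; only the regular expression comes from the answer above. It assumes the captured text is valid JSON.</p>

```python
import json
import re

# Same pattern as in the answer; the capture group holds the JSON literal.
PATTERN = re.compile(r'var table_body = (.*?);')


def extract_table_body(script_texts, pattern=PATTERN):
    """Return the parsed value from the first script text that matches, else None."""
    for text in script_texts:
        if text is None:  # e.g. script.string of an external or empty script tag
            continue
        found = pattern.search(text)
        if found:
            return json.loads(found.group(1))
    return None


# Hypothetical script contents standing in for the real page.
sample_scripts = [
    None,
    'var unrelated = 1;',
    'var table_body = [["ABCD", "Example Corp", 12.5]];',
]
rows = extract_table_body(sample_scripts)
print(rows)
```

<p>With BeautifulSoup you would pass <code>(s.string for s in soup.find_all('script'))</code> as the argument. Note that the non-greedy <code>(.*?);</code> stops at the first semicolon, so this would likely need adjusting if the data itself contains one.</p>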