擅长:python、mysql、java
<p>首先阅读BeautifulSoup文档<a href="http://www.crummy.com/software/BeautifulSoup/bs4/doc/" rel="nofollow">http://www.crummy.com/software/BeautifulSoup/bs4/doc/</a></p>
<p>第二,这里有一个小例子可以让你走得更远:</p>
<pre><code>from bs4 import BeautifulSoup as bs
soup = bs(your_html_content)
# for fbbuzzresult
buzz = soup.findAll("div", {"id" : "fbbuzzresult"})[0]
# to get postbuzz
pbuzz = buzz.findAll("div", {"class" : "postbuzz"})
"""pbuzz is now an array with the postbuzz divs
so now you can iterate through them, get
the contents, keep traversing the DOM with BS
or do whatever you are trying to do
So say you want the text from an element, you
would just do: the_element.contents[0]. However
if I'm remembering correctly you have to traverse
down through all of it's children to get the text.
"""
</code></pre>