擅长:python、mysql、java
<p>看看文档,attrs是一个设计糟糕的参数,应该更像是一个**kwargs。在</p>
<p><a href="http://www.crummy.com/software/BeautifulSoup/bs4/doc/#searching-by-css-class" rel="nofollow">http://www.crummy.com/software/BeautifulSoup/bs4/doc/#searching-by-css-class</a>表示实际要传递类\kwarg:</p>
<pre><code>>>> from bs4 import BeautifulSoup
>>> src = """ <div class="s">
... <div>
... <div class="f kv" style="white-space:nowrap">
... <cite class="vurls">www.somewebsite.com/</cite>\U+200E
... </div>
... </div>
... </div>
...
... """
>>> soup = BeautifulSoup(src)
>>> soup.find_all('cite')
[<cite class="vurls">www.somewebsite.com/</cite>]
>>> soup.find_all('cite', attr={'class': 'vurls'})
[]
>>> soup.find_all('cite', class_='vurls')
[<cite class="vurls">www.somewebsite.com/</cite>]
</code></pre>