<pre><code>for span in soup.findAll('span'):
    # skip spans that contain no link
    if span.a:
        print span.a["href"]
</code></pre>
<p>Or in a list comprehension:</p>
<pre><code>urls = [span.a["href"] for span in soup.findAll('span') if span.a]
</code></pre>
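<p>If you <code>print span.a</code> inside the loop, you will see <code>None</code> for some spans, so you need to check <code>if span.a</code> before using <code>span.a["href"]</code>; otherwise you will get a <code>TypeError: 'NoneType' object has no attribute '__getitem__'</code>.</p>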
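<p>To see the failure in isolation, here is a minimal sketch that needs no parser at all. The name <code>missing_anchor</code> and the plain dicts in <code>anchors</code> are stand-ins invented for illustration; they only mimic what <code>span.a</code> yields with and without a link. It is written for Python 3, where the same mistake is reported with a slightly different message (<code>'NoneType' object is not subscriptable</code>).</p>

```python
# Stand-in for what span.a evaluates to when a <span> has no <a> child.
missing_anchor = None

try:
    missing_anchor["href"]
except TypeError as exc:
    # Indexing None is what triggers the TypeError described above.
    error_message = str(exc)
    print(error_message)

# Hypothetical stand-ins for span.a values: dicts play the role of tags,
# None plays the role of a span without a link.
anchors = [{"href": "/sayfa2/"}, None, {"href": "/sayfa3/"}]

# The truthiness check skips the None entry instead of raising.
hrefs = [a["href"] for a in anchors if a]
print(hrefs)
```

<p>The <code>if a</code> guard plays the same role as <code>if span.a</code> in the loop and the comprehension.</p>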
<p>You can use a set comprehension instead, since there are duplicate URLs:</p>
<pre><code>urls = {span.a["href"] for span in soup.findAll('span') if span.a}
</code></pre>
<p>Then search for whichever URLs you need:</p>
<pre><code>for url in sorted(urls):
    if "sayfa" in url:
        print url
</code></pre>
<p>Output:</p>
<pre><code>http://www.forumsokagi.com/peygamber-ve-evliyalarin-hayatlari/sayfa2/
http://www.forumsokagi.com/peygamber-ve-evliyalarin-hayatlari/sayfa3/
http://www.forumsokagi.com/peygamber-ve-evliyalarin-hayatlari/sayfa4/
http://www.forumsokagi.com/peygamber-ve-evliyalarin-hayatlari/sayfa7/
</code></pre>
<p>The full IPython session:</p>
<pre><code>In [26]: import urllib2
In [27]: from bs4 import BeautifulSoup
In [28]: domain="http://www.forumsokagi.com/peygamber-ve-evliyalarin-hayatlari/"
In [29]: page = urllib2.urlopen(domain).read()
In [30]: soup = BeautifulSoup(page)
In [31]: urls = {span.a["href"] for span in soup.findAll('span') if span.a}
In [32]: for url in sorted(urls):
   ....:     if "sayfa" in url:
   ....:         print url
   ....:
http://www.forumsokagi.com/peygamber-ve-evliyalarin-hayatlari/sayfa2/
http://www.forumsokagi.com/peygamber-ve-evliyalarin-hayatlari/sayfa3/
http://www.forumsokagi.com/peygamber-ve-evliyalarin-hayatlari/sayfa4/
http://www.forumsokagi.com/peygamber-ve-evliyalarin-hayatlari/sayfa7/
</code></pre>
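<p>The session above is Python 2 (<code>urllib2</code>, the <code>print</code> statement). As a rough Python 3 sketch of the same idea that runs offline, the block below swaps BeautifulSoup for the standard library's <code>html.parser</code>, so it is not the method used above, only an approximation of it. The class <code>SpanLinkCollector</code>, the <code>SAMPLE_HTML</code> markup and the <code>example.com</code> URLs are all made up for illustration. One difference to be aware of: it collects every <code>&lt;a&gt;</code> nested in a <code>&lt;span&gt;</code>, whereas <code>span.a</code> only returns the first one.</p>

```python
from html.parser import HTMLParser


class SpanLinkCollector(HTMLParser):
    """Collect href values of <a> tags that appear inside a <span>."""

    def __init__(self):
        super().__init__()
        self.span_depth = 0  # number of <span> elements currently open
        self.urls = set()    # a set drops duplicate URLs automatically

    def handle_starttag(self, tag, attrs):
        if tag == "span":
            self.span_depth += 1
        elif tag == "a" and self.span_depth > 0:
            href = dict(attrs).get("href")
            if href:  # skip anchors that carry no href attribute
                self.urls.add(href)

    def handle_endtag(self, tag):
        if tag == "span" and self.span_depth > 0:
            self.span_depth -= 1


# Made-up markup standing in for the downloaded page (an assumption).
SAMPLE_HTML = """
<span><a href="http://example.com/forum/sayfa2/">2</a></span>
<span>no link here</span>
<span><a href="http://example.com/forum/sayfa3/">3</a></span>
<span><a href="http://example.com/forum/sayfa2/">2</a></span>
<span><a href="http://example.com/forum/other/">other</a></span>
"""

collector = SpanLinkCollector()
collector.feed(SAMPLE_HTML)

# Same filter as above: keep only the pagination ("sayfa") links, sorted.
page_urls = sorted(url for url in collector.urls if "sayfa" in url)
for url in page_urls:
    print(url)
```

<p>The span without a link is simply never recorded, which corresponds to the <code>if span.a</code> check, and the set removes the repeated <code>sayfa2</code> link just as the set comprehension does.</p>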