擅长:python、mysql、java
<p>在python2.7中使用beauthoulsoup3.2.1</p>
<pre><code>from BeautifulSoup import BeautifulSoup
search_string = 'SearchString'
website_source = '<a href="http://www.link-to-get.com">SearchString</a> <a href="http://www.link-to-get.com">OtherString</a>\
<a href="http://www.link-to-getSearchString.com">otherString</a>'
soup = BeautifulSoup(website_source)
# this will return a list of lists that has the url's and the name for the link
anchors = [[row['href'], row.text] for row in soup.findAll('a') if row['href'].find(search_string) <> -1 or search_string in row.text]
# prints whole list
print anchors
#prints first list
print anchors[0]
# prints the url for the first list
print anchors[0][0]
</code></pre>
<p>问题似乎是我用beauthoulsoup3.2.1测试了上述内容,它只在python2.x中运行,而您使用的是python3.4,因此出现了错误。<br/>
如果你安装BeautifulSoup4并尝试下面的代码,它应该可以工作。还要注意的是,beauthoulsoup4可以在2.x和3.x中工作</p>
<p>请注意,以下内容尚未测试。在</p>
^{pr2}$