擅长:python、mysql、java
<p>我尝试了这个方法,并且成功了,避免使用regexp来匹配每个<code>'href'</code>:</p>
<pre><code>from bs4 import BeautifulSoup as bs
soup = bs(htmltext)
for a in soup.findAll('a'):
a['href'] = "mysite"
</code></pre>
<p>查一下,在<a href="http://beautiful-soup-4.readthedocs.org/en/latest/#modifying-the-tree" rel="noreferrer">bs4 docs</a>。</p>