擅长:python、mysql、java
<p>只需从页面获取所有链接,请使用下面的代码:(python3)</p>
<pre><code>from bs4 import BeautifulSoup
import re
from urllib.request import urlopen
html_page = urlopen("http://www.google.com/")
soup = BeautifulSoup(html_page)
for link in soup.findAll('a', attrs={'href': re.compile("^http://")}):
print (link.get('href'))
</code></pre>