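<p>You can use the code below to get all the links on a web page. If you only want the links from a specific part of the page, change the <code>soup.findAll</code> call.</p>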
<pre><code>from urllib.parse import urljoin
from urllib.request import urlopen

from bs4 import BeautifulSoup

URL = "http://www.saij.gob.ar/resultados.jsp?r=%20fecha-rango:[19460101%20TO%2020211231]&b=avanzada&o=0&p=25&f=Total|Tipo%20de%20Documento/Legislaci%C3%B3n|Fecha|Organismo|Publicaci%C3%B3n|Tema|Estado%20de%20Vigencia|Autor|Jurisdicci%C3%B3n/Nacional&v=colapsada"

# Download the page and parse it (the 'lxml' parser needs the lxml package).
page = urlopen(URL).read()
soup = BeautifulSoup(page, 'lxml')

# href=True skips anchor tags that have no href attribute.
links = soup.findAll('a', href=True)
for link in links:
    href = link['href']
    if href.startswith('/'):
        # Root-relative link: resolve it against the page URL.
        print(urljoin(URL, href))
    elif href.startswith('http'):
        # Already an absolute link: print it unchanged.
        print(href)
</code></pre>
<p>Output:
<a href="https://i.stack.imgur.com/RRnmc.png" rel="nofollow noreferrer"><img src="https://i.stack.imgur.com/RRnmc.png" alt="Output"/></a></p>
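<p>To narrow the results to one part of the page, one option is to find the container element first and search for links only inside it. The sketch below runs on a small inline HTML string, so it needs no network access. The <code>menu</code> and <code>results</code> ids, the sample HTML, the base URL, and the helper name <code>links_in_section</code> are all made up for illustration; inspect the real page to find the element that wraps the links you want. It uses <code>find_all</code>, which is the current Beautiful Soup 4 name for <code>findAll</code>.</p>

```python
from urllib.parse import urljoin

from bs4 import BeautifulSoup

# Hypothetical page content and base URL, so the sketch runs offline.
BASE_URL = "http://example.com/docs/index.html"
HTML = """
<div id="menu"><a href="/home">Home</a></div>
<div id="results">
  <a href="/doc/1">First</a>
  <a href="http://example.org/doc/2">Second</a>
  <a>No href</a>
</div>
"""


def links_in_section(html, base_url, section_id):
    """Return absolute URLs of the links inside the element with the given id."""
    soup = BeautifulSoup(html, "html.parser")
    section = soup.find(id=section_id)
    if section is None:
        # No element with that id: nothing to report.
        return []
    # Search only inside the chosen element; href=True skips anchors without href.
    return [urljoin(base_url, a["href"]) for a in section.find_all("a", href=True)]


result_links = links_in_section(HTML, BASE_URL, "results")
print(result_links)
```

<p>Links in the <code>menu</code> block are ignored because the search starts from the <code>results</code> element rather than from the whole document.</p>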