擅长:python、mysql、java
<p>另一个解决方案:</p>
<pre><code>import bs4
import requests
r = requests.get('http://www.repository.voxforge1.org/downloads/fr/Trunk/Audio/Main/16kHz_16bit/')
soup = bs4.BeautifulSoup(r.content, 'html.parser')
for a in soup.select('a[href*=".tgz"]'):
print(a['href'])
</code></pre>
<p>印刷品:</p>
<pre><code>4h-20100505-vgm.tgz
Agoniste-20130928-bfg.tgz
Agoniste-20130928-fnn.tgz
Agoniste-20130928-gaf.tgz
Agoniste-20130928-izd.tgz
Agoniste-20130928-ndz.tgz
Agoniste-20130928-pzq.tgz
Agoniste-20130928-qyu.tgz
Agoniste-20130928-rva.tgz
...and so on.
</code></pre>