<blockquote>
<p>to find the quoted strings after <code>href=</code></p>
</blockquote>
<p>A short <code>requests</code> + <code>beautifulsoup</code> solution:</p>
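<pre><code>import requests, bs4

# Download the page and parse it with Python's built-in HTML parser
soup = bs4.BeautifulSoup(requests.get('http://www.openquestions.com').content, 'html.parser')
# Collect the href attribute of every anchor nested under dl/dt
hrefs = [a['href'] for a in soup.select('dl dt a')]
print(hrefs)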
</code></pre>
<p>Output:</p>
<pre><code>['oq-phys.htm', 'oq-math.htm', 'oq-life.htm', 'oq-tech.htm', 'oq-geo.htm', 'oq-map.htm', 'oq-about.htm', 'oq-howto.htm', 'oqc/oqc-home.htm', 'oq-indx.htm', 'oq-news.htm', 'oq-best.htm', 'oq-gloss.htm', 'oq-quote.htm', 'oq-new.htm']
</code></pre>
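<p>The values above are relative paths. If absolute URLs are needed, or if installing third-party packages is not an option, a minimal standard-library sketch along the following lines may help. Note the assumptions: <code>HrefCollector</code> and <code>extract_hrefs</code> are hypothetical helper names, the sample markup is made up for illustration, and <code>http://example.com/</code> is only a placeholder base URL. Unlike the <code>dl dt a</code> selector, this version collects every anchor on the page.</p>

```python
from html.parser import HTMLParser
from urllib.parse import urljoin


class HrefCollector(HTMLParser):
    """Hypothetical helper: records the href of every <a> start tag."""

    def __init__(self):
        super().__init__()
        self.hrefs = []

    def handle_starttag(self, tag, attrs):
        # attrs is a list of (name, value) pairs; value is None for bare attributes
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value is not None:
                    self.hrefs.append(value)


def extract_hrefs(html, base_url=None):
    """Return all anchor hrefs, resolved against base_url when one is given."""
    parser = HrefCollector()
    parser.feed(html)
    parser.close()
    if base_url is None:
        return parser.hrefs
    return [urljoin(base_url, href) for href in parser.hrefs]


# Made-up sample markup, shaped like the dl/dt/a structure targeted above
sample = (
    '<dl><dt><a href="oq-phys.htm">Physics</a></dt>'
    '<dt><a href="oq-math.htm">Math</a></dt></dl>'
)

print(extract_hrefs(sample))
print(extract_hrefs(sample, "http://example.com/"))  # placeholder base URL
```

<p>Passing the page's own address as <code>base_url</code> would turn each relative entry into a full link that can be fetched directly.</p>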