擅长:python、mysql、java
<p>如果使用多个捕获组,<code>re.findall</code>返回元组列表而不是字符串列表。尝试以下操作(仅使用单个组):</p>
<pre><code>>>> import re
>>> page = '''
... <a href="http://asecuritysite.com">here</a>
... <a href="https://www.sans.org/webcasts/archive/2013">there</a>
... '''
>>> re.findall(r'href="(https?:\/\/[^"]+)"',page)
['http://asecuritysite.com', 'https://www.sans.org/webcasts/archive/2013']
</code></pre>
<p>根据<a href="http://docs.python.org/2/library/re.html#re.findall" rel="nofollow">^{<cd1>} documentation</a>:</p>
<blockquote>
<p>If one or more groups are present in the pattern, return a list of
groups; this will be a <strong>list of tuples if the pattern has more than one
group.</strong></p>
</blockquote>