擅长:python、mysql、java
<p><code>html.parser</code>使用下面的正则表达式来提取元素属性</p>
<pre><code>html = """<input class="form-control" name='searchString' type=text value= "myString"/>"""
attrfind = re.compile(
r'((?<=[\'"\s/])[^\s/>][^\s/=>]*)(\s*=+\s*'
r'(\'[^\']*\'|"[^"]*"|(?![\'"])[^>\s]*))?(?:\s|/(?!>))*')
attrs = attrfind.findall(html)
for attr in attrs:
print('{}: {}'.format(attr[0], attr[2]))
</code></pre>
<p>结果</p>
<pre><code>class: "form-control"
name: 'searchString'
type: text
value: "myString"
</code></pre>