擅长:python、mysql、java
<p>你在找Python的<a href="http://docs.python.org/library/re.html" rel="nofollow noreferrer">re module</a>。</p>
<p>看看<a href="http://docs.python.org/library/re.html#re.findall" rel="nofollow noreferrer">re.findall</a>和<a href="http://docs.python.org/library/re.html#re.search" rel="nofollow noreferrer">re.search</a>。</p>
<p>正如您所提到的,您正在尝试解析html,为此使用<code>html parsers</code>。python中有两个选项可用,比如<a href="http://lxml.de/" rel="nofollow noreferrer">lxml</a>或<a href="http://www.crummy.com/software/BeautifulSoup/" rel="nofollow noreferrer">BeautifulSoup</a>。</p>
<p>看看这个<a href="https://stackoverflow.com/questions/1732348/regex-match-open-tags-except-xhtml-self-contained-tags/1732454#1732454">Why you should not parse html with regex</a></p>