擅长:python、mysql、java
<p>不要使用正则表达式解析HTML。使用像<a href="http://www.crummy.com/software/BeautifulSoup/" rel="noreferrer">BeautifulSoup</a>这样的HTML解析器。看看这有多简单:</p>
<pre><code>from BeautifulSoup import BeautifulSoup
html = r'<a href="removed because it was too long"><b>LG</b> X110</a>'
soup = BeautifulSoup(html)
print ''.join(soup.findAll(text=True))
# LG X110
</code></pre>