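<p>Here is a quick way to do it with <code>lxml</code>. I highly recommend <code>xpath</code> for this: select the target link, then remove each of its child elements.</p>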
<pre><code>>>> from lxml import etree
>>> doc = etree.XML("""<span class="nobr">
... <a href="http://www.google.com/">
... http://www.google.com/
... <sup>
... <img align="absmiddle" alt="" border="0" class="rendericon" height="7" src="http://jira.atlassian.com/icon.gif" width="7"/>
... </sup>
... </a>
... </span>""")
>>> for a in doc.xpath('//span[@class="nobr"]/a[@href="http://www.google.com/"]'):
...     for sub in list(a):
...         a.remove(sub)
...
>>> print(etree.tostring(doc, pretty_print=True, encoding="unicode"))
<span class="nobr">
<a href="http://www.google.com/">
http://www.google.com/
</a>
</span>
</code></pre>
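<p>One caveat with the loop above: in <code>lxml</code>, removing an element also discards its <code>.tail</code>, the text that follows the element's closing tag. That is harmless in the sample, where the tail is only whitespace, but it would silently drop real text. The following is a minimal sketch of one way to keep it, not the only approach. The helper name <code>strip_children</code>, the shortened <code>icon.gif</code> path, and the trailing " (external)" text are assumptions made up for illustration.</p>

```python
from lxml import etree


def strip_children(element, keep_tail=True):
    """Remove all child elements of `element`, keeping its own text.

    Hypothetical helper for illustration. With keep_tail=True, the text
    that followed each removed child (its `.tail`) is appended to the
    parent's text instead of being dropped.
    """
    for child in list(element):  # iterate over a copy: we mutate the parent
        if keep_tail and child.tail:
            element.text = (element.text or "") + child.tail
        element.remove(child)  # lxml removes child.tail together with the child
    return element


# Made-up sample: same shape as the markup above, plus text after </sup>.
doc = etree.XML(
    '<span class="nobr"><a href="http://www.google.com/">'
    'http://www.google.com/<sup><img src="icon.gif"/></sup> (external)'
    '</a></span>'
)

for a in doc.xpath('//span[@class="nobr"]/a[@href]'):
    strip_children(a)

result = etree.tostring(doc, encoding="unicode")
print(result)
```

<p>Passing <code>keep_tail=False</code> gives the same behaviour as the plain <code>a.remove(sub)</code> loop.</p>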