擅长:python、mysql、java
<p>使用<strong><code>etree.XMLParser</code></strong>对象的正确方法(<code>lxml.etree</code>仅限):</p>
<pre><code>import re
from lxml import etree
tags_to_fix = ['clientssss-manuf', 'client-manuf', 'essid']
parser = etree.XMLParser(recover=True) # recovery mode !
tree = etree.parse("input.xml", parser)
for el in tree.xpath('//*[name()="clientssss-manuf" or name()="client-manuf" or name()="essid"]'):
el.text = re.sub(r'\w{1,2};\s*', '', el.text).strip()
tree.write("output.xml", encoding="utf-8", pretty_print=True)
</code></pre>
<p>产生的<code>output.xml</code>的关键片段:</p>
^{pr2}$