擅长:python、mysql、java
<p>试试<code>lxml</code>库:它遵循elementtreeapi,并添加了许多额外的功能。从<a href="https://lxml.de/compatibility.html" rel="nofollow noreferrer">compatibility overview</a>:</p>
<blockquote>
<p>ElementTree ignores comments and processing instructions when parsing XML, while etree will read them in and treat them as Comment or ProcessingInstruction elements respectively. This is especially visible where comments are found inside text content, which is then split by the Comment element.</p>
<p>You can disable this behaviour by passing the boolean <code>remove_comments</code> and/or <code>remove_pis</code> keyword arguments to the parser you use. For convenience and to support portable code, you can also use the <code>etree.ETCompatXMLParser</code> instead of the default <code>etree.XMLParser</code>. It tries to provide a default setup that is as close to the ElementTree parser as possible.</p>
</blockquote>
<p>我知道在stdlib中不是这样,但是根据我的经验,当你需要标准元素树没有提供的东西时,这是最好的选择。在</p>