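<p>I would use a regular expression for this. Assume <code>data</code> holds the text of several concatenated XML documents and that you have a function <code>handle</code> that processes a single document. After the loop below finishes, <code>data</code> is either empty or contains an incomplete XML document, and <code>handle</code> has been called zero or more times.</p>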
<pre><code>import re

# Assumes `data` holds the concatenated XML text and `handle` is your
# function for processing one complete document.
while True:
    match = re.match(r'''
        \s*                     # ignore leading whitespace
        (                       # start first group
          <(?P<TAG>\S+).*?>     # opening tag (with optional attributes)
          .*?                   # stuff in the middle
          </(?P=TAG)>           # closing tag
        )                       # end of first xml document
        (?P<REM>.*)             # anything else
        ''',
        data, re.DOTALL | re.VERBOSE)
    if not match:
        break
    document = match.group(1)
    handle(document)
    data = match.group('REM')  # continue with whatever is left
</code></pre>
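<p>If it helps, here is the same idea as a self-contained sketch you can run directly. The names <code>split_documents</code> and <code>DOC_RE</code> and the sample input are my own, made up for illustration. The only real change is that the pattern is compiled once and matched from a moving offset, so the remainder is not copied on every pass.</p>

```python
import re

# Same pattern as in the loop above, minus the REM group, compiled once.
DOC_RE = re.compile(r'''
    \s*                     # ignore leading whitespace
    (                       # start first group
      <(?P<TAG>\S+).*?>     # opening tag (with optional attributes)
      .*?                   # stuff in the middle
      </(?P=TAG)>           # closing tag
    )                       # end of first xml document
    ''', re.DOTALL | re.VERBOSE)


def split_documents(data, handle):
    """Call handle() on each complete document; return the unconsumed rest."""
    pos = 0
    while True:
        match = DOC_RE.match(data, pos)  # match starting at offset pos
        if not match:
            return data[pos:]            # empty, or an incomplete document
        handle(match.group(1))
        pos = match.end()


# Hypothetical input: two complete documents followed by a truncated one.
found = []
leftover = split_documents('<a>1</a>\n<b x="y">2</b> <c>3', found.append)
print(found)     # ['<a>1</a>', '<b x="y">2</b>']
print(leftover)  # ' <c>3'
```

<p>Keep in mind that this stops at the first closing tag whose name matches the root, so a document whose root element contains a nested element of the same name will be cut short. For input like that, a real XML parser is the safer choice.</p>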