Parsing an XML file that contains emphasis tags in Python

2 answers

User

Answer #1 · edited 2024-06-28 11:02:35

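xml = '''<Segment StartTime="639.752" EndTime="642.270" Participant="fe016">
  But I bet it's a good <Pause/> superset of it.
</Segment>'''

# Solution using ElementTree
from xml.etree import ElementTree as ET

root = ET.fromstring(xml)
# <Pause/> is an empty element, so the sentence is split between the text
# before it (root.text) and the text after it (pause.tail).
pause = root.find('./Pause')
# Join both halves and collapse the whitespace left around the removed tag.
print(' '.join((root.text + pause.tail).split()))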
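
The snippet above relies on there being exactly one `<Pause/>` child. For segments with several inline tags, or tags that wrap words (such as the emphasis markup the question mentions), a more general approach is to walk every text fragment with `Element.itertext()`. The following is only a minimal sketch: the `<Emphasis>` element in the sample and the helper name `segment_text` are assumptions made for illustration, not taken from the original data.

```python
from xml.etree import ElementTree as ET

# Hypothetical sample: the <Emphasis> element is assumed for illustration.
xml = '''<Segment StartTime="639.752" EndTime="642.270" Participant="fe016">
  But I bet it's a <Emphasis>good</Emphasis> <Pause/> superset of it.
</Segment>'''


def segment_text(segment):
    """Return all text inside an element with the inline markup removed."""
    # itertext() yields the element's text, the text of nested elements,
    # and their tails, in document order.
    raw = "".join(segment.itertext())
    # Collapse the runs of whitespace left behind by the removed tags.
    return " ".join(raw.split())


root = ET.fromstring(xml)
result = segment_text(root)
print(result)
```

Unlike `root.text + pause.tail`, this keeps the words inside wrapping tags and does not fail when a segment has no `<Pause/>` child.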

User

Answer #2 · edited 2024-06-28 11:02:35

Another solution, using the simplified_scrapy library:

from simplified_scrapy import SimplifiedDoc

html = '''<Segment StartTime="639.752" EndTime="642.270" Participant="fe016">
  But I bet it's a good <Pause/> superset of it.
</Segment>'''
doc = SimplifiedDoc(html)
# Attributes and inner markup of the first <Segment> element
print(doc.Segment)
# Text content with the inline tags stripped
print(doc.Segment.text)

Result:

{'StartTime': '639.752', 'EndTime': '642.270', 'Participant': 'fe016', 'tag': 'Segment', 'html': "\n  But I bet it's a good <Pause /> superset of it.\n"}
But I bet it's a good superset of it.
