Python：如何提取嵌入在html文件中的xml？

<html> <head> <title> test֤</title> </head> <body> <form name="acsForm" action="" method="post" > <textarea rows=10 cols=80 name="xmlText"><?xml version="1.0" encoding="UTF-8"?> <samlp:Response xmlns:samlp="urn:oasis:names:tc:SAML:2.0:protocol"> </samlp:Response> </textarea> <textarea name="2nd"> text2....</textarea> </form> </body> </html>

3条回答

网友

1楼 · 编辑于 2024-09-28 20:47:31

也许lxml会起作用，虽然我自己从来没有用过它，所以我不知道做你想做的事情有多简单/复杂。在

网友

2楼 · 编辑于 2024-09-28 20:47:31

（啊！为什么这么多作者似乎认为<textarea>内容不需要HTML转义？傻瓜！）在

不幸的是，beauthoulsoup3.1没有应用（不正确但常见的）浏览器链接，即将<和{}字符视为文本，而是创建真正的XML元素。在

BeautifulSoup 3.0可以应付。Why there's a difference.

网友

3楼 · 编辑于 2024-09-28 20:47:31

尝试使用BeautifulGroup库的^{}部分，它是为XML设计的。在

相关问题更多 >

编程相关推荐

热门问题

热门文章