Python beauthulsoup解析特定tex问题的回答

Python beauthulsoup解析特定tex

回答此问题可获得 20 贡献值，回答如果被采纳可获得 50 分。

我正在解析一个html文件，我想找到文件中写着“小报告公司”的部分，旁边有一个“X”或复选框，或者没有。复选框通常是用Wingdings字体或ascii代码完成的。在下面的HTML中，您将看到它旁边的wingdings中有一个<code>&#254;</code>。在 我可以显示正则表达式搜索文本的结果，但在下一步中查找复选框时遇到了问题。在 我将使用它来解析许多不同的html文件，这些文件的格式不完全相同，但大多数文件都将使用类似于本例的表格和ascii文本。在 以下是HTML代码： <pre><code><HTML> <HEAD><TITLE></TITLE></HEAD> <BODY> <DIV align="left">Indicate by check mark whether the registrant is a large accelerated filer, an accelerated filer, a non-accelerated filer, or a smaller reporting company. See the definitions of &#147;large accelerated filer,&#148; &#147;accelerated filer&#148; and &#147;smaller reporting company&#148;. (Check one): </DIV> <DIV align="center"> <TABLE style="font-size: 10pt" cellspacing="0" border="0" cellpadding="0" width="100%">  <TR valign="bottom"> <TD width="22%">&nbsp;</TD> <TD width="3%">&nbsp;</TD> <TD width="22%">&nbsp;</TD> <TD width="3%">&nbsp;</TD> <TD width="22%">&nbsp;</TD> <TD width="3%">&nbsp;</TD> <TD width="22%">&nbsp;</TD> </TR> <TR></TR>   <TR valign="bottom"> <TD align="center" valign="top"> Large accelerated filer &#111; </TD> <TD>&nbsp;</TD> <TD align="center" valign="top">Accelerated filer &#111; </TD> <TD>&nbsp;</TD> <TD align="center" valign="top"> Non-accelerated filer &#111; (Do not check if a smaller reporting company) </TD> <TD>&nbsp;</TD> <TD align="center" valign="top"> Smaller reporting company &#254;</TD> </TR>  </TABLE> </DIV></BODY></HTML> </code></pre> 下面是我的Python代码： ^{pr2}$ 问题：我如何设置此项以进行依赖于第一次搜索的第二次搜索？所以当我找到“小报告公司”时，我可以搜索接下来的几行，看看是否有ascii码？我一直在看汤医生。我试着做find and findNext，但没能让它发挥作用。在

0 条评论
分类：Python问答

默认排序时间排序

1 个回答

匿名 1天前

　擅长：python、mysql、java

Python beauthulsoup解析特定tex

1 个回答

相关Python问题