擅长:python、mysql、java
<p>看起来这是<a href="http://en.wikipedia.org/wiki/XPath" rel="nofollow noreferrer">xpath</a>的工作。但是,<em>beauthoulsoup不支持XPath表达式</em>。在</p>
<p>考虑切换到<a href="http://lxml.de/" rel="nofollow noreferrer">lxml</a>或<a href="http://scrapy.org/" rel="nofollow noreferrer">scrapy</a>。在</p>
<p>仅供参考,对于测试xml,例如:</p>
<pre><code><html>
<h2 class="tabellen_ueberschrift al">Points</h2>
<div class="fl" style="width:49%;">
<table class="tabelle_grafik lh" cellpadding="2" cellspacing="1">a</table>
</div>
<h2 class="tabellen_ueberschrift al">Illegal</h2>
<div class="fl" style="width:49%;">
<table class="tabelle_grafik lh" cellpadding="2" cellspacing="1">b</table>
</div>
</html>
</code></pre>
<p>查找div中h2=“Points”后具有“tabelle_grafik lh”类的表的XPath表达式是:</p>
^{pr2}$