<p>我正在尝试获取以下html代码的标题:</p>
<pre><code><FONT COLOR=#5FA505><B>Claim:</B></FONT> &nbsp; Coed makes unintentionally risqu&eacute; remark about professor's "little quizzies."
<BR><BR>
<CENTER><IMG SRC="/images/content-divider.gif"></CENTER>
</code></pre>
<p>我用的是这个代码:</p>
^{pr2}$
<p>我成功地从前面提到的html代码中提取了我想要的正确的<code>Claim:</code>值,但它也(在同一页面中具有类似结构的其他代码)提取了下面的html。我定义我的<code>xpath()</code>只是拉入名为<code>Claim:</code>的<code>font</code>标记,那么它为什么还要拉下面的<code>Origins</code>?我怎样才能修好它呢?我试着看看我是否能只得到下一个而不是所有的,但是没用</p>
<pre><code><FONT COLOR=#5FA505 FACE=""><B>Origins:</B></FONT> &nbsp; Print references to the "little quizzies" tale date to 1962, but the tale itself has been around since the early 1950s. It continues to surface among college students to this day. Similar to a number of other college legends
</code></pre>