擅长:python、mysql、java
<p>查找感兴趣的元素所需的表达式取决于这些元素与文档中其他元素相比的唯一性。因此,如果没有元素的上下文,就很难提供帮助。在</p>
<p>您感兴趣的元素是文档中唯一的<code>font</code>元素并且具有<code>detDesc</code>类的元素吗?在</p>
<p>如果是这样,下面是一个使用<a href="http://lxml.de/" rel="nofollow">^{<cd3>}</a>的解决方案:</p>
<pre><code>import lxml.html as lh
html = '''
<font class="detDesc">Uploaded 10-29&nbsp;18:50, Size 4.36&nbsp;GiB, ULed by <a class="detDesc" href="/user/NLUPPER002/" title="Browse NLUPPER002">NLUPPER002</a></font>
'''
tree = lh.fromstring(html)
results = []
# iterate over all elements in the document that have a class of "detDesc"
for el in tree.xpath("//font[@class='detDesc']"):
# extract text from the font element
first = el.text
# extract text from the first <a> within the font element
second = el.xpath("a")[0].text
results.append((first, second))
print results
</code></pre>
<p><strong>结果:</strong></p>
^{pr2}$