擅长:python、mysql、java
<p>我觉得你把事情弄得有点太复杂了;使用另一个库和xpath进行尝试:</p>
<pre><code>import lxml.html as lh
cds = """[your html above]"""
doc = lh.fromstring(cds)
for cd in doc.xpath('//cd'):
decade = cd.xpath('./@decade')[0]
title = cd.xpath('./title/text()')[0]
artist = cd.xpath('./artist/text()')[0]
print("Title: "+title+", Artist: "+artist+", Decade: "+decade)
</code></pre>
<p>输出:</p>
<pre><code>Title: Gorillaz, Artist: Gorillaz, Decade: 00s
Title: Demon Days, Artist: Gorillaz, Decade: 00s
</code></pre>