<p>我正在努力搜寻speeches-usa.com的标题链接。下面是我的Python代码:</p>
<pre><code>SPEECH_SOURCE = 'http://www.speeches-usa.com/'
def get_speeches():
cj = CookieJar()
opener = urllib2.build_opener(urllib2.HTTPCookieProcessor(cj))
p = opener.open(SPEECH_SOURCE)
soup = BeautifulSoup(p.read(), PARSER_TYPE)
info = soup.find_all('a', class_='ListText')
elements = []
for element in info:
elements.append(element)
for i in x range(0, min(len(elements), 5)):
print elements[i]
</code></pre>
<p>(1)我不确定要在soup中放入什么。find \u all()参数可以获取链接-我尝试放入elements.append(element.get \u text()),但这样会产生以下结果,从而删除链接</p>
<pre><code>John Adams - Inaugural
Address
Samuel Adams - American
Independence
Spiro Agnew - Television
News Coverage
Susan B. Anthony - Women's
Right to Vote
</code></pre>
<p>(2)结果似乎不完整,例如,下面的代码中缺少Jane Adams</p>
<pre><code><a class="ListText" href="Transcripts/john_adams-inaugural.html">John Adams - Inaugural
Address<br/>
</a>
0
<a class="ListText" href="Transcripts/samuel_adams-independence.html">Samuel Adams - American
Independence<br/>
</a>
1
<a class="ListText" href="Transcripts/spiro_agnew-networknews.html">Spiro Agnew - Television
News Coverage<br/>
</a>
2
<a class="ListText" href="Transcripts/susan_b_anthony-vote.html">Susan B. Anthony - Women's
Right to Vote</a>
3
<a class="ListText" href="Transcripts/spiro_agnew-networknews.html"></a>
4
</code></pre>
<p>帮助和指导将不胜感激</p>