<p>如果你能修改你的问题,让我们知道你到底在找什么。但是,这里有一个基本的示例,说明如何从这个页面获取数据。你知道吗</p>
<pre><code>from bs4 import BeautifulSoup
import re
html = '''<div class="overview">
<span class="course_titles">Courses:</span>
<a href="/schools/courses/173/" class="course_name">Math101</a> (Math; Monday; Room 10);
<a href="/schools/student/1388/" class="coursestudent_name">Mark</a> 17,
<a href="/schools/student/1401/" class="coursestudent_name">Alex</a> 18, ),
<a href="/schools/courses/2693/" class="course_name">English101</a> (English; Thursdays; Room 12);
<a href="/schools/student/1403/" class="coursestudent_name">Sarah</a> 16,
<a href="/schools/student/1411/" class="coursestudent_name">Nancy</a> 17,
<a href="/schools/student/1390/" class="coursestudent_name">Casey</a> 17 ),
</div>'''
soup = BeautifulSoup(html)
all_links = soup.find_all('a')
dict_courseinfo = {}
dict_key = ''
stu_lst = []
for n, link in enumerate(all_links):
if link.get('class')[0] == 'course_name':
if n > 0:
dict_courseinfo[dict_key] = stu_lst
stu_lst = []
dict_key = str(link.text)
else:
age = int(re.search(link.text + r"</a> (\d+)", html).group(1))
stu_lst.append((str(link.text), age))
dict_courseinfo[dict_key] = stu_lst
print dict_courseinfo
</code></pre>
<p>将输出:</p>
<pre><code>{'Math101': [('Mark', 17), ('Alex', 18)], 'English101': [('Sarah', 16), ('Nancy', 17), ('Casey', 17)]}
</code></pre>