<p>假设你想用球队名称和球员构建dict,你可以做如下的事情。我不知道你是否需要突出的球员,比如特雷弗鲍尔?我添加了变量以备需要时保存它们。在</p>
<p>广告框和工具框通过:not pseudo class被排除,它传递一个要忽略的类列表。在</p>
<pre><code>from bs4 import BeautifulSoup as bs
import requests
r = requests.get('https://www.rotowire.com/baseball/daily-lineups.php')
soup = bs(r.content, 'lxml')
team_dict = {}
teams = [item.text for item in soup.select('.lineup__abbr')] #26
matches = {}
i = 0
for teambox in soup.select('.lineups > div:not(.is-ad, .is-tools)'):
team_visit = teams[i]
team_home = teams[i + 1]
highlights = teambox.select('.lineup__player-highlight-name a')
visit_highlight = highlights[0].text
home_highlight = highlights[1].text
match = team_visit + ' v ' + team_home
visitors = [item['title'] for item in teambox.select('.is-visit .lineup__player [title]')]
home = [item['title'] for item in teambox.select('.is-home .lineup__player [title]')]
matches[match] = {'visitor' : [{team_visit : visitors}] ,
'home' : [{team_home : home}]
}
i+=1
</code></pre>
<hr/>
<p><strong>示例信息:</strong></p>
<p><a href="https://i.stack.imgur.com/kZVB5.png" rel="nofollow noreferrer"><img src="https://i.stack.imgur.com/kZVB5.png" alt="enter image description here"/></a></p>
<hr/>
<p><strong>当前结构:</strong></p>
<p><img src="https://i.stack.imgur.com/AiIOf.png" width="200"/></p>