Python BeautifulSoup: "element-invisible" links appear when trying to find an href?

<a class="element-invisible element-focusable" href="#main-content" tabindex="1">Skip to main content</a> <a class="element-invisible element-focusable" href="#main-content">Skip to main content</a>

import bs4
import requests

# The original snippet showed only the body; the function name is assumed.
def find_ics_url():
    page = requests.get('https://registrar.fas.harvard.edu/calendar').content
    soup = bs4.BeautifulSoup(page, 'lxml')
    links = soup.find_all('a')
    for link in links:
        print(link)
        href = link.get('href')
        if href is not None and '.ics' in href:
            endout = href
            # Swap a leading webcal scheme for https
            if endout[:6] == 'webcal':
                endout = 'https' + endout[6:]
            print('URL: ' + endout)
            return endout

1 answer

User

#1 · Posted on 2024-09-29 23:33:04

I suggest simplifying the search by passing an href attribute filter with a regex pattern to find_all:

import re

links = soup.find_all('a', {'href': re.compile(r'.*\.ics')})

Output:

[<a class="subscribe" href="https://registrar.fas.harvard.edu/calendar/upcoming/all/export.ics">subscribe</a>,
 <a class="ical" href="https://registrar.fas.harvard.edu/calendar/upcoming/all/export.ics">iCal</a>]

This way you no longer have to jump through hoops to validate each anchor tag yourself.
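The filtering and the webcal-to-https fix from the question can also be tried offline on plain href strings. The following is a minimal sketch using only the standard library. The names ICS_HREF, to_https and first_ics_url are hypothetical, and the webcal sample URL is an assumption for illustration (the output above shows https links). BeautifulSoup applies a compiled pattern with search(), so the leading `.*` is not needed, and the pattern here is anchored so that it only accepts hrefs ending in .ics (optionally followed by a query string).

```python
import re

# Hypothetical pattern: href ends in .ics, optionally followed by a query string.
ICS_HREF = re.compile(r'\.ics(\?.*)?$')


def to_https(url):
    """Replace a leading webcal scheme with https; leave other URLs unchanged."""
    if url.startswith('webcal:'):
        return 'https:' + url[len('webcal:'):]
    return url


def first_ics_url(hrefs):
    """Return the first .ics URL in hrefs (normalized to https), or None."""
    for href in hrefs:
        # link.get('href') returns None for anchors that have no href.
        if href is not None and ICS_HREF.search(href):
            return to_https(href)
    return None


# Sample values; the webcal URL is an assumed variant of the link in the output.
hrefs = [
    '#main-content',
    None,
    'webcal://registrar.fas.harvard.edu/calendar/upcoming/all/export.ics',
]
print(first_ics_url(hrefs))
```

With a parsed page, the same pattern could be passed directly as `soup.find_all('a', href=ICS_HREF)`, and to_https applied to each resulting `link.get('href')`.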
