Python: How do I get the page number using bs4?

Posted 2024-10-02 02:38:42

I need to get the data-page value from this markup:

<span class="page-item mhide"> 
<a class="page-link active" href="javascript:void(0)" style="display: none;">1</a>
<a onclick="_gaq.push(['_trackEvent', 'BuSearch', 'ClickOn_pagination_number', 'go_to_pagination'])"
class="page-link" data-page="1" 
href="/search/po_yers%5B0%5D=2003&amp;category_id=1&amp;marka_id%5B0%5D=28&amp;model_id%5B0%5D=262&amp;abroad=2&amp;
custom=1&amp;indexName=auto&amp;page=1&amp;countpage=20">2</a> </span>

I tried this, but it returns 1:

from bs4 import BeautifulSoup

def get_pages_count(html):
    soup = BeautifulSoup(html, 'html.parser')
    pagination = soup.find_all('span', class_='page-item mhide')
    if pagination:
        # the last matched element is expected to hold the total number of pages
        return int(pagination[-1].get_text())
    else:
        return 1

The full code is here: link
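
For reference, here is a minimal sketch of one way to read the data-page attribute directly from the links instead of converting the whole span's text. The helper name get_data_pages is hypothetical. The "+ 1" assumes data-page is zero-based, which is only a guess from the snippet above, where the link labelled 2 carries data-page="1". If the downloaded HTML does not contain these links at all, this will still return 1.

```python
from bs4 import BeautifulSoup


def get_data_pages(html):
    """Return the data-page values of all pagination links as integers.

    Hypothetical helper, not part of the original code.
    """
    soup = BeautifulSoup(html, 'html.parser')
    # CSS selector: <a> tags with class "page-link" that have a data-page attribute.
    links = soup.select('a.page-link[data-page]')
    # Skip any link whose data-page is not a plain number.
    return [int(a['data-page']) for a in links if a['data-page'].isdigit()]


def get_pages_count(html):
    pages = get_data_pages(html)
    # Assumption: data-page is zero-based, so the page count is the largest value + 1.
    return max(pages) + 1 if pages else 1
```

With the markup from the question, get_data_pages should pick up only the second link, since the first one has no data-page attribute.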

