擅长:python、mysql、java
<p>查看页面后,“下一步”按钮只是一个链接</p>
<pre class="lang-html prettyprint-override"><code><a href="/events/february/5?p=2" class="pag__next" rel="next">
<span>Next</span>
</a>
</code></pre>
<p>请注意链接<code>/events/february/5?p=2</code>。您所需要做的就是在一个范围内迭代并进行请求调用。每当你点击404,你就退出循环。我将把循环交给你</p>
<p>编辑</p>
<pre class="lang-py prettyprint-override"><code>i = 1
while True:
res = request.get(f"https://www.onthisday.com/events/february/5?p={i}")
if is_visited(res.content):
# TODO write a function to check if you have visited these contents
break
...
# TODO wirte a function to updated the visited list or something similar
visited(res.content)
i+=1 # incrementing i
</code></pre>