<p>Initialize pageNum inside the for loop, so that it resets to 1 for each value in postkeys:</p>
<pre><code>for x in postkeys:
    pageNum = 1
</code></pre>
<p>Inside the loop, format the URL with the current pageNum, then increment pageNum:</p>
<pre><code>for item in data_list:
    # format the website URL
    url = "https://www.yell.com/ucs/UcsSearchAction.do?keywords={}&pageNum={}&location={}".format(comtype, pageNum, x)
    site = requests.get(url, headers=headers)
    # check the response status code:
    if site.status_code != 200:
        break
    pageNum += 1
</code></pre>
<p>This for loop should be removed:</p>
<pre><code>for y in postkeynum:
    url = 'https://www.yell.com/ucs/UcsSearchAction.do?keywords=' + comtype + '&pageNum=' + str(y) + '&location=' + x
    data_list.append(url)
</code></pre>
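<p>Putting these changes together, the pagination could look roughly like the sketch below. It is only an illustration under some assumptions: build_url, crawl_pages, the fetch_status callback and the max_pages safety cap are hypothetical names that are not in the original code, and the inner loop is written as a bounded while loop rather than iterating over data_list.</p>

```python
from urllib.parse import urlencode

BASE_URL = "https://www.yell.com/ucs/UcsSearchAction.do"


def build_url(keywords, page_num, location):
    """Build the search URL for one page (hypothetical helper)."""
    query = urlencode(
        {"keywords": keywords, "pageNum": page_num, "location": location}
    )
    return f"{BASE_URL}?{query}"


def crawl_pages(comtype, postkeys, fetch_status, max_pages=10):
    """Walk the result pages for every location in postkeys.

    fetch_status is a callable that takes a URL and returns an HTTP
    status code; max_pages is an assumed safety cap, not part of the
    original code. Returns the URLs that answered with status 200.
    """
    visited = []
    for x in postkeys:
        page_num = 1  # reset the page counter for each location
        while page_num <= max_pages:
            url = build_url(comtype, page_num, x)
            # stop paging this location on the first non-200 response
            if fetch_status(url) != 200:
                break
            visited.append(url)
            page_num += 1
    return visited
```

<p>With requests, fetch_status could be something like lambda url: requests.get(url, headers=headers).status_code, where headers is the dictionary from the question. Passing the fetcher in as a parameter also makes it easy to try the loop with a fake function before hitting the real site.</p>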