擅长:python、mysql、java
<p><code>storeName</code>是一个数组,需要循环遍历它。目前,您在每一页上都使用页码对其进行索引,这可能不是您的意图。你知道吗</p>
<p>下面是代码的正确版本,添加了循环。你知道吗</p>
<pre><code>import urllib
import re
pagelist = ["page=1","page=2","page=3","page=4","page=5","page=6","page=7","page=8","page=9","page=10"]
ziplocations = ["=30008","=30009"]
i=0
while i<len(pagelist):
url = "http://www.boostmobile.com/stores/?" +pagelist[i]+"&zipcode=30008"
htmlfile = urllib.urlopen(url)
htmltext = htmlfile.read()
regex = '<h2 style="float:left;">(.+?)</h2>'
pattern = re.compile(regex)
storeName = re.findall(pattern,htmltext)
for sn in storeName:
print "Store Name=", sn
i+=1
</code></pre>