我需要从这个页面获取每个产品的url http://www.stalkbuylove.com/new-arrivals/week-2.html#/page/1 然后需要从product link获取每个产品的详细信息。我不知道怎么做。在
import scrapy
import json
import redis
r_server = redis.Redis('localhost')
class DmozSpider(scrapy.Spider):
name = "dmoz"
allowed_domains = ["stalkbuylove.com"]
start_urls = [
"http://www.stalkbuylove.com/new-arrivals/week-2.html#/page/1"
]
def parse(self, response):
for sel in response.css('.product-detail-slide'):
name = sel.xpath('div/a/@title').extract()
price = sel.xpath('div/span/span/text()').extract()
productUrl = sel.xpath('div/a/@href').extract()
request = scrapy.Request(''.join(productUrl), callback=self.parseProductPage)
r_server.hset(name,"Name",name)
r_server.hset(name,"Price",price)
r_server.hset(name,"ProductUrl",productUrl)
print name, price, productUrl
def parseProductPage(self, response):
for sel in response.css('.top-details-product'):
availability = sel.xpath('div/link/@href').extract()
print availability
有人能帮忙吗?当我得到产品的网址如何抓取网址?现在我正在调用parseProductUrlPage,它不工作。在
目前没有回答
相关问题 更多 >
编程相关推荐