在斯帕利的pipelin中注入参数

process = CrawlerProcess(get_project_settings()) process.crawl(SomeCrawler) process.crawl(AnotherCrawler) ... some_argument = ... # instantiate my custom argument # this is made up, it's what i've been unable to find how to do properly my_pipeline = MyPipeline(some_argument) process.pipelines.append(my_pipeline, ...) process.start()

1条回答

网友

1楼 · 发布于 2024-09-28 03:20:05

您可以使用scrapyfrom_crawler方法。废文档有一个好的description和{a2}：

class MongoPipeline(object):

    collection_name = 'scrapy_items'

    def __init__(self, mongo_uri, mongo_db):
        self.mongo_uri = mongo_uri
        self.mongo_db = mongo_db

    @classmethod
    def from_crawler(cls, crawler):
        return cls(
            mongo_uri=crawler.settings.get('MONGO_URI'),
            mongo_db=crawler.settings.get('MONGO_DATABASE', 'items')
        )

“如果存在，则调用此classmethod从爬虫程序创建管道实例。它必须返回管道的新实例。“

这样，您就可以根据爬虫程序或爬行器设置创建管道的新实例。在

相关问题更多 >

编程相关推荐

热门问题

热门文章