scrapy：访问管道中的spider类变量__init__

我知道你可以访问process_item（）中的spider变量，但是如何访问管道中的spider变量init函数？scrapy：访问管道中的spider类变量__init__

class SiteSpider(CrawlSpider): 
    def __init__(self): 
     self.id = 10 

class MyPipeline(object): 
    def __init__(self): 
     ...

我还需要访问MyPipeline中的CUSTOM_SETTINGS_VARIABLE。

来源

2013-11-22 valerij vasilcenko

您无法访问蜘蛛实例，因为在引擎启动时完成了管道初始化。事实上，你必须认为你的管道可以处理多个蜘蛛，而不是一个蜘蛛。

话虽如此，你可以挂钩spider_opened信号来访问蜘蛛实例启动时。

from scrapy import signals 


class MyPipeline(object): 

    def __init__(self, mysetting): 
     # do stuff with the arguments... 
     self.mysetting = mysetting 

    @classmethod 
    def from_crawler(cls, crawler): 
     settings = crawler.settings 
     instance = cls(settings['CUSTOM_SETTINGS_VARIABLE'] 
     crawler.signals.connect(instance.spider_opened, signal=signals.spider_opened) 
     return instance 

    def spider_opened(self, spider): 
     # do stuff with the spider: initialize resources, etc. 
     spider.log("[MyPipeline] Initializing resources for %s" % spider.name) 

    def process_item(self, item, spider): 
     return item

来源

2013-11-23 18:39:47 Rolando

请更新您的答案：我还需要访问MyPipeline中的CUSTOM_SETTINGS_VARIABLE。 –

@hellomyfriends是设置模块中的设置吗？您可以通过'crawler.settings'访问设置模块。 – Rolando

是的。非常感谢 –

scrapy：访问管道中的spider类变量init

回答

scrapy：访问管道中的spider类变量__init__

回答

相关问题

scrapy：访问管道中的spider类变量init