Pipelines API Reference
Minimal, auto-generated API docs for pipelines. See README/Quickstart for usage.
Main pipelines
DbInsertPipeline
- class scrapy_item_ingest.DbInsertPipeline(settings)[source]
Bases:
ItemsPipeline,RequestsPipelineMain pipeline that combines item processing and request tracking. Inherits from both ItemsPipeline and RequestsPipeline.
ItemsPipeline
- class scrapy_item_ingest.ItemsPipeline(settings)[source]
Bases:
BasePipelinePipeline for handling scraped items
RequestsPipeline
- class scrapy_item_ingest.RequestsPipeline(settings)[source]
Bases:
BasePipelinePipeline for handling request tracking
Base class
Notes
Tables: job_items, job_requests, job_logs (created when CREATE_TABLES = True).
Configure DB via DB_URL or discrete fields (DB_HOST, DB_USER, etc.).
See configuration for all settings and extensions for DB logging controls.