Skip to content

huangzhiyong/scrapy-mongodb

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

34 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Description

It's a pipeline which allow you to store scrapy items in MongoDB database.

Install

pip install "ScrapyMongoDB"

Configure your settings.py:

ITEM_PIPELINES = [
  'scrapymongodb.MongoDBPipeline',
]

MONGODB_SERVER = 'localhost'
MONGODB_PORT = 27017
MONGODB_DB = 'scrapy'
MONGODB_COLLECTION = 'items'
MONGODB_UNIQ_KEY = 'url'
MONGODB_ITEM_ID_FIELD = '_id'
MONGODB_SAFE = True

Changelog

0.4.1 Cancel use of deprecated settings import

0.4.0 Update only those fields that are defined in the scrapy Item by Mikhail Fedosov (https://github.com/fedosov)

0.3.0 Code improvements by Jordi Llonch (https://github.com/llonchj)

0.2.1 pip now install requires modules scrapy and pymongo

License

Copyright 2011 Julien Duponchelle

Licensed under the Apache License, Version 2.0 (the "License"); you may not use this file except in compliance with the License. You may obtain a copy of the License at

http://www.apache.org/licenses/LICENSE-2.0

Unless required by applicable law or agreed to in writing, software distributed under the License is distributed on an "AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. See the License for the specific language governing permissions and limitations under the License.

About

Mongodb support for scrapy

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published