This fork of Mongita implements a limited-size cache (to prevent OOM issues with large data sets) and improves performance by removing some deep copies.
Mongita is a lightweight embedded document database that implements a commonly-used subset of the MongoDB/PyMongo interface. Mongita differs from MongoDB in that instead of being a server, Mongita is a self-contained Python library. Mongita can be configured to store its documents either on disk or in memory.
"Mongita is to MongoDB as SQLite is to SQL"
Please report any bugs. Mongita is free and open source. You can contribute!
- Embedded database: Mongita is a good alternative to SQLite for embedded applications when a document database makes more sense than a relational one.
- Unit testing: Mocking PyMongo/MongoDB is a pain. Worse, mocking can hide real bugs. By monkey-patching PyMongo with Mongita, unit tests can be more faithful while remaining isolated.
- MongoDB compatibility: Mongita implements a commonly-used subset of the PyMongo API. This allows projects to be started with Mongita and later upgraded to MongoDB once they reach an appropriate scale.
- Embedded/self-contained: Mongita does not require a server or start a process. It is just a Python library. To use it, just add
import mongita
to the top of your script. - Speed: Mongita is within an order of magnitude of MongoDB and Sqlite in 10k document benchmarks. See the performance section below.
- Well tested: Mongita has 100% test coverage and more test code than library code.
- Limited dependencies: Mongita runs anywhere that Python runs. Currently the only dependencies are
pymongo
(for bson) andsortedcontainers
(for faster indexes). - Thread-safe: (EXPERIMENTAL) Mongita avoids race conditions by isolating certain document modification operations.
- You need a traditional server/client relationship: Mongita is an embedded database. It is not process-safe. When you have multiple clients, a traditional server/client database is the correct choice.
- You run a lot of uncommon commands: Mongita implements a commonly used subset of MongoDB. While the goal is to eventually implement most of it, it will take some time to get there.
- You need extreme performance: Mongita has comparable performance to MongoDB and SQLite for common operations. However, it's possible you'll find bottlenecks - especially with uncommon operations.
pip3 install mongita
>>> from mongita import MongitaClientDisk
>>> client = MongitaClientDisk()
>>> hello_world_db = client.hello_world_db
>>> mongoose_collection = hello_world_db.mongoose_collection
>>> mongoose_collection.insert_many([{'name': 'Meercat', 'does_not_eat': 'Snakes'},
{'name': 'Yellow mongoose', 'eats': 'Termites'}])
<mongita.results.InsertManyResult object at 0x000000000>
>>> mongoose_collection.count_documents({})
2
>>> mongoose_collection.update_one({'name': 'Meercat'}, {'$set': {"weight": 2}})
<mongita.results.UpdateResult object at 00000000000>
>>> mongoose_collection.find({'weight': {'$gt': 1}})
<mongita.cursor.Cursor object at 00000000000>
>>> list(mongoose_collection.find({'weight': {'$gt': 1}}))
[{'_id': 'a1b2c3d4e5f6', 'name': 'Meercat', 'does_not_eat': 'Snakes', 'weight': 2}]
>>> mongoose_collection.delete_one({'name': 'Meercat'})
<mongita.results.DeleteResult object at 00000000000>
Refer to the PyMongo docs for detailed syntax and behavior. Most named keyword parameters are not implemented. When something is not implemented, efforts are made to be loud and obvious about it.
mongita.MongitaClientMemory / mongita.MongitaClientDisk (PyMongo docs)
mongita.MongitaClient.close()
mongita.MongitaClient.list_database_names()
mongita.MongitaClient.list_databases()
mongita.MongitaClient.drop_database(name_or_database)
Note: By default, MongitaClientDisk stores its data in ~/.mongita. To use a different directory, pass host
when initializing client = MongitaClientDisk(host=<db_path>)
.
Database (PyMongo docs)
mongita.Database.list_collection_names()
mongita.Database.list_collections()
mongita.Database.drop_collection(name_or_collection)
Collection (PyMongo docs)
mongita.Collection.insert_one(document)
mongita.Collection.insert_many(documents, ordered=True)
mongita.Collection.find_one(filter, sort)
mongita.Collection.find(filter, sort, limit)
mongita.Collection.replace_one(filter, replacement, upsert=False)
mongita.Collection.update_one(filter, update)
mongita.Collection.update_many(filter, update)
mongita.Collection.delete_one(filter)
mongita.Collection.delete_many(filter)
mongita.Collection.count_documents(filter)
mongita.Collection.distinct(key, filter)
mongita.Collection.create_index(keys)
mongita.Collection.drop_index(index_or_name)
mongita.Collection.index_information()
Cursor (PyMongo docs)
mongita.Cursor.sort(key_or_list, direction=None)
mongita.Cursor.next()
mongita.Cursor.limit(limit)
mongita.Cursor.skip(skip)
mongita.Cursor.clone()
mongita.Cursor.close()
CommandCursor (PyMongo docs)
mongita.CommandCursor.next()
mongita.CommandCursor.close()
errors (PyMongo docs)
mongita.errors.MongitaError (parent class of all errors)
mongita.errors.PyMongoError (alias of MongitaError)
mongita.errors.InvalidOperation
mongita.errors.OperationFailure
mongita.errors.DuplicateKeyError
mongita.errors.MongitaNotImplementedError
results (PyMongo docs)
mongita.results.InsertOneResult
mongita.results.InsertManyResult
mongita.results.UpdateResult
mongita.results.DeleteResult
Currently implemented query operators
$eq
$gt
$gte
$in
$lt
$lte
$ne
$nin
Currently implemented update operators
$set
$inc
$push
Mongita is an excellent project for open source contributors. There is a lot to do and it is easy to get started. In particular, the following tasks are high in priority:
- More testing. Try Mongita on a project and report any bugs. The unit tests are extensive but in-the-wild bugs are still possible.
- Windows support. Mongita fails unit tests on Windows. Fixing these and integration testing the library would be very helpful.
- SQLite engine. SQLite beats Mongita in most benchmarks. Having it as an engine would provide a fast persistent database which would be a little less than a perfect MongoDB reproduction.
- Cache eviction. In the disk engine, cache is never evicted after it is read or written. This could cause issues when dealing with large databases.
- More update operators. Currently, only $set and $inc are implemented.
- More query operators. Currently, only the "comparison operators" are implemented.
- find_one_and_... methods.
- Aggregation pipelines.
- More cursor methods. Currently only sort, next, skip, and limit are implemented.
You are welcome to email me at scottmrogowski@gmail.com if you are interested.
BSD 3-clause. Mongita is free and open source for any purpose with basic restrictions related to liability, warranty, and endorsement.
Mongita was started as a component of the fastmap server. Fastmap offloads and parallelizes arbitrary Python functions on the cloud.
- TinyMongo: Python library. Attempts to replicate the MongoDB interface.
- MontyDb: Python library. Attempts to replicate the MongoDB interface.
- UnQLite: Embedded NoSQL with Python bindings. Does not attempt to replicate the MongoDB interface. Very popular.
- NeDB: Javascript library. Attempts to replicate the MongoDB interface. Very popular.