Rust Database Engine

A high-performance database engine written in Rust, focusing on efficient BSON document storage and retrieval.

Week 2 Progress: BSON Serialization Implementation

Completed Features

  1. Zero-Copy Deserialization

    • Implemented streaming deserialization to minimize memory allocations
    • Added support for partial document reading
    • Optimized field name extraction
  2. Memory Optimizations (a configuration sketch follows this list)

    • Added document size validation
    • Implemented memory limits for document operations
    • Added nesting depth limits to prevent stack overflow
    • Optimized string allocation patterns
  3. Performance Features

    • Streaming array/object encoding
    • Efficient handling of nested documents
    • Buffer reuse for string and binary data
    • Progress tracking callbacks for long operations
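
One way these limits could surface in configuration, as a hypothetical sketch (only the 16 MB document cap is stated elsewhere in this README; the struct and field names here are illustrative, not the crate's actual API):

    // Hypothetical configuration for the validation knobs listed above;
    // the real API may expose these differently.
    struct DecodeOptions {
        max_document_size: usize, // reject documents larger than this many bytes
        max_nesting_depth: usize, // guard against stack overflow on deep nesting
    }

    impl Default for DecodeOptions {
        fn default() -> Self {
            DecodeOptions {
                max_document_size: 16 * 1024 * 1024, // the 16 MB cap noted below
                max_nesting_depth: 32,               // illustrative default
            }
        }
    }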

API Improvements

  1. Document Operations

    // Create and modify documents
    let mut doc = Document::new();
    doc.set("field", Value::String("value".to_string()));
    
    // Access document fields
    if let Some(value) = doc.get("field") {
        println!("Found value: {}", value);
    }
  2. Serialization

    // Serialize with progress tracking
    let mut encoder = BsonEncoder::new(buffer);
    encoder.with_progress_callback(|written, total| {
        println!("Progress: {}/{}", written, total);
    });
    encoder.encode_document(&doc)?;
  3. Partial Document Reading

    // Read only specific fields
    let fields = vec!["field1", "field2"];
    let mut decoder = BsonDecoder::new(buffer);
    let partial = decoder.decode_partial_document(&fields)?;

Performance Characteristics

  1. Memory Usage

    • Document size validation (max 16 MB)
    • Streaming operations for large documents
    • Efficient string handling
  2. CPU Efficiency

    • Zero-copy operations where possible
    • Minimal data copying during serialization
    • Efficient nested document handling
    • Optimized string processing
  3. Throughput (a buffer-reuse sketch follows this list)

    • Streaming support for large documents
    • Progress tracking for long operations
    • Partial document reading support
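
A minimal sketch of the buffer-reuse pattern behind these throughput items, assuming BsonEncoder::new can take a mutable borrow of the output buffer (this README does not show the exact signature):

    // Reuse one allocation while streaming many documents.
    let mut buffer = Vec::with_capacity(16 * 1024);
    for doc in &documents {
        buffer.clear(); // keep the capacity, drop the previous bytes
        let mut encoder = BsonEncoder::new(&mut buffer);
        encoder.encode_document(doc)?;
        // hand `buffer` to the disk or network writer here
    }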

Error Handling

  1. Validation

    • Document size limits
    • Field name validation
    • UTF-8 string validation
    • Nesting depth limits
  2. Error Types (sketched as an enum after this list)

    • IO errors
    • Memory limit errors
    • Invalid data errors
    • Missing field errors
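
Taken together, the categories above suggest a single error enum; a hypothetical sketch (variant names are illustrative, not the crate's actual type):

    use std::io;

    #[derive(Debug)]
    pub enum DbError {
        Io(io::Error),                // reader/writer failures
        MemoryLimit { limit: usize }, // document exceeded a configured cap
        InvalidData(String),          // malformed BSON, bad UTF-8, bad field names
        MissingField(String),         // requested field absent in a partial read
    }

    impl From<io::Error> for DbError {
        fn from(e: io::Error) -> Self {
            DbError::Io(e)
        }
    }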

Testing

  1. Unit Tests (a representative test follows this list)

    • Document lifecycle tests
    • Error handling tests
    • Streaming operation tests
    • Memory limit tests
  2. Integration Tests

    • End-to-end document operations
    • Serialization/deserialization
    • Error handling scenarios
    • Performance characteristics
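
A representative lifecycle test, using only the Document API shown earlier:

    use database::{Document, Value};

    #[test]
    fn set_then_get_roundtrips() {
        let mut doc = Document::new();
        doc.set("field", Value::String("value".to_string()));

        // The stored value should be readable back under the same key,
        // and absent keys should come back as None.
        assert!(doc.get("field").is_some());
        assert!(doc.get("missing").is_none());
    }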

Getting Started

  1. Installation

    git clone <repository-url>
    cd rust_database_engine
    cargo build --release
  2. Running Tests

    cargo test
    cargo bench  # Run benchmarks
  3. Example Usage

    use database::{BsonDecoder, BsonEncoder, Document, Value};
    
    // Create a document
    let mut doc = Document::new();
    doc.set("name", Value::String("example".to_string()));
    
    // Serialize (`buffer` is whatever byte sink BsonEncoder::new accepts)
    let mut encoder = BsonEncoder::new(buffer);
    encoder.encode_document(&doc)?;
    
    // Deserialize from the same bytes
    let mut decoder = BsonDecoder::new(buffer);
    let doc = decoder.decode_document()?;

Contributing

Contributions are welcome! Please feel free to submit a Pull Request.

Description

A lightweight, easy-to-use NoSQL database with a focus on performance and simplicity. The engine supports
basic CRUD operations, indexing, querying, and transactions, and is built to resemble MongoDB's system of
collections and documents.

Inspirations

  • MongoDB
  • PostgreSQL

Current Progress

  • Database Types
  • Document Struct that holds database types
  • Testing and Benchmarks for Types & Documents
  • BSON Serialization/Deserialization

Examples

use std::collections::BTreeMap;
use database::{Document, ObjectId, Value};

#[allow(dead_code)]
fn example_organization_structure() -> Document {
    let mut user1 = BTreeMap::new();
    user1.insert("name".to_string(), Value::String("Charlie".to_string()));
    user1.insert("role".to_string(), Value::String("Developer".to_string()));

    let mut user2 = BTreeMap::new();
    user2.insert("name".to_string(), Value::String("Dana".to_string()));
    user2.insert("role".to_string(), Value::String("Designer".to_string()));

    let team_members = vec![Value::Object(user1), Value::Object(user2)];

    let mut team = BTreeMap::new();
    team.insert("name".to_string(), Value::String("Frontend".to_string()));
    team.insert("members".to_string(), Value::Array(team_members));

    let mut org = BTreeMap::new();
    org.insert(
        "org_name".to_string(),
        Value::String("Acme Corp".to_string()),
    );
    org.insert("teams".to_string(), Value::Array(vec![Value::Object(team)]));

    Document {
        data: org,
        id: Value::ObjectId(ObjectId::new()),
    }
}
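
The returned document plugs straight into the encoder shown earlier (snippet-style, like the README's other examples):

let doc = example_organization_structure();
let mut encoder = BsonEncoder::new(buffer);
encoder.encode_document(&doc)?;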

Benchmark Results

  1. Document Serialization

    • Small documents (10 fields): ~4 µs
    • Medium documents (100 fields): ~35 µs
    • Large documents (1000 fields): ~370 µs
    • Very large documents (10000 fields): ~6 ms
    • Nested documents (depth 5-50): 0.6-3.5 µs
    • Mixed type documents: ~1 µs
  2. Document Deserialization

    • Small documents (10 fields): ~10 µs
    • Medium documents (100 fields): ~130 µs
    • Large documents (1000 fields): ~1.7 ms
    • Very large documents (10000 fields): ~21 ms
    • Nested documents (depth 5-50): 1.8-16 µs
    • Mixed type documents: ~2.3 µs
  3. Partial Document Operations

    • Small documents (100 fields):
      • 1 field: ~30 µs
      • 10 fields: ~37 µs
      • 50 fields: ~80 µs
    • Large documents (10000 fields):
      • 1 field: ~4.4 ms
      • 10 fields: ~4.9 ms
      • 50 fields: ~7.3 ms
  4. Streaming Operations

    • Document size scaling:
      • 1000 fields: ~400 µs encode, ~1.8 ms decode
      • 10000 fields: ~6 ms encode, ~22 ms decode
      • 100000 fields: ~76 ms encode, ~240 ms decode
    • Multiple document streaming: ~0.5 µs per document
  5. Field Name Extraction

    • 100 fields: ~30 µs
    • 1000 fields: ~500 µs
