Website | Documentation | Official Repository | Blog | YouTube | Discord | Telegram | LinkedIn | X
YDB is an open source Distributed SQL Database that combines high availability and scalability with strict consistency and ACID transactions.
YDB was designed from scratch to respond to the growing demand for scalable interactive web services. Scalability, strict consistency, and effective cross-row transactions were a must for such an OLTP-like workload. YDB was built by people with strong backgrounds in databases and distributed systems who have experience developing a NoSQL database and the MapReduce system for one of the largest search engines in the world.
Basic YDB features:
- Both row-oriented and column-oriented tables for transactional and analytical workloads. Also, persistent queues (topics) for moving data around.
- Fault-tolerant configuration that survives disk, node, rack, or even datacenter outages.
- Automatic disaster recovery with minimum latency disruptions for applications.
- Independent horizontal scalability of storage and compute layers.
- ACID transactions across multiple nodes and tables with strict consistency.
- Rich SQL dialect (YQL) for data manipulation and schema definition.
- PostgreSQL-compatible mode for table operations and Kafka-compatible mode for topics.
- YDB clusters can be deployed with Ansible, Kubernetes, or manually.
YDB can be deployed in three availability zones (datacenters). A cluster remains available for both reads and writes during a complete outage of a single zone. Availability zones and regions are covered in more detail in documentation.
Unlike traditional relational databases, YDB scales out, providing developers with the capability to simply extend clusters with computation or storage resources to handle increasing load. YDB has disaggregated storage and compute layers, which allow you to scale storage and compute resources independently.
Current production installations have over 10000 nodes, store petabytes of data, and handle millions of distributed transactions per second.
YDB's built-in automatic recovery support allows it to seamlessly survive hardware failures. After unpredictable disk, node, rack, or even datacenter failure, YDB remains fully available for reads and writes and automatically restores required data redundancy.
YDB supports multitenant and serverless setups. A user can run a YDB cluster and create several databases that share one pool of storage and have different compute nodes. Alternatively, a user can run several serverless databases that share one pool of compute resources to utilize them effectively.
YDB runs on x86 64-bit platforms with at least 8 GB of RAM.
In most production environments, YDB runs on 64-bit x86 machines working under Ubuntu Linux.
For development purposes, it is regularly tested that YDB can be compiled and run under the latest versions of MacOS and Microsoft Windows.
If you want to experiment with YDB, start with the Quick Start guide. It will yield a single-node cluster suitable for functional testing, app development, and similar tasks.
Suppose you want to jump into more serious scenarios like testing YDB fault tolerance, running performance benchmarks, or even running production or preproduction workloads. In that case, you'll need a full-fledged multi-node YDB cluster that can be deployed with either Ansible for bare metal or virtual machines or Kubernetes for containers.
Instructions on how to build YDB server (ydbd) and client (ydb) binaries are provided in BUILD.md. Also, see documentation on Ya Make build system.
We are glad to welcome new contributors! The contributor's guide provides more details on how to get started as a contributor.
There's also a separate section of YDB documentation for contributors, mostly with more technical content.
Visit YDB website for the latest success stories and user scenarios.