Wind-Gone/awesome-dbgiant-Industry-paper

Awesome-DBGiant-Industry-Paper 666

A curated list of papers on awesome industry databases, frameworks, resources, tools, and other awesomeness for data engineers.

New PRs are welcome. Please follow the entry format: paperName (with link) [MeetingName Year]

If the paper has open-source code, please supply its GitHub link in the Meeting field.
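
As an illustration of the entry format above, here is a minimal sketch of a helper that renders one entry. The function name `format_entry`, its parameters, the Markdown link syntax, and the choice to attach the code link to the venue tag are all assumptions made for this example; the list itself does not prescribe them.

```python
from typing import Optional


def format_entry(title: str, paper_url: str, venue: str, year: int,
                 code_url: Optional[str] = None) -> str:
    """Render one entry as: [title](paper_url) [venue YY]."""
    # Two-digit year, matching tags such as "VLDB 23" in the list.
    tag = f"{venue} {year % 100:02d}"
    # Assumption: when open-source code exists, the venue tag links to the repository.
    venue_part = f"[[{tag}]]({code_url})" if code_url else f"[{tag}]"
    return f"[{title}]({paper_url}) {venue_part}"


if __name__ == "__main__":
    # Placeholder URL, used only for illustration.
    print(format_entry("Photon: A Fast Query Engine for Lakehouse Systems",
                       "https://example.com/photon.pdf", "SIGMOD", 2022))
```

Keeping the venue tag in one fixed shape makes entries easy to scan and to check automatically in a PR.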

Google

  1. Progressive Partitioning for Parallelized Query Execution in Google’s Napa [VLDB 23]
  2. Keep Your Distributed Data Warehouse Consistent at a Minimal Cost [SIGMOD 23]

Amazon

  1. Amazon Redshift and the Case for Simpler Data Warehouses [SIGMOD 15]
  2. Amazon Redshift Re-invented [SIGMOD 22]
  3. The Story of AWS Glue [VLDB 23]
  4. Auto-WLM: ML-enhanced workload management in Amazon Redshift [SIGMOD 23]
  5. Amazon DynamoDB: A Scalable, Predictably Performant, and Fully Managed NoSQL Database Service [USENIX ATC 22]

Tencent

  1. Angel-PTM: A Scalable and Economical Large-scale Pre-training System in Tencent [VLDB 23]
  2. EmbedX: A Versatile, Efficient and Scalable Platform to Embed Both Graphs and High-Dimensional Sparse Data [VLDB 23]
  3. Towards General and Efficient Online Tuning for Spark [VLDB 23]

Alibaba

  1. Eigen: End-to-end Resource Optimization for Large-Scale Databases on the Cloud [VLDB 23]
  2. Anser: Adaptive Information Sharing Framework of AnalyticDB [VLDB 23]
  3. Lindorm TSDB: A Cloud-native Time-series Database for Large-scale Monitoring Systems [VLDB 23]
  4. Vineyard: Optimizing Data Sharing in Data-Intensive Analytics [SIGMOD 23]

OceanBase

  1. OceanBase Paetica: A Hybrid Shared-nothing/Shared-everything Database for Supporting Single Machine and Distributed Cluster [VLDB 23]

PolarDB

  1. PolarDB-SCC: A Cloud-Native Database Ensuring Low Latency for Strongly Consistent Reads [VLDB 23]
  2. PolarDB-IMCI: A Cloud-Native HTAP Database System at Alibaba [SIGMOD 23]

Oracle

  1. Automatic SQL Error Mitigation in Oracle [VLDB 23]

ByteDance

  1. ByteHTAP: ByteDance’s HTAP System with High Data Freshness and Strong Data Consistency [VLDB 22]
  2. Krypton: Real-time Serving and Analytical SQL Engine at ByteDance [VLDB 23]
  3. VeDB: A Software and Hardware Enabled Trusted Relational Database [SIGMOD 23]

Huawei

  1. Taurus MM: bringing multi-master to the cloud [VLDB 23]

Microsoft

  1. POLARIS: The Distributed SQL Engine in Azure Synapse [VLDB 20]
  2. Microsoft Purview: A System for Central Governance of Data [VLDB 23]
  3. OneProvenance: Efficient Extraction of Dynamic Coarse-Grained Provenance From Database Query Event Logs [VLDB 23]
  4. Towards Building Autonomous Data Services on Azure [SIGMOD 23]

Intel

  1. Big Data Analytic Toolkit: A general-purpose, modular, and heterogeneous acceleration toolkit for data analytical engines [VLDB 23]

Meta

  1. Presto: A Decade of SQL Analytics at Meta [SIGMOD 23]
  2. Disaggregating RocksDB: A Production Experience [SIGMOD 23]

Snowflake

  1. The Snowflake Elastic Data Warehouse [SIGMOD 16]
  2. Building An Elastic Query Engine on Disaggregated Storage [NSDI 20]
  3. What’s the difference? Incremental processing with change queries in Snowflake [SIGMOD 23]

Databricks

  1. Photon: A Fast Query Engine for Lakehouse Systems [SIGMOD 22]
