Skip to content

Wind-Gone/awesome-dbgiant-Industry-paper

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

10 Commits
 
 
 
 

Repository files navigation

Awesome-DBGiant-Industry-Paper 666

Awesome DBGiant Industry Paper visitor badge GitHub Repo stars GitHub Repo forks

Introduction

A curated paper list of awesome industry papers from various giant database vendors and other awesomeness, for database researchers/engineers.

Contributing

The repository is under construction. Welcome new PR, please conform to the committed rules:

paperName(with pdf link) [MeetingName Year] Github link if it has open-sourced code (optional)

Acknowledge

Thanks to all authors of the paper/repository I cite :D

Table of Content

Google

  1. Progressive Partitioning for Parallelized Query Execution in Google’s Napa [VLDB 23]
  2. Keep Your Distributed Data Warehouse Consistent at a Minimal Cost [SIGMOD 23]

Amazaon

  1. Amazon Redshift and the Case for Simpler Data Warehouses [SIGMOD 15]
  2. Amazon Redshift Re-invented [SIGMOD 22]
  3. Amazon DynamoDB: A Scalable, Predictably Performant, and Fully Managed NoSQL Database Service [OSDI 22]
  4. The Story of AWS Glue [VLDB 23]
  5. Auto-WLM: ML-enhanced workload management in Amazon Redshift [SIGMOD 23]
  6. Resource Management in Aurora Serverless [VLDB 24]

Tencent

  1. Angel-PTM: A Scalable and Economical Large-scale Pre-training System in Tencent [VLDB 23]
  2. EmbedX: A Versatile, Efficient and Scalable Platform to Embed Both Graphs and High-Dimensional Sparse Data [VLDB 23]
  3. Towards General and Efficient Online Tuning for Spark [VLDB 23]
  4. TDSQL: Tencent Distributed Database System [VLDB 24]

Alibaba

  1. Eigen: End-to-end Resource Optimization for Large-Scale Databases on the Cloud [VLDB 23]
  2. Anser: Adaptive Information Sharing Framework of AnalyticDB [VLDB 23]
  3. Lindorm TSDB: A Cloud-native Time-series Database for Large-scale Monitoring Systems [VLDB 23]
  4. Vineyard: Optimizing Data Sharing in Data-Intensive Analytics [SIGMOD 23]
  5. Flux: Decoupled Auto-Scaling for Heterogeneous Query Workload in Alibaba AnalyticDB [SIGMOD 24]

OceanBase

  1. OceanBase Paetica: A Hybrid Shared-nothing/Shared-everything Database for Supporting Single Machine and Distributed Cluster [VLDB 23]

PolarDB

  1. PolarDB-SCC: A Cloud-Native Database Ensuring Low Latency for Strongly Consistent Reads [VLDB 23]
  2. PolarDB-IMCI:A Cloud-Native HTAP Database System at Alibaba [SIGMOD 23]
  3. PolarDB-MP: A Multi-Primary Cloud-Native Database via Disaggregated Shared Memory [SIGMOD 24]

Oracle

  1. Automatic SQL Error Mitigation in Oracle [VLDB 23]
  2. Grouping, Subsumption, and Disjunctive Join Optimizations in Oracle [VLDB 24]

Bytedance

  1. ByteHTAP: ByteDance’s HTAP System with High Data Freshness and Strong Data Consistency [VLDB 22]
  2. Krypton: Real-time Serving and Analytical SQL Engine at ByteDance [VLDB 23]
  3. VeDB: A Software and Hardware Enabled Trusted Relational Database [SIGMOD 23]
  4. LavaStore: ByteDance's Purpose-built, High-performance, Cost-effective Local Storage Engine for Cloud Services [VLDB 24]

Huawei

  1. Taurus MM: bringing multi-master to the cloud [VLDB 23]
  2. GaussDB: A Cloud-Native Multi-Primary Database with Compute-Memory-Storage Disaggregation [VLDB 24]

Microsoft

  1. POLARIS: The Distributed SQL Engine in Azure Synapse [VLDB 20]
  2. Microsoft Purview: A System for Central Governance of Data [VLDB 23]
  3. OneProvenance: Efficient Extraction of Dynamic Coarse-Grained Provenance From Database Query Event Logs [VLDB 23]
  4. Towards Building Autonomous Data Services on Azure [SIGMOD 23]

Intel

  1. Big Data Analytic Toolkit: A general-purpose, modular, and heterogeneous acceleration toolkit for data analytical engines [VLDB 23]

Meta

  1. Presto: A Decade of SQL Analytics at Meta [SIGMOD 23]
  2. Disaggregating RocksDB: A Production Experience [SIGMOD 23]

Snowflake

  1. The Snowflake Elastic Data Warehouse [SIGMOD 16]
  2. Building An Elastic Query Engine on Disaggregated Storage [OSDI 20]
  3. What’s the difference? Incremental processing with change queries in Snowflake [SIGMOD 23]

Databrics

  1. Photon: A Fast Query Engine for Lakehouse Systems [SIGMOD 22]

SingleStore

  1. SingleStore-V: An Integrated Vector Database System in SingleStore [VLDB 24]

ClickHouse

  1. ClickHouse - Lightning Fast Analytics for Everyone [VLDB 24]

Star History

Star History Chart

About

No description, website, or topics provided.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published