What is PolarDB - PolarDB - Alibaba Cloud Documentation Center

PolarDB is a cloud-native relational database by Alibaba Cloud. Built on a storage-compute decoupled architecture, it combines software and hardware optimization to deliver high performance, high availability, security, mass storage, and second-level elasticity. PolarDB is 100% compatible with MySQL and PostgreSQL, highly compatible with Oracle syntax, and supports centralized and distributed deployments. Compared to self-managed databases, PolarDB delivers up to 6x transaction performance and 400x analytical performance at 50% of the total cost of ownership (TCO).

Choose the PolarDB engine that fits your application — each maintains full ecosystem compatibility.

PolarDB database	Ecosystem compatibility	Service architecture	Product form
PolarDB for MySQL	100% compatible with MySQL	Shared storage, compute-storage decoupled	Public cloud, Apsara Stack Enterprise Edition, DBStack
PolarDB for PostgreSQL	100% compatible with PostgreSQL, highly compatible with Oracle syntax	Shared storage, compute-storage decoupled
PolarDB Distributed Edition	Centralized (Standard Edition): 100% compatible with MySQL Distributed (Enterprise Edition): Highly compatible with MySQL	Shared-nothing, integrated centralized and distributed architecture

Benefits

Ecosystem compatibility

100% compatible with MySQL and PostgreSQL ecosystems.
Highly compatible with Oracle syntax with end-to-end migration solutions. The service remains available during migration, as proven by over 500 customers.
PolarDB integrates with existing systems for smooth upgrades in both centralized and distributed deployments.

High performance

Up to 6x the transactional processing (TP) performance of open-source databases.
Up to 400x online analytical processing (OLAP) performance, powered by parallel query and In-Memory Column Index (IMCI).

High availability

Single-zone, dual-zone, three-zone (RPO = 0), and cross-region multi-level high availability configurations.
Up to 99.995% SLA.

Mass storage

PolarDB for MySQL/PolarDB for PostgreSQL: Supports up to 500 TB of storage.
PolarDB Distributed Edition supports petabyte-scale storage.

Easy to scale

The intelligent proxy supports multiple read consistency levels.
Serverless dynamic and elastic scaling, and distributed linear scaling.
Low-latency physical replication for efficient, stable inter-node data transfer.

Security

Access control: Resource Access Management (RAM) users, IP address whitelists, security groups, and virtual private clouds (VPCs).
Data security: Transparent data encryption (TDE), backup and recovery, and flashback query.
Transmission security: Secure Sockets Layer (SSL) encryption.

Service architecture

PolarDB for MySQL

PolarDB for MySQL uses a cloud-native, storage-compute decoupled architecture that combines commercial-grade stability, performance, and scalability with open-source simplicity and rapid iteration.

PolarDB for MySQL architecture

Database proxy (Proxy)

A network proxy between applications and the database. Provides security authentication, automatic read/write splitting, load balancing, consistency levels, connection pools, persistent connections, and overload protection.
Database compute nodes
- PolarDB uses a multi-node cluster architecture. A Cluster Edition cluster contains one read/write node and multiple read-only nodes. A Multi-master Cluster contains multiple read/write nodes and multiple read-only nodes.
- Read/write nodes and read-only nodes use active-active failover to provide high database availability.
- Compute nodes provide the SQL engine feature and are available in General-purpose and Dedicated specifications.
Shared distributed storage

Multiple compute nodes share a single data copy, reducing storage costs. Built on a distributed block storage and file system, capacity scales online to hundreds of terabytes.

PolarDB for PostgreSQL

PolarDB for PostgreSQL clusters support centralized and distributed deployments:

Centralized

Built on a cloud-native, storage-compute decoupled architecture, it combines commercial-grade stability and performance with open-source simplicity and rapid iteration.
Distributed

A distributed database built on centralized PolarDB for PostgreSQL clusters. Uses a two-layer CN/DN architecture for distributed scalability with storage-compute decoupling, while retaining all centralized cluster features.

Centralized (Enterprise and Standard Editions)

PolarDB for PostgreSQL centralized architecture

Database proxy (Proxy)

A network proxy between applications and the database. Provides security authentication, automatic read/write splitting, load balancing, consistency levels, connection pools, persistent connections, and overload protection.
Database compute nodes
- PolarDB uses a multi-node cluster architecture. A Cluster Edition cluster contains one read/write node and multiple read-only nodes.
- Read/write nodes and read-only nodes use active-active failover to provide high database availability.
- Compute nodes provide the SQL engine feature and are available in General-purpose and Dedicated specifications.
Shared distributed storage

Multiple compute nodes share a single data copy, reducing storage costs. Built on a distributed block storage and file system, capacity scales online to hundreds of terabytes.

Distributed

PolarDB for PostgreSQL distributed architecture

Database nodes
- Composed of compute nodes (CNs) and data nodes (DNs). CNs are compute nodes responsible for cluster metadata management and distributed plan generation. DNs are data nodes responsible for storing actual data shards.
- Each CN and DN uses a centralized architecture with storage-compute decoupling. It supports a database proxy and a one-primary-multiple-read-only mode. It provides high availability failover for compute nodes within a CN or DN and supports adding read-only (RO) nodes to scale the read capability of a single CN or DN.
Distributed features
- Supports manual sharding and horizontal scaling for business scenarios at the petabyte level.
- Ensures consistency for distributed transactions.
- Supports dual-zone deployment. The primary zone is the working cluster, and the secondary zone is the Hot Standby Cluster.
- Provides 24/7 non-disruptive upgrade, downgrade, and maintenance capabilities. You can also add heterogeneous CN and DN nodes to the cluster.

PolarDB Distributed Edition

In a PolarDB for X-Engine cluster, storage nodes (DNs) start as a centralized deployment 100% compatible with MySQL 5.7 and 8.0. When you need distributed scaling, upgrade in place — distributed components connect to existing storage nodes without data migration or application changes.

Centralized (Standard Edition)

PolarDB Distributed Edition centralized architecture

Storage node (DN)

Handles data persistence using the Paxos consensus protocol for high reliability and strong consistency. The self-developed Lizard distributed transaction engine system provides more reliable high availability and ~35% better performance than the native MySQL distributed engine.
Multi-replica data architecture

The Paxos consensus protocol ensures strong consistency (RPO = 0) between replicas — each write requires majority confirmation. If one node fails, the cluster continues operating. Replica roles:
- Leader
  
  Processes client requests, makes decisions, and maintains logs for data consistency and recoverability.
- Follower
  
  Executes instructions from the Leader. Can be elected as the new Leader if the current Leader fails.
- Logger
  
  Provides Paxos protocol services only (no data services). Participates in Leader elections and may temporarily assume the Leader role until Followers catch up, then yields.
- Learner
  
  Passively receives system status without voting or decision-making rights.

Distributed (Enterprise Edition)

PolarDB Distributed Edition distributed architecture

Global meta service (GMS)

Maintains globally consistent metadata (tables, schemas, statistics), security information (accounts, permissions), and the global timestamp service (TSO).
Compute node (CN)

Stateless entry point with SQL parser, optimizer, and executor. Handles distributed routing, computation, 2PC coordination, distributed DDL, and global index maintenance.
Storage node (DN)

Persists row store data using Paxos for strong consistency and MVCC for distributed transaction visibility. Supports computation pushdown (Project, Filter, Join, Aggregation).
Column store node (Columnar)

Persists columnstore indexes on OSS, consuming distributed transaction binary logs in real time. Combined with compute nodes, provides snapshot-consistent column store queries.
Change data capture (CDC) node

Provides incremental subscriptions compatible with MySQL binary log format and primary/secondary replication compatible with MySQL replication protocol.

Video introduction

How to use PolarDB

Manage PolarDB clusters, databases, and accounts through:

Console: A graphical web interface for easy operations.
API: Programmatic access to all console operations.
SDK: Language-specific wrappers for all console operations.
CLI: Command-line access to all console operations.

Quick start

PolarDB database	Billing	Operation Guides	Whitepapers
PolarDB for MySQL	Billing	User Guide	Performance Whitepaper
PolarDB for PostgreSQL	Billing	O&M Guide Kernel features Extension Guide Spatiotemporal engine (GanosBase)	Performance Whitepaper
PolarDB for PostgreSQL (Compatible with Oracle)	Billing	O&M Guide Kernel features Extension Guide Spatiotemporal engine (GanosBase)	Performance Whitepaper Technical Whitepaper
PolarDB Distributed Edition	Billing overview	User Guide	Performance Whitepaper Technical Whitepaper