The real-time analysis feature consists of high-speed data import API and real-time data source integration. This feature directly writes data to compute nodes and uses AnalyticDB for PostgreSQL to consume Kafka data. This prevents high workloads on coordinator nodes when you write data by using the COPY or INSERT statement.
Feature description
High-speed data import API
gRPC is an open source high-performance Remote Procedure Call (RPC) framework that was developed by Google. High-speed data import API uses the gRPC protocol as the client API and directly writes data to compute nodes to achieve highly efficient data transmission. You can use the API to manage data writes and updates in a flexible manner. For more information, see High-speed data import API.
Real-time data source integration
Real-time data source integration uses AnalyticDB for PostgreSQL to consume data from Kafka topics in real time. This helps you analyze data that is generated from batch processing and stream processing. For more information, see Real-time data source integration.
Scenarios
You want to build real-time analysis reports and real-time analysis applications.
You want to build real-time data warehouses that integrate batch processing and stream processing in combination with incremental real-time materialized views.
If the UPSERT or UPDATE statement fails to provide high write performance, you can use the high-speed data import API feature to achieve high-throughput UPSERT performance.
You want to import data from Kafka topics to AnalyticDB for PostgreSQL without using other data integration tools.
Benefits
Kafka data can be directly consumed to reduce dependencies on real-time processing components.
Data is directly written to compute nodes without passing through coordinator nodes to significantly improve write throughput.
The high-speed data import API feature provides higher performance than the UPSERT or UPDATE statement.
Data writes do not result in workloads on coordinator nodes.