Benefits
Stability: This service evolved from Alibaba's internal real-time data transfer system and has proven its stability and reliability by supporting the annual Double 11 event.
High throughput: Supports terabytes of data writes per day for a single topic. Each shard supports hundreds of gigabytes of data writes per day.
Low cost: The service is available on demand with a pay-as-you-go model, ensuring that you only pay for the resources you use.
Ecosystem integration: Built on the Apsara distributed system, this service is deeply integrated with the Alibaba Cloud big data ecosystem. It seamlessly connects with products such as MaxCompute, Real-time Compute, and Interactive Analytics to create a unified data architecture.
Features
Data ingestion: You can use multiple SDKs, APIs, and third-party plugins, such as Flume and Logstash, to efficiently ingest data into DataHub.
Data shipping: The DataConnector module lets you sync ingested data in real time to downstream storage and analysis systems, such as MaxCompute, OSS, and TableStore. This process requires minimal configuration and reduces the workload on your data links.
Data caching: Flexible cache durations allow downstream systems to re-consume data, and automatic multiple backups ensure high data reliability.
Multiple interfaces: You can use the web console for direct interaction or APIs and SDKs for programmatic interaction. These options are available to meet a variety of needs.