DataHub is developed based on the real-time data transmission system of Alibaba Cloud. This reliable and highly available service is one of the backbone services that ensure stability during the peak hours of the Double 11 shopping festival.
1.2 High throughput
You can write terabytes (TB) of data to a topic and up to hundreds of gigabytes (GB) of records to a shard every day.
1.3 Cost efficiency
You can use DataHub to process streaming data anytime and anywhere by adopting the pay-as-you-go billing method. This helps increase your cost efficiency.
1.4 Ecosystem integration
As one of the major services in the Apsara system, DataHub is deeply integrated with the Alibaba Cloud big data system and seamlessly connected with Alibaba Cloud services such as MaxCompute, Realtime Compute, and Hologres. You can exploit the entire big data system by using DataHub.
2.1 Data access
DataHub provides a variety of SDKs, APIs, and third-party plug-ins such as Flume and Logstash. You can use them to import data to DataHub with ease.
2.2 Data delivery
You can configure a DataConnector with a few simple steps to automatically synchronize data from DataHub to other Alibaba Cloud services such as MaxCompute, Object Storage Service (OSS), and Tablestore. This feature greatly reduces the energy consumption of the data link layer.
2.3 Data cache
DataHub supports flexible cache configurations. Downstream systems can consume data multiple times. The data redundancy mechanism ensures high data reliability.
2.4 Multiple connection methods
DataHub provides a web console for you to manage data. You can also use APIs and SDKs for interactions between programs. DataHub can meet your needs in various scenarios.