Alibaba Cloud Data Lake Analytics (DLA) is a serverless, high performance interactive query service that requires zero infrastructure setup or upfront cost. Customers can use DLA to query data stored in a number of Alibaba Cloud services, such as objects in OSS, key-value pairs in Table Store, and relational data in RDS. Customers can explore a single dataset at a time or analyze data across multiple sources simultaneously, and DLA delivers fast, interactive responses with a Massive Parallel Processing (MPP) architecture. DLA is fully compatible with SQL syntax which customers are already familiar with and therefore have an extremely low learning curve. In addition, DLA works with popular BI tools natively so that customers can easily visualize their data, derive insights, and accelerate decision-making.
Fig.1: Data Lake Analytics in Alibaba Cloud
DLA is designed for those who need cloud-native analytics and are seeking a cost-effective solution that can turn raw data into real-time insights. Compared with conventional analytics platforms which require upfront hardware provisioning and setup, DLA allows customers to run queries on demand with no upfront costs and quickly explore ideas on new data while only paying for the resources consumed. Designed for developers, business analysts, and data scientist where a self-service operational model is highly preferred, DLA provides built-in ETL (Extract, Transform and Load) functionality that reduces the heavy lifting of pre-processing data of different formats and origins so customers can focus on analysis and insights.
Serverless: For enterprise users, the service is made available on a Pay-As-You-Go basis with ZERO maintenance effort. Key benefits include but not limited to instant startup, transparent upgrade, and elastic QoS.
Database-like User Experience: DLA offers standard SQL interface, with outstanding SQL compatibility and comprehensive built-in functions. Based on JDBC/ODBC connectivity, users are granted with fast and convenient service access, as well as low migration cost. Meanwhile, integration with BI product enables DLA to turn big data into consumable insights and visualizations. With database experience, DLA helps customers accelerate their cloud migration process.
Deep Ecosystem Integration: DLA enables complex analytics for data coming from different sources with various formats. Not only user can leverage DLA to analyze data stored on Alibaba cloud OSS and Table Store respectively, but also, they can join the results and thus generate new insights.
Optimized Performance: DLA fully leverages Massive Parallel Processing (MPP) architecture, vectorized execution optimization, multi-tenancy resource allocation and priority scheduling to achieve optimized performance.
For raw data (logs, CSV, JSON, Avro, etc.) persisted on a given storage, OSS for example, you can query specified objects/files/folders without having to go thru a complex ETL process, and get the results back instantly
Fig.2: Query OSS Data
When dealing with multiple data sources, OSS and Table Store for example, DLA enables JOIN operation across heterogeneous data sources, turning your big data into consolidated insights.
Fig.3: Joint Analytics Across OSS and Table Store
Data Lake Analytics is now available in the Singapore, China (Hangzhou, Shanghai, Beijing), and the UK regions, and would be coming soon to US (West) and Australia regions. To get started on Data Lake Analytics, watch the DLA webinar here or visit the product page here.
Check out the latest offers for Data Lake Analytics at www.alibabacloud.com/campaign/data-lake-analytics-2018
Alibaba Clouder - October 18, 2019
Alibaba Clouder - November 9, 2018
Alibaba Clouder - November 12, 2018
ApsaraDB - March 3, 2020
Alibaba Clouder - November 15, 2018
Alibaba EMR - December 2, 2019
A premium, serverless, and interactive analytics serviceLearn More
SDDP automatically discovers sensitive data in a large amount of user-authorized data, and detects, records, and analyzes sensitive data consumption activities.Learn More
Realtime Compute offers a highly integrated platform for real-time data processing, which optimizes the computing of Apache Flink.Learn More
Secure and easy solutions for moving you workloads to the cloudLearn More
More Posts by Alibaba Clouder