DataWorks is an important platform as a service (PaaS) of Alibaba Cloud. It offers all-around services, including Data Integration, DataStudio, Data Map, Data Quality, and DataService Studio. In addition, it provides a one-stop data development and management console to help enterprises mine and explore data value.
DataWorks supports multiple compute and storage engines, including MaxCompute, E-MapReduce, Realtime Compute for Apache Flink, Machine Learning Platform for AI, Graph Compute, and Hologres. It also allows you to use custom computing and storage services. As an all-in-one platform, DataWorks provides end-to-end big data services, artificial intelligence (AI) development, and data governance.
DataWorks simplifies data transmission, conversion, and integration. You can import data from different data stores, convert, analyze, and process the data, and then transmit the data to other data systems.
DataWorks provides powerful scheduling capabilities. For more information, see Schedule.
DataWorks provides a graphical user interface (GUI) for you to develop code and design workflows. You can perform simple drag-and-drop operations to create complex data analytics nodes without the need to use development tools.
A browser with Internet access enables you to develop code anytime, anywhere.
Operation Center provides a visualized node monitoring and management tool and displays the overall node running status in DAGs.
You can configure various alert notification methods to promptly notify relevant staff when a node error occurs. This ensures normal business operation.
DataWorks + Data Integration + AnalyticDB for MySQL + Quick BI + MaxCompute
DataWorks + Data Integration + Quick BI + MaxCompute
Related service: Data Security Guard of DataWorks
In this section, you will learn how to perform data quality monitoring. This section will mainly go over how you can monitor the data quality in the process of using the data workshop, set up quality monitoring rules, monitor alerts and tables.
This article describes the intermediate-to-advanced features of DataWorks Advanced Edition and introduces the features and applicable scenarios for each feature of DataWorks Basic Edition, Standard Edition, Professional Edition, and Enterprise Edition. It helps you select the most suitable DataWorks edition to solve your problems.
Independently developed by Alibaba, DataWorks is used to build and administer 99% of the data-driven and data-focused business operations of Alibaba Group by tens of thousands of data and algorithm development engineers every day.
Initially released in 2010, DataWorks has undergone many technological changes and architecture upgrades up to what is the current version, unfortunately resulting in a great deal of historical baggage. Technological innovation and business development often work well together and complement each other, but they can also restrict each other and cause various problems. The latter is the case with DataWorks. The big data product has some long-standing problems, of which include slow access, extensive code changes required to fix a single bug, and environmental complexity. Problematically, previous iterations have not fundamentally upgraded DataWorks and resolved all of these problems. Rather, they have only improved performance, optimized the underlying engineering structures, and reduced repeated code.
This article will take a look at how we can resolve some of the problems that have plagued DataWorks by adopting the wildly popular microservice architecture and explore how we can transform the technical architecture of DataWorks in a practical manner while avoiding jumping through several complicated engineering hoops.
DataWorks is a Big Data platform product launched by Alibaba Cloud. It provides one-stop Big Data development, data permission management, offline job scheduling, and other features.
DataWorks works straight ‘out-the-box’ without the need to worry about complex underlying cluster establishment and Operations & Management.
Alibaba Clouder - February 11, 2021
Alibaba Clouder - January 6, 2021
Alibaba Clouder - February 11, 2021
Alibaba Clouder - June 23, 2021
Alibaba EMR - April 27, 2021
Alibaba Cloud MaxCompute - March 2, 2020
A secure environment for offline data development, with powerful Open APIs, to create an ecosystem for redevelopment.Learn More
Alibaba Cloud DNS PrivateZone is a Virtual Private Cloud-based (VPC) domain name system (DNS) service for Alibaba Cloud users.Learn More
Alibaba Mail is one of the only email service providers in the industry that supports public cloud services and provides fast, secure, and stable services.Learn More
More Posts by Alibaba Clouder