The DataAnalysis module of DataWorks allows you to analyze, edit, and share data online. This topic provides an overview of DataAnalysis, including its features, benefits, and permissions.
Compared with offline data analysis, DataAnalysis provides the following benefits:
High capacity: DataAnalysis uses compute engines to analyze large amounts of data efficiently.
Data mobility: DataAnalysis allows you to obtain data from different data sources for query and analysis and share analysis results with specified members.
High security: DataAnalysis allows you to analyze data online without the need to download data to your on-premises machine. DataAnalysis also allows you to manage the permissions to analyze and share data.
The SQL query feature allows you to write SQL statements to quickly query and analyze data in data sources on which you have query permissions. You can also add some self-managed data sources and the compute engines that are associated with your workspace to the SQL query feature. The added data sources and compute engines can be used as datasets for DataAnalysis. You can preview data in the datasets and generate SQL statements for data tables. For more information, see SQL query.
The workbook feature is the core feature of DataAnalysis. The feature provides a personal space in which you can create and configure a workbook to obtain, explore, and analyze data. The feature also provides common workbook functionalities that can help you get started with workbooks. You can import data from data sources or your on-premises machine to a workbook and perform data profiling. You can also share a workbook with a specified member to allow the member to view data in the workbook. For more information, see Create and manage a workbook.
The dimension table feature allows you to upload data from your on-premises machine to a MaxCompute table and use the MaxCompute table together with other tables to analyze data online. For more information, see Create and manage a MaxCompute table.
In some cases, data analysts and operations engineers need to manually maintain an online MaxCompute table. If the dimension table feature is not used to create the MaxCompute table, the following operations must be performed each time the data changes:
Operations engineers describe their requirements to data analysts.
Data analysts submit the requirements to developers.
Developers edit SQL statements by using DataWorks, create an Excel table, synchronize data to the Excel table, and then deliver the Excel table to operations engineers.
If the dimension table feature is used, operations engineers only need to create, modify, and save a MaxCompute table in DataAnalysis. This removes the handoffs between operations engineers, data analysts, and developers, and improves efficiency.
Since June 1, 2022, only users who are assigned the Data Analyst role can use DataAnalysis. To use DataAnalysis, contact a user who is assigned the Workspace Manager role and ask that user to assign the Data Analyst role to you. For more information, see Manage permissions on workspace-level services.
Users who are assigned the Visitor role or a custom role but were not assigned the Data Analyst role before June 1, 2022 can no longer use DataAnalysis after June 1, 2022.
Users who are assigned a role other than the Visitor role or a custom role are automatically granted permissions on DataAnalysis and can continue to use DataAnalysis.
For more information about users, roles, and permissions, see Permissions of workspace-level roles.
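The access rules above can be read as a small decision function. The following Python sketch is illustrative only and is not a DataWorks API. The WorkspaceMember class, its fields, and the can_use_dataanalysis function are hypothetical names introduced for this example, and the sketch assumes that Data Analyst is one of the built-in workspace-level roles.

```python
from dataclasses import dataclass, field

# Role names come from the text above; the data model itself is hypothetical.
DATA_ANALYST = "Data Analyst"
VISITOR = "Visitor"


@dataclass(frozen=True)
class WorkspaceMember:
    """Hypothetical model of a workspace member and the roles assigned to them."""
    name: str
    builtin_roles: frozenset = field(default_factory=frozenset)
    custom_roles: frozenset = field(default_factory=frozenset)


def can_use_dataanalysis(member: WorkspaceMember) -> bool:
    """Return True if the member may use DataAnalysis under the rules described above."""
    # The Data Analyst role always grants access.
    if DATA_ANALYST in member.builtin_roles:
        return True
    # Any built-in role other than Visitor is automatically granted access.
    # Custom roles are never consulted: they do not grant access by themselves.
    return any(role != VISITOR for role in member.builtin_roles)


if __name__ == "__main__":
    members = [
        WorkspaceMember("analyst", frozenset({DATA_ANALYST})),
        WorkspaceMember("visitor", frozenset({VISITOR})),
        WorkspaceMember("custom_only", custom_roles=frozenset({"my-custom-role"})),
        WorkspaceMember("manager", frozenset({"Workspace Manager"})),
    ]
    for m in members:
        print(m.name, can_use_dataanalysis(m))
```

In this model, a member who holds only the Visitor role or only custom roles is denied access until the Data Analyst role is assigned, which matches the behavior described for June 1, 2022 onward.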
Go to the DataAnalysis page
Log on to the DataWorks console. In the left-side navigation pane, choose . On the page that appears, select the desired workspace from the drop-down list and click Go to SQL Query.
By default, the Homepage tab is displayed. In the Shortcuts section of this tab, you can click SQL Query or Spreadsheet to start the corresponding data analysis operation. In the Recently Edited section of this tab, you can click the name of a file to open it.