This topic describes how to create and run Spark applications in the AnalyticDB for MySQL console.
You can use the Spark editor to create and run Spark batch or streaming applications.
You can view the driver log and submission details of the current Spark application.
You can view the execution logs of SQL statements.
Prerequisites
An AnalyticDB for MySQL Data Lakehouse Edition (V3.0) cluster is created. For more information, see Create a cluster.
A job resource group that has at least 8 AnalyticDB compute units (ACUs) of reserved computing resources is created. For more information, see Create a resource group.
A Resource Access Management (RAM) user is granted the required permissions. For more information, see the "Grant permissions to a RAM user" section of the Manage RAM users and permissions topic.
A database account of the AnalyticDB for MySQL cluster is associated with the RAM user. For more information, see Associate or disassociate a database account with or from a RAM user.
The AliyunADBSparkProcessingDataRole role is created and AnalyticDB for MySQL is authorized to access other cloud resources. For more information, see Perform authorization within an Alibaba Cloud account.
Create and run a Spark application
- Log on to the AnalyticDB for MySQL console.
- In the upper-left corner of the page, select a region.
- In the left-side navigation pane, click Clusters.
- On the Data Lakehouse Edition (V3.0) tab, find the cluster that you want to manage and click the cluster ID.
- In the left-side navigation pane, go to the Spark JAR Development page.
- On the Spark JAR Development page, click the icon to the right of Applications.
- In the Create Application Template dialog box, configure the following parameters:
  - Name: the name of the application or directory. File names are case-insensitive.
  - Type: the format of the template.
    - Application: the template is in the file format.
    - Directory: the template is in the folder format.
  - Parent directory: the parent directory of the file or folder.
  - Application type:
    - Batch: batch application.
    - Streaming: streaming application.
After you create a Spark template, configure a Spark application in the Spark editor. For more information, see Overview of Spark application development.
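As a rough illustration of what a Spark application configuration might contain, the sketch below builds a batch application description as a Python dictionary and prints it as JSON. The field names (`name`, `file`, `className`, `args`, `conf`), the helper `build_batch_app`, and every sample value are assumptions chosen for illustration; they are not taken from this topic. See Overview of Spark application development for the authoritative format.

```python
import json


def build_batch_app(name, jar_path, main_class, args=None, conf=None):
    """Return a dict describing a hypothetical Spark batch application.

    The key names are assumptions for illustration only.
    """
    return {
        "name": name,               # display name of the application
        "file": jar_path,           # location of the application JAR
        "className": main_class,    # entry-point class inside the JAR
        "args": list(args or []),   # arguments passed to the entry point
        "conf": dict(conf or {}),   # Spark configuration properties
    }


app_config = build_batch_app(
    name="SparkPi",
    jar_path="oss://<your-bucket>/spark-examples.jar",  # placeholder path
    main_class="org.apache.spark.examples.SparkPi",
    args=["1000"],
    conf={"spark.executor.instances": 2},
)

# Serialize to JSON text that could be pasted into an editor.
app_json = json.dumps(app_config, indent=2)
print(app_json)
```

Building the configuration in code and serializing it keeps the quoting and nesting valid, which is easy to get wrong when the JSON is edited by hand.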
After you configure the Spark application, perform the following operations:
- Click Save to save the Spark application. Then, you can reuse the application.
- Click Run Now to run the Spark application. The status of the Spark application is displayed on the Applications tab in real time.
Note: Before you run a Spark application, you must select a job resource group and an application type.
View information about a Spark application
On the Applications tab, search for an application by application ID and perform the following operations to view information about the Spark application:
- Click Log in the Actions column to view the driver log of the current Spark application or the execution log of SQL statements.
- Click UI in the Actions column to go to the corresponding Spark UI. Access to the UI is valid only for a limited period. After the validity period ends, you must open the UI again.
- Click Details in the Actions column to view submission details of the current application, such as the log path, web UI URL, cluster ID, and resource group name.
- Choose More > Stop in the Actions column to stop the current application.
- Choose More > History in the Actions column to view the history of retry attempts on the current application.
On the Execution History tab, view the history of retry attempts on all applications.
Note: By default, no retry is performed after an application fails. To perform retry attempts, configure the spark.adb.maxAttempts and spark.adb.attemptFailuresValidityInterval parameters. For more information, see Configuration parameters of Spark applications.
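The following sketch shows one way the two retry parameters could be merged into a set of Spark configuration properties. The helper `with_retries` is hypothetical, and the values `3` and `6000` are placeholders: this topic does not state the accepted value range or the unit of the validity interval, so confirm both in Configuration parameters of Spark applications before you use them.

```python
def with_retries(conf, max_attempts, validity_interval):
    """Return a copy of conf with the two retry parameters set.

    The original mapping is left unchanged.
    """
    if max_attempts < 1:
        raise ValueError("max_attempts must be at least 1")
    updated = dict(conf)
    updated["spark.adb.maxAttempts"] = max_attempts
    updated["spark.adb.attemptFailuresValidityInterval"] = validity_interval
    return updated


# Placeholder base configuration and retry values (assumptions).
base_conf = {"spark.executor.instances": 2}
retry_conf = with_retries(base_conf, max_attempts=3, validity_interval=6000)
print(retry_conf)
```

Returning a copy rather than mutating the input makes it possible to keep one base configuration and derive retry-enabled variants from it.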