Write multi-statement SQL in script mode - MaxCompute

Script mode compiles an entire multi-statement SQL file as a single unit and submits it as one job. MaxCompute generates one execution plan, queues the script once, and runs all statements atomically. This makes script mode well-suited for extract, transform, and load (ETL) pipelines, scheduled batch jobs, and complex query orchestration that would otherwise require chaining multiple separate submissions.

Cost estimation is not supported for SQL in script mode. Actual fees appear in your bill. For details, see View billing details.

A single script can reference up to 10,000 tables. Each reference counts individually, including repeated references to the same table and tables referenced inside view definitions.

When to use script mode

Script mode fits two main scenarios:

Complex single-statement rewrites — replace deeply nested subqueries with a sequence of readable table variable assignments.
Multi-statement pipelines — combine statements that logically belong together into one atomic job, so they either all succeed or all roll back.

If data from multiple data sources is prepared at long intervals (for example, one is ready at 01:00 and another at 07:00), do not use a table variable to assemble a large SQL script in script mode. In that case, keep the statements in separate jobs.

Script structure

A script in script mode follows a fixed structure. The sections must appear in this order: SET → DDL → DML. Each section can be empty, but sections cannot be interleaved.

-- 1. SET: configure session-level properties
SET odps.sql.type.system.odps2=true;
[SET odps.stage.reducer.num=xxx;]

-- 2. DDL: create tables (optional)
CREATE TABLE table1 ...;

-- 3. DML: declare variables and insert results
@var1 := SELECT [ALL | DISTINCT] select_expr, select_expr, ... FROM table3 [WHERE where_condition];
@var2 := SELECT [ALL | DISTINCT] select_expr, select_expr, ... FROM table4 [WHERE where_condition];
@var3 := SELECT [ALL | DISTINCT] var1.select_expr, var2.select_expr, ... FROM @var1 JOIN @var2 ON ...;
INSERT OVERWRITE|INTO TABLE [PARTITION (...)]
    SELECT [ALL | DISTINCT] select_expr, select_expr, ... FROM @var3;

-- Additional patterns
[@var4 := SELECT [ALL | DISTINCT] var1.select_expr, var1.select_expr, ... FROM @var1
    UNION ALL | UNION
    SELECT [ALL | DISTINCT] var2.select_expr, var2.select_expr, ... FROM @var2;
CREATE [EXTERNAL] TABLE [IF NOT EXISTS] table_name AS
    SELECT ... FROM @var4;]

Execution semantics

Behavior	Detail
Atomic execution	If any statement fails, the entire script fails and all operations roll back.
Single job	MaxCompute generates one job for the entire script. The job starts only after all input data is ready.
Single execution plan	All statements compile into a single directed acyclic graph (DAG), so the script is queued and dispatched once.

Table variables

Declare table variables with @ and assign results using :=:

@var1 := SELECT * FROM src WHERE key > 0;

Key rules:

In script mode, you cannot assign a table type variable to another variable with a specified data type. For example, the following is not allowed:
```
@a TABLE (name STRING);
@a := SELECT 'tom';
@b STRING;
@b := SELECT * FROM @a;
```

A variable can hold a constant value. Use SELECT * FROM @var to convert it into a scalar for arithmetic with other columns:

@a := SELECT 10;                                         -- assign constant 10
@b := SELECT key, value + (SELECT * FROM @a) FROM t2;   -- use @a as scalar
SELECT * FROM @b;

Restrictions

Restriction	Detail
Screen output statements	Only one standalone `SELECT` (or similar result-displaying statement) is allowed per script.
`CREATE TABLE AS`	Allowed once per script, and only as the last executable statement. Prefer creating the table and inserting data separately.
Mixed write modes	In the same script, avoid using both `OVERWRITE` and `INTO` on the same table. DML operations on transactional tables and standard tables cannot be mixed.
Read-after-write	Reading a table in the same script after writing to it causes an error. Write to a variable first, then insert and read from the variable.

Read-after-write — problem and fix:

-- Error: the table is written to, then read in the same script.
INSERT OVERWRITE TABLE src2 SELECT * FROM src WHERE key > 0;
@a := SELECT * FROM src2;
SELECT * FROM @a;

-- Fix: write to a variable, then use the variable for both the insert and the read.
@a := SELECT * FROM src WHERE key > 0;
INSERT OVERWRITE TABLE src2 SELECT * FROM @a;
SELECT * FROM @a;

IF statements

Script mode supports conditional branching. MaxCompute supports two IF syntaxes:

-- Single branch
IF (condition) BEGIN
  statement1
  statement2
END

-- Multi-branch
IF (condition) BEGIN
  statements
END ELSE IF (condition2) BEGIN
  statements
END ELSE BEGIN
  statements
END

If a branch contains only one statement, BEGIN and END are optional (similar to { } in Java). DDL statements such as CREATE TABLE, ALTER TABLE, and TRUNCATE TABLE are not supported inside IF branches.

The condition can be one of two types:

Boolean expression — the branch is resolved at compile time:

@date := '20190101';
@row TABLE(id STRING);
IF (CAST(@date AS BIGINT) % 2 == 0) BEGIN
  @row := SELECT id FROM src1;
END ELSE BEGIN
  @row := SELECT id FROM src2;
END
INSERT OVERWRITE TABLE dest SELECT * FROM @row;

Boolean scalar subquery — the branch is resolved at runtime, so MaxCompute may submit multiple jobs:

@i BIGINT;
@t TABLE(id BIGINT, value BIGINT);
IF ((SELECT COUNT(*) FROM src WHERE a = '5') > 1) BEGIN
  @i := 1;
  @t := SELECT @i, @i*2;
END ELSE BEGIN
  @i := 2;
  @t := SELECT @i, @i*2;
END
SELECT id, value FROM @t;

Example

The following script joins and unions data from three source tables, then inserts results into two destination tables. All statements compile as a single DAG and execute atomically.

CREATE TABLE IF NOT EXISTS dest(key STRING, value BIGINT) PARTITIONED BY (d STRING);
CREATE TABLE IF NOT EXISTS dest2(key STRING, value BIGINT) PARTITIONED BY (d STRING);

@a := SELECT * FROM src  WHERE value > 0;
@b := SELECT * FROM src2 WHERE key IS NOT NULL;
@c := SELECT * FROM src3 WHERE value IS NOT NULL;

@d := SELECT a.key, b.value FROM @a LEFT OUTER JOIN @b ON a.key = b.key AND b.value > 0;
@e := SELECT a.key, c.value FROM @a INNER JOIN @c ON a.key = c.key;
@f := SELECT * FROM @d UNION SELECT * FROM @e UNION SELECT * FROM @a;

INSERT OVERWRITE TABLE dest  PARTITION (d='20171111') SELECT * FROM @f;

@g := SELECT e.key, c.value FROM @e JOIN @c ON e.key = c.key;
INSERT OVERWRITE TABLE dest2 PARTITION (d='20171111') SELECT * FROM @g;

Run scripts in supported tools

Script mode is available in MaxCompute Studio, the MaxCompute client (odpscmd), DataWorks, and the Java and Python SDKs.

MaxCompute Studio

Before running a script in MaxCompute Studio, complete the setup:

After you compile and run the script, MaxCompute Studio displays the execution plan. Although the script contains multiple statements, the plan renders as a single DAG.

MaxCompute client (odpscmd)

Use odpscmd v0.27 or later. Install the latest odpscmd package, then submit a script with the -s flag:

..\bin>odpscmd -s myscript.sql

-s is a command-line flag, not an interactive command. Script mode and table variables are not supported in the odpscmd interactive shell.

For details on running the client, see Run the MaxCompute client.

DataWorks

In DataWorks, create an ODPS Script node. Edit the script in the node editor, then click the Run icon to submit it to MaxCompute. View the execution plan and results from the Logview URL in the output panel.

For setup details, see Create an ODPS Script node.

Java SDK

All SDK examples use odps.sql.submit.mode=script in the hints map to enable script mode.

import java.util.HashMap;
import java.util.List;
import java.util.Map;

import com.aliyun.odps.Instance;
import com.aliyun.odps.Odps;
import com.aliyun.odps.OdpsException;
import com.aliyun.odps.account.Account;
import com.aliyun.odps.account.AliyunAccount;
import com.aliyun.odps.data.Record;
import com.aliyun.odps.task.SQLTask;

public class SdkTest {

  public static void main(String[] args) throws OdpsException {
    // Store credentials in environment variables. Do not hard-code AccessKey pairs.
    Account account = new AliyunAccount(
        System.getenv("ALIBABA_CLOUD_ACCESS_KEY_ID"),
        System.getenv("ALIBABA_CLOUD_ACCESS_KEY_SECRET")
    );
    Odps odps = new Odps(account);
    odps.setDefaultProject("<your-project>");
    odps.setEndpoint("<your-endpoint>");

    String sqlScript = "@a := SELECT * FROM jdbc_test;\n"
                     + "SELECT * FROM @a;";

    // Required: enable script mode
    Map<String, String> hints = new HashMap<>();
    hints.put("odps.sql.submit.mode", "script");

    Instance instance = SQLTask.run(odps, "<your-project>", sqlScript, hints, null);
    instance.waitForSuccess();

    List<Record> recordList = SQLTask.getResult(instance);
    for (Record record : recordList) {
      System.out.println(record.get(0));
      System.out.println(record.get(1));
    }
  }

}

For more information, see Java SDK introduction.

Python SDK

import os
from odps import ODPS

# Store credentials in environment variables. Do not hard-code AccessKey pairs.
o = ODPS(
    os.environ["ALIBABA_CLOUD_ACCESS_KEY_ID"],
    os.environ["ALIBABA_CLOUD_ACCESS_KEY_SECRET"],
    "<your-project>",
    "<your-endpoint>"
)

sql_script = """
@a := SELECT * FROM jdbc_test;
SELECT * FROM @a;
"""

# Required: enable script mode
hints = {"odps.sql.submit.mode": "script"}
instance = o.execute_sql(sql_script, hints=hints)

with instance.open_reader() as reader:
    for rec in reader:
        print(rec[0], rec[1])

For more information, see Python SDK introduction.

Replace the following placeholders with your actual values:

Placeholder	Description
`<your-project>`	Your MaxCompute project name
`<your-endpoint>`	Your MaxCompute endpoint, for example `https://service.cn-hangzhou.maxcompute.aliyun.com/api`