DataWorks: ClickHouse SQL

Last Updated: Apr 09, 2026

ClickHouse SQL lets you run distributed SQL queries on structured data to improve job efficiency. In DataWorks, you can use a ClickHouse SQL node to develop tasks, schedule them to run periodically, and integrate them with other jobs. This topic describes the main workflow for developing a task using a ClickHouse SQL node.

Prerequisites

Procedure

  1. On the ClickHouse SQL node editing page, follow these steps:

    Develop SQL code

    In the SQL editor, write your task code. To define a variable, use the ${variable_name} format in the code, then assign its value under Scheduling Parameters in the Scheduling Settings pane on the right. The assigned value replaces the variable when the task runs, so each scheduled run can receive different parameters. For more information about scheduling parameters, see Sources and expressions for scheduling parameters. Example:

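    -- Create the database and table only if they do not already exist.
    CREATE DATABASE IF NOT EXISTS ck_test;
    CREATE TABLE IF NOT EXISTS ck_test.first_table (
        `product_code` String,
        `package_name` String
    ) ENGINE = MergeTree ORDER BY package_name SETTINGS index_granularity = 8192;
    -- ${var} is replaced with the value assigned under Scheduling Parameters.
    INSERT INTO ck_test.first_table (product_code, package_name) VALUES ('1', ${var});
    -- Read the table back to check the inserted row.
    SELECT * FROM ck_test.first_table;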

    Note: In this example, you can set the ${var} parameter to 1.

    Run SQL task

    1. In the Run Configuration pane, configure the Computing Resource and Resource Group.

      1. For Computing Resource, select the name of the CDH cluster that you registered in DataWorks.

      2. For Resource Group, select a scheduling resource group that has a confirmed connection to the data source. For more information, see Network connectivity solutions.

    2. On the toolbar, click the Select a data source drop-down menu. In the dialog box, select the ClickHouse data source that you created, and click Run to run the SQL task.

  2. To run the node periodically, configure its scheduling properties based on your business requirements. For configuration details, see Node scheduling configuration.

  3. After configuring the node, deploy it. For more information, see Node and workflow deployment.

  4. After you deploy the task, view its run status in Operation Center. For more information, see Get started with Operation Center.
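
To make the ${variable_name} mechanism from the Develop SQL code step more concrete, the following Python sketch shows how a placeholder could be replaced with its assigned value before the statement is run. It is an illustration only and assumes the replacement is purely textual; it is not how DataWorks implements scheduling parameters. The `render` helper and the `sql_template` name are hypothetical and exist only for this example.

```python
from string import Template

# The INSERT statement from the example above. ${var} is the placeholder
# that receives its value from the scheduling parameters.
sql_template = Template(
    "INSERT INTO ck_test.first_table (product_code, package_name) "
    "VALUES ('1', ${var});"
)


def render(template, params):
    """Return the SQL text with every ${name} placeholder replaced.

    Hypothetical helper for illustration. Template.substitute raises
    KeyError when a placeholder has no assigned value, which makes a
    missing parameter assignment visible immediately.
    """
    return template.substitute(params)


# Assign var=1, as suggested in the note for the example.
rendered = render(sql_template, {"var": "1"})
print(rendered)
```

Under this assumption the value is inserted exactly as typed. If a variable is meant to supply a string literal, the quotation marks therefore have to come from either the SQL code or the assigned value.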