Manage locks and ANALYZE command concurrency during development. Configure edit locks, object commits, query acceleration, and storage size updates.
Limitations
-
You can use the Storage size update settings feature only when the compute engine is E-MapReduce 3.x, E-MapReduce 5.x, CDH 5.x, CDH 6.x, FusionInsight 8.x, Cloudera Data Platform 7.x, AsiaInfo DP 5.3, ArgoDB, Transwarp TDH 6.x, StarRocks, SelectDB, or Doris.
-
The SelectDB and Doris compute engines do not support Table management settings or Default compute engine for standard modeling.
Permissions
Requires a custom role with the Manage development platform settings permission, or the Super Admin or System Admin role.
Access the development platform
-
In the top navigation bar of the Dataphin homepage, choose Management Center > System Settings.
-
In the left-side navigation pane, choose Platform Settings > Development Platform.
Edit lock
-
In the Edit Lock section, click the edit icon, enable Exclusive Edit Lock, and configure the lock.
| Parameter | Description |
| --- | --- |
| Exclusive Edit Lock | When disabled, users can override each other's locks. When enabled, a locked object cannot be edited by others until the lock is released or expires. |
| Lock Duration | The lock expires after the specified inactivity period. Default: 30 minutes. Range: 5 minutes to 120 minutes. |
| Automatically release when object is closed | Releases the lock when the object's edit tab is closed. |
| Automatically release on successful commit | Releases the lock after a successful commit. Not released if the commit fails. |
-
Click OK.
Click Restore Defaults to reset.
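The lock rules above can be sketched as a small model. This is an illustration only, not Dataphin's implementation: the `EditLock` class and `can_edit` function are hypothetical names, and treating a lock as expired at exactly the configured duration is an assumption.

```python
from dataclasses import dataclass
from datetime import datetime, timedelta
from typing import Optional

# Limits taken from the Lock Duration parameter described above.
MIN_LOCK_MINUTES, MAX_LOCK_MINUTES, DEFAULT_LOCK_MINUTES = 5, 120, 30


@dataclass
class EditLock:
    """Hypothetical model of an edit lock held on one object."""
    owner: str
    last_activity: datetime
    duration_minutes: int = DEFAULT_LOCK_MINUTES

    def __post_init__(self) -> None:
        if not MIN_LOCK_MINUTES <= self.duration_minutes <= MAX_LOCK_MINUTES:
            raise ValueError("Lock Duration must be between 5 and 120 minutes")

    def is_expired(self, now: datetime) -> bool:
        # Assumption: the lock expires once the inactivity period has fully elapsed.
        return now - self.last_activity >= timedelta(minutes=self.duration_minutes)


def can_edit(user: str, lock: Optional[EditLock], now: datetime,
             exclusive: bool = True) -> bool:
    """Return True if `user` may edit the object under the given settings."""
    if lock is None or lock.owner == user or lock.is_expired(now):
        return True
    # With Exclusive Edit Lock disabled, other users can override the lock.
    return not exclusive
```

With the default 30-minute duration, a second user is blocked 10 minutes after the owner's last activity and can edit again once 30 minutes of inactivity have passed.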
Storage size update settings
Hive does not automatically update storage sizes for tables written to HDFS by integration or real-time tasks, so the Data Catalog may not reflect current sizes. Dataphin can run the ANALYZE command automatically after task completion to fetch the latest storage data.
-
In the Storage Size Update Settings section, click the edit icon, enable automatic storage size update, and configure concurrent connections.
-
Automatic Storage Size Update: Disabled by default. When enabled, Dataphin runs the ANALYZE command on the target Hive table after each successful task to update storage sizes. Increase concurrent connections to reduce execution time if your Hive server has sufficient capacity. High concurrency consumes significant compute resources and may affect other tasks.
-
Maximum Connections: Maximum concurrent connections for ANALYZE commands. Default: 5. Range: 1–200.
Important: ANALYZE commands running or pending for more than 24 hours are automatically terminated.
-
-
Click OK.
Note:
-
When you enable Automatic Storage Size Update and click OK, the concurrent connection setting takes effect immediately. High concurrency consumes significant compute resources and may affect running tasks.
-
When you disable Automatic Storage Size Update, running or pending ANALYZE commands continue. Subsequent tasks will not trigger automatic updates. Run the ANALYZE command manually in Hive to update storage sizes.
-
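For the manual case, the following sketch builds a Hive statement of the form `ANALYZE TABLE ... COMPUTE STATISTICS` and checks the Maximum Connections range. The helper names are hypothetical, and the exact statement that Dataphin issues is not documented here, so treat the generated SQL as an assumption; partitioned tables typically also need a `PARTITION (...)` clause.

```python
import re

# Only plain identifiers are accepted, so the helper cannot be used for SQL injection.
_IDENT = re.compile(r"^[A-Za-z_][A-Za-z0-9_]*$")


def build_analyze_statement(database: str, table: str) -> str:
    """Build a Hive ANALYZE statement that refreshes table-level statistics."""
    for name in (database, table):
        if not _IDENT.match(name):
            raise ValueError(f"invalid identifier: {name!r}")
    return f"ANALYZE TABLE {database}.{table} COMPUTE STATISTICS"


def validate_max_connections(value: int = 5) -> int:
    """Check the Maximum Connections setting (default 5, range 1-200)."""
    if not 1 <= value <= 200:
        raise ValueError("Maximum Connections must be between 1 and 200")
    return value
```

For example, `build_analyze_statement("sales", "orders")` produces a statement that you can run in a Hive client; `sales` and `orders` are placeholder names.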
Node task settings
-
In the Node task settings area, click Edit to configure default scheduling time and commit rules.
-
Creation
| Parameter | Description |
| --- | --- |
| Default priority | The default priority for new integration, compute, and logical table tasks. Options: Lowest, Low, or Medium (default). |
| Default scheduling time | Randomly within an interval: The default time interval is 00:00–03:00, and the default random interval is 5 minutes. The end time must be later than the start time. Format: hh:mm (00:00–23:59). The random time interval must be an integer from 1 to 30. Fixed time: The default fixed time is 00:00. Format: hh:mm (00:00–23:59). |
| Default Python version | Default Python version for compute tasks, offline templates, and third-party packages. Options: Python 2.7, Python 3.7 (default), or Python 3.11. |
Note:
-
By default, Default Scheduling Time is set to Randomly within an interval. You can change it to Fixed time as needed.
-
When you create an offline task (integration, compute, or logical table), the scheduling time is set based on this configuration.
-
If Default Scheduling Time is set to Randomly within an interval, a random time is selected based on the configured rules.
-
If Default Scheduling Time is set to Fixed time, the configured time is used.
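As a rough illustration of the Randomly within an interval option, the sketch below picks a time between the start and end times. It assumes that the random interval acts as a step in minutes on which candidate times are aligned; the source does not define the exact selection rule, and `pick_schedule_time` is a hypothetical helper.

```python
import random
from datetime import time
from typing import Optional


def _to_minutes(hhmm: str) -> int:
    """Convert an hh:mm string (00:00-23:59) to minutes since midnight."""
    hours, minutes = (int(part) for part in hhmm.split(":"))
    if not (0 <= hours <= 23 and 0 <= minutes <= 59):
        raise ValueError(f"time out of range: {hhmm}")
    return hours * 60 + minutes


def pick_schedule_time(start: str = "00:00", end: str = "03:00",
                       step_minutes: int = 5,
                       rng: Optional[random.Random] = None) -> time:
    """Pick a random time in [start, end], aligned to step_minutes from start."""
    rng = rng or random.Random()
    if not 1 <= step_minutes <= 30:
        raise ValueError("random interval must be an integer from 1 to 30")
    start_min, end_min = _to_minutes(start), _to_minutes(end)
    if end_min <= start_min:
        raise ValueError("end time must be later than start time")
    # Number of whole steps that fit between start and end.
    slots = (end_min - start_min) // step_minutes
    chosen = start_min + rng.randint(0, slots) * step_minutes
    return time(chosen // 60, chosen % 60)
```

With the defaults, every result falls between 00:00 and 03:00 on a 5-minute boundary, which spreads new tasks across the window instead of starting them all at the same minute.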
-
-
-
Execution
Hide logview URL if SQL contains account and password global variables: Prevents account and password global variables in SQL from appearing in plaintext in MaxCompute logview URLs. Disabled by default.
If you enable this option, the logview URL is hidden in execution, data preview, and O&M logs for MAXCOMPUTE_SQL and logical table tasks that use account and password global variables. The URL is replaced with:
The logview url is invisible because the current SQL is using the global variable "{dp_glb_xxx}", which is of type account and password.
Note: Available only for the MaxCompute compute engine.
-
Commit
| Parameter | Description |
| --- | --- |
| Automatically parse dependencies when committing offline development objects | When enabled, the system parses and updates upstream dependencies on each commit of offline objects (SQL compute tasks, logical table tasks). Prevents missing dependencies. |
| Validate field data types on logical table commit | When enabled, validates that field calculation return types match the declared data type on logical table commit. Mismatches block the commit to prevent implicit type conversion errors. |
-
Decommission and deletion
Allow deleting published objects in the development environment: When enabled, published objects (compute tasks, integration tasks, logical tables, atomic metrics, business qualifiers, derived metrics) can be deleted from the development environment.
Important: Deleted objects cannot be recovered. Deleting an object in development without publishing the deletion makes the production object unmodifiable.
-
Default dependency period and policy
Modify the Default Dependency Period and Default Dependency Policy.
-
Default Dependency Period: Options: Current period (day), Previous period (day), Last 24 hours, or N days ago. For N days ago, the default N is 2 (required).
-
Default Dependency Policy: Options: First instance, Latest instance, All instances, or Last instance.
The initial default dependency periods and policies are as follows.
| Current node scheduling period | Upstream node scheduling period | Upstream self-dependency | Default dependency period | Default dependency policy |
| --- | --- | --- | --- | --- |
| Daily/Weekly/Monthly | Daily | Yes/No | Current period (day) | Last instance |
| Daily/Weekly/Monthly | Hourly/Minutely | No | Current period (day) | All instances |
| Daily/Weekly/Monthly | Hourly/Minutely | Yes | Current period (day) | Last instance |
| Monthly/Weekly/Daily/Hourly/Minutely | Monthly/Weekly | Yes | Current period (day) | Last instance |
| Monthly/Weekly/Daily/Hourly/Minutely | Monthly/Weekly | No | Current period (day) | Last instance |
| Hourly/Minutely | Daily | Yes/No | Current period (day) | Last instance |
| Hourly/Minutely | Hourly/Minutely | Yes/No | Current period (day) | Last instance |
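The initial defaults listed above reduce to a single exception: All instances applies only when a daily, weekly, or monthly node depends on an hourly or minutely upstream node that is not self-dependent. The following sketch encodes that reading; `default_dependency` is a hypothetical helper, not a Dataphin API.

```python
from typing import Tuple

LONG_PERIODS = {"Daily", "Weekly", "Monthly"}
SHORT_PERIODS = {"Hourly", "Minutely"}


def default_dependency(current: str, upstream: str,
                       upstream_self_dependent: bool) -> Tuple[str, str]:
    """Return (default dependency period, default dependency policy)."""
    known = LONG_PERIODS | SHORT_PERIODS
    if current not in known or upstream not in known:
        raise ValueError("unknown scheduling period")
    # Every listed combination uses the current period.
    period = "Current period (day)"
    if (current in LONG_PERIODS and upstream in SHORT_PERIODS
            and not upstream_self_dependent):
        return period, "All instances"
    return period, "Last instance"
```

For example, a daily node that depends on an hourly node without self-dependency waits for all of that day's hourly instances, while the same pair with upstream self-dependency waits only for the last one.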
-
-
Tag values
Manage task tag values. Click Add Tag to create a tag. Maximum: 50 tags, each up to 64 characters.
Click Delete to remove a tag from all associated tasks. Re-adding the tag restores it.
-
-
After you complete the configuration, click OK.
Click Restore Defaults to reset.
Table management settings
Table management settings are not supported for StarRocks, GaussDB(DWS), Doris, or SelectDB.
-
In the Table management settings area, click Edit to configure table deletion publish settings.
-
Automatically generate a pending publish item for table deletion after deleting a table using SQL: Enabled by default. Generates a pending publish item when a DROP TABLE statement runs in an ad hoc query or SQL compute task. When disabled, running DROP TABLE table_name does not generate a publish item.
-
Generate a pending publish item for table deletion when deleting a table in Table Management: Enabled by default. Generates a pending publish item when a table is deleted in Table Management. When disabled, no publish item is generated.
-
-
Configure the Default Storage Format or Default External Table Storage Format. Supported formats vary by compute engine.
Note:
-
When the compute engine is AnalyticDB for PostgreSQL, Default Storage Format is not configurable.
-
Default External Table Storage Format is available only for MaxCompute.
-
In the table below, - indicates unsupported.

| Compute engine | Engine default | Hudi | Delta (Delta Lake) | Paimon | Iceberg | Kudu | Parquet | Avro | Rcfile | Orc | Textfile | Sequencefile | Binaryfile | Csv | Text | Json |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| MaxCompute | - | - | - | - | - | - | Supported | Supported | Supported | Supported | Supported | Supported | - | - | - | - |
| Lindorm (compute engine) | Supported | - | - | - | Supported | - | Supported | Supported | Supported | Supported | Supported | Supported | - | - | - | - |
| Databricks | Supported | - | Supported | - | - | - | Supported | Supported | - | Supported | - | - | Supported | Supported | Supported | Supported |
| Amazon EMR | Supported | Supported | Supported | Supported | Supported | Supported | Supported | Supported | Supported | Supported | Supported | Supported | - | - | - | - |
| Transwarp TDH 6.x, Transwarp TDH 9.3.x | Supported | - | - | Supported | Supported | - | Supported | Supported | Supported | Supported | Supported | Supported | - | - | - | - |
| CDH 5.x, CDH 6.x, E-MapReduce 3.x, E-MapReduce 5.x, Cloudera Data Platform 7.x, Huawei FusionInsight 8.x, AsiaInfo DP 5.3 | Supported | Supported | Supported | Supported | Supported | Supported | Supported | Supported | Supported | Supported | Supported | Supported | - | - | - | - |
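The support matrix above can also be expressed as a lookup, which is convenient when a script needs to check a format before it creates a table. The sketch below is a transcription of that matrix under hypothetical names (`SUPPORTED_FORMATS`, `is_format_supported`); it is not an interface that Dataphin exposes.

```python
# Formats that every listed engine supports.
COMMON = frozenset({"Parquet", "Avro", "Rcfile", "Orc", "Textfile", "Sequencefile"})

# Engines that support all of the first twelve formats in the matrix.
_FULL = COMMON | {"Engine default", "Hudi", "Delta (Delta Lake)",
                  "Paimon", "Iceberg", "Kudu"}
_TRANSWARP = COMMON | {"Engine default", "Paimon", "Iceberg"}

SUPPORTED_FORMATS = {
    "MaxCompute": COMMON,
    "Lindorm (compute engine)": COMMON | {"Engine default", "Iceberg"},
    "Databricks": frozenset({"Engine default", "Delta (Delta Lake)", "Parquet",
                             "Avro", "Orc", "Binaryfile", "Csv", "Text", "Json"}),
    "Amazon EMR": _FULL,
    "Transwarp TDH 6.x": _TRANSWARP,
    "Transwarp TDH 9.3.x": _TRANSWARP,
}
# The remaining Hadoop-family engines share one row in the matrix.
for _engine in ("CDH 5.x", "CDH 6.x", "E-MapReduce 3.x", "E-MapReduce 5.x",
                "Cloudera Data Platform 7.x", "Huawei FusionInsight 8.x",
                "AsiaInfo DP 5.3"):
    SUPPORTED_FORMATS[_engine] = _FULL


def is_format_supported(engine: str, storage_format: str) -> bool:
    """Return True if the matrix marks the format as supported for the engine."""
    return storage_format in SUPPORTED_FORMATS.get(engine, frozenset())
```

For instance, `is_format_supported("Databricks", "Rcfile")` is false, while `is_format_supported("MaxCompute", "Orc")` is true. Unknown engines return false rather than raising an error, which is a design choice of this sketch.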
-
-
Configure the default lifecycle for physical and logical tables in MaxCompute. Default: empty (no lifecycle). Range: 1–36,500 days. Quick options: 7, 14, 30, or 360 days.
Note: Default lifecycle is available only for the MaxCompute compute engine.
-
After you complete the configuration, click OK.
Click Restore Defaults to reset.
Default compute engine for standard modeling
Set the default compute engine for standard modeling when using a Hadoop compute engine. Options: Hive, Impala, and Spark, with the following limitations:
If a task type is not enabled for the project's compute source, the system defaults to Hive. For more information, see Create a Hadoop compute source.
-
Hive: Cannot read Kudu source tables.
-
Impala: Reads Kudu source tables but does not support storing logical tables in Kudu format. Avoid Impala if your source tables are not in Kudu format.
Note: Impala is not available for Amazon EMR.
-
Spark: Cannot read Kudu source tables.
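The fallback and Kudu rules above can be summarized in a short sketch. The function names are hypothetical, and the idea that enabled task types are passed in as a collection of engine names is an assumption made for illustration.

```python
from typing import Iterable

ENGINES = ("Hive", "Impala", "Spark")


def resolve_modeling_engine(preferred: str,
                            enabled_task_types: Iterable[str]) -> str:
    """Return the engine that is used for standard modeling."""
    if preferred not in ENGINES:
        raise ValueError(f"unsupported engine: {preferred}")
    # If the task type is not enabled for the compute source, fall back to Hive.
    return preferred if preferred in set(enabled_task_types) else "Hive"


def can_read_kudu_sources(engine: str) -> bool:
    """Of the three options, only Impala reads Kudu source tables."""
    return engine == "Impala"
```

A project that prefers Spark but has only Hive and Impala task types enabled resolves to Hive, which cannot read Kudu source tables.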
Query acceleration
Enable or disable MaxCompute Query Acceleration (MCQA). When enabled, MCQA applies to all ad hoc MaxCompute SQL queries and SQL unit queries in the analysis platform.
Query acceleration is available only for MaxCompute.
Single-tenant multi-engine
In multi-engine mode, only Edit Lock, Storage Size Update Settings, Node Task Settings, and Table Management Settings are configurable. The following must be configured elsewhere:
-
Hide logview URL if SQL contains account and password global variables: Configure in Cluster Management > MaxCompute Cluster - Other Configurations. For more information, see Other configurations.
-
Default Storage Format/Default External Table Storage Format: Configure in Cluster Management > Other configurations. For more information, see Other configurations.
-
Default Compute Engine for Standard Modeling: Configure in Cluster Management > Other configurations. For more information, see Other configurations.
-
Query Acceleration: Configure in Cluster Management > MaxCompute Cluster - Other Configurations. For more information, see Other configurations.