Data sources supported by data profiling
Profiling a large amount of data can degrade database performance. To avoid performance degradation, extract a reasonable amount of data for profiling. For more information about reasonable profiling ranges, see Profiling range.
Data source type | Detection partition | First N records | Random sampling of N records | Percentage sampling of N records |
MySQL | Not supported | Supported | Supported (poor performance) | Supported |
Oracle | Not supported | Support | Supported (poor performance) | Supported (poor performance at high percentages) |
PostgreSQL | Not supported | Supported | Supported (poor performance) | Supported |
Microsoft SQL Server | Not supported | Supported | Supported (poor performance) | Supported |
Compute engines supported by data profiling
Compute engine | Data profiling |
MaxCompute | Supported |
E-MapReduce 3.X | Supported |
E-MapReduce 5.x | Supported |
CDH 5.X | Support |
CDH 6.X | Supported |
AsiaInfo DP 5.3 | Supported |
Cloudera Data Platform 7.x | Supported |
Amazon EMR | Supported |
Transwarp Data Hub (TDH) | Supported |
FusionInsight 8.X | Supported |
Lindorm (compute engine) | Supported |