This topic describes how to create a crawler to collect metadata from an SQL Server data source. You can view the collected metadata on the Data Map page.
Background information
After you create a metadata crawler to collect the full metadata of SQL Server tables, the system enables automated incremental metadata collection. The crawler then automatically synchronizes incremental metadata of those tables to DataWorks.
Procedure
- In the left-side navigation pane, click Data Recognition. The Data Recognition page appears.
- In the left-side navigation pane, click SQL Server.
- On the SQL Server Metadata Crawler page, click Create Crawler.
- In the Create Crawler dialog box, set the parameters in each step.
- In the Basic Information step, set the parameters as required.
Parameter | Description |
---|---|
Crawler Name | Required. The name of the crawler. You must set a unique name. |
Crawler Description | The description of the crawler. |
Workspace | The workspace of the data source from which you want to collect metadata. |
Connect To | The type of the data source from which you want to collect metadata. The default value is SQL Server and cannot be changed. |
- Click Next.
- In the Select Collection Object step, select a data source from the Data Source drop-down list.
If no data source is available, click Create to open the page where you can add an SQL Server data source. For more information, see Add an SQL Server data source.
- Click Start Testing next to Test Crawler Connectivity.
- If the message The connectivity test is successful appears, click Next.
- In the Configure Execution Plan step, configure an execution plan.
Valid values of the Execution Plan parameter are On-demand Execution, Monthly, Weekly, Daily, and Hourly.
- Click Next.
- In the Confirm Information step, check the information that you specified and click Confirm.
- On the SQL Server Metadata Crawler page, find the created crawler and click Run in the Actions column.
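The values entered in the Create Crawler dialog box can be checked against the rules stated in the procedure before you start. The following Python sketch is illustrative only: it does not call any DataWorks API, and the `CrawlerSettings` class, the `validate` function, and the sample names (`sales_crawler`, `demo_ws`, `sqlserver_demo`) are hypothetical. It encodes only what the procedure states: the crawler name is required and must be unique, Connect To is fixed to SQL Server, a data source must be selected, and Execution Plan must be one of the five listed values.

```python
from dataclasses import dataclass

# Values listed for the Execution Plan parameter in the procedure.
VALID_EXECUTION_PLANS = {
    "On-demand Execution", "Monthly", "Weekly", "Daily", "Hourly",
}


@dataclass
class CrawlerSettings:
    """Hypothetical record of the values entered in the Create Crawler dialog box."""
    crawler_name: str
    workspace: str
    data_source: str
    execution_plan: str
    crawler_description: str = ""    # optional in the dialog box
    connect_to: str = "SQL Server"   # fixed default, cannot be changed


def validate(settings, existing_names):
    """Return a list of problems; an empty list means the settings look consistent."""
    problems = []
    if not settings.crawler_name.strip():
        problems.append("Crawler Name is required.")
    elif settings.crawler_name in existing_names:
        problems.append("Crawler Name must be unique.")
    if settings.connect_to != "SQL Server":
        problems.append("Connect To must be SQL Server.")
    if not settings.data_source.strip():
        problems.append("Select a data source.")
    if settings.execution_plan not in VALID_EXECUTION_PLANS:
        problems.append("Unknown Execution Plan: " + settings.execution_plan)
    return problems


if __name__ == "__main__":
    settings = CrawlerSettings(
        crawler_name="sales_crawler",   # hypothetical name
        workspace="demo_ws",            # hypothetical workspace
        data_source="sqlserver_demo",   # hypothetical data source
        execution_plan="Daily",
    )
    print(validate(settings, existing_names={"orders_crawler"}))
```

A check like this is useful mainly as a pre-flight checklist when several crawlers are planned at once. The console itself remains the authority on which values are accepted.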