This topic describes how to create a crawler to collect metadata from an SQL Server data source. You can view the collected metadata on the Data Map page.

Background information

After you create a metadata crawler to collect full metadata of EMR tables, the system enables automated incremental metadata collection. This way, the metadata crawler can automatically synchronize incremental metadata of the EMR tables to DataWorks.

Procedure

  1. In the left-side navigation pane, click Data Recognition. The Data Recognition page appears.
  2. In the left-side navigation pane, click SQL Server.
  3. On the SQL ServerMetadata Crawler page, click Create Crawler.
  4. In the Create Crawler dialog box, set the parameters in each step.
    1. In the Basic Information step, set the parameters as required.
      Basic Information step
      ParameterDescription
      Crawler NameRequired. The name of the crawler. You must set a unique name.
      Crawler DescriptionThe description of the crawler.
      WorkspaceThe workspace of the data source from which you want to collect metadata.
      Connect ToThe type of the data source from which you want to collect metadata. The default value is SQL Server and cannot be changed.
    2. Click Next.
    3. In the Select Collection Object step, select a data source from the Data Source drop-down list.
      If no data source is available, click Create to go to the Data Source page and add an SQL Server data source. For more information, see Add an SQL Server data source.
    4. Click Start Testing next to Test Crawler Connectivity.
    5. If the message The connectivity test is successful appears, click Next.
    6. In the Configure Execution Plan step, configure an execution plan.
      Valid values of the Execution Plan parameter are On-demand Execution, Monthly, Weekly, Daily, and Hourly.
    7. Click Next.
    8. In the Confirm Information step, check the information that you specified and click Confirm.
  5. On the SQL ServerMetadata Crawler page, find the created crawler and click Run in the Actions column.