This topic describes how to create a crawler to collect metadata from a Hologres data source to DataWorks. You can view the collected metadata on the Data Map page.

Prerequisites

An EMR cluster is associated with your workspace as a compute engine instance. For more information, see Associate an EMR compute engine instance with a workspace.

Limits

  • You cannot collect metadata across regions. You must create a crawler in the region where the source metadata resides to collect the metadata.
  • You must collect metadata over the Internet.
  • Metadata collection from Hologres data sources is supported in the China (Hangzhou), China (Shanghai), China (Shenzhen), China (Beijing), and China East 2 Finance regions.

Limits

  • You cannot collect metadata across regions. You must create a crawler in the region where the source metadata resides to collect the metadata.
  • You must collect metadata over the Internet.

Procedure

  1. In the left-side navigation pane, click Hologres.
  2. On the HologresMetadata Crawler page, click Create Crawler.
  3. In the Create Crawler dialog box, complete the wizard.
    1. In the Basic Information step, set the parameters.
      Hologres
      Parameter Description
      Crawler Name Required. The name of the crawler. You must set a unique name.
      Crawler Description The description of the crawler.
      Workspace The workspace of the data source from which you want to collect metadata.
      Data Source Type The type of the data source from which you want to collect metadata. The default value is Hologres and cannot be changed.
    2. In the Select Collection Object step, select a data source from the Data Source drop-down list.
      You can collect metadata only from the Hologres instances that are associated with your workspace as compute engine instances. If no data source is available, associate a Hologres instance with your workspace. For more information, see Configure a workspace.
  4. On the HologresMetadata Crawler page, find the created crawler and click Run in the Actions column.