This topic describes how to create a crawler to collect metadata from a PostgreSQL data store to DataWorks. You can view collected metadata on the Data Map page.


  1. Go to the Data Discovery page.
    1. Log on to the DataWorks console.
    2. In the left-side navigation pane, click Workspaces. The Workspaces page appears.
    3. Find the target workspace and click Data Analytics in the Actions column.
    4. On the DataStudio page, click Icon in the upper-left corner and choose All Products > DataMap. The Data Map page appears.
    5. Click Data Discovery in the top navigation bar.
  2. In the left-side navigation pane, click PostgreSQL.
  3. On the PostgreSQLMetadata Crawler page that appears, click Create Crawler.
  4. In the Create Crawler dialog box that appears, follow these steps:
    1. In the Basic Information step, set basic parameters.
      Basic Information
      Parameter Description
      Crawler Name Required. The name of the crawler. You must specify a unique name.
      Crawler Description The description of the crawler.
      Workspace The workspace where the metadata collected from the specific data store will be used.
      Connect To The type of the data store from which metadata will be collected. The default value is PostgreSQL and cannot be changed.
    2. Click Next.
    3. In the Select object type step, select a connection from the Connection drop-down list.
      If the required connection does not exist, click Go to New to go to the Data Source page in Workspace Management and create the connection. For more information, see Configure a PostgreSQL connection.
    4. Click Test Crawler Connectivity.
    5. When the message The test was successful appears, click Next.
    6. In the Configure Execution Plan step, set scheduling parameters.
      The valid values of Execution Plan are as follows: On-demand Execution, Monthly, Weekly, Daily and Hourly.
    7. Click Next.
    8. In the Confirm Information step, verify that the configuration of the crawler is correct and click Confirm.
  5. On the PostgreSQLMetadata Crawler page, find the created crawler and click Run in the Actions column.
    After the crawler is run, click the number in the Last run update table or Last run Add table column to view details about the updated or added tables.
    You can also perform the following operations on the page:
    • Click Details in the Actions column of a crawler. In the Crawler Details dialog box that appears, view the detailed information about the crawler.
    • Click Edit in the Actions column of a crawler. In the Edit Crawler dialog box that appears, modify the configuration of the crawler.
    • Click Delete in the Actions column of a crawler. In the Confirm dialog box that appears, click OK to delete the crawler.
    • Click Stop in the Actions column of a running crawler to stop the crawler.