This topic describes how to create a crawler to collect metadata from a MySQL data source to DataWorks. You can view the collected metadata on the Data Map page.

Prerequisites

An EMR cluster is associated with your workspace as a compute engine instance. For more information, see Associate an EMR cluster with a workspace.

Limits

  • You cannot collect metadata across regions. You must create a crawler in the region where the source metadata resides to collect the metadata.
  • You must collect metadata over the Internet.

Procedure

  1. In the left-side navigation pane, click MySQL.
  2. On the MySQL Metadata Crawler page, click Create Crawler.
  3. In the Create Crawler dialog box, complete the wizard.
    1. In the Basic Information step, set the parameters.
      Parameters in the Basic Information step:
        • Crawler Name: Required. The name of the crawler. The name must be unique.
        • Crawler Description: The description of the crawler.
        • Workspace: The workspace of the data source from which you want to collect metadata.
        • Data Source Type: The type of the data source from which you want to collect metadata. The default value is MySQL and cannot be changed.
    2. Click Next.
    3. In the Select Collection Object step, select a data source from the Data Source drop-down list.
      If no data source is available, click Create to go to the Data Source page and create a MySQL data source. For more information, see Add a MySQL data source.
      Note: You can select an ApsaraDB RDS for MySQL instance or a MySQL data source that is accessible from the Internet by using a Java Database Connectivity (JDBC) connection string.
    4. Click Start Testing next to Test Crawler Connectivity.
    5. If the message "The connectivity test is successful" appears, click Next.
    6. In the Configure Execution Plan step, configure an execution plan.
      Valid values of the Execution Plan parameter are On-demand Execution, Monthly, Weekly, Daily, and Hourly.
    7. Click Next.
    8. In the Confirm Information step, check the information that you specified and click Confirm.
  4. On the MySQL Metadata Crawler page, find the created crawler and click Run in the Actions column.
After the metadata in the MySQL data source is collected, click All Data in the top navigation bar. Select MySQL from the drop-down list in the upper part of the page. You can view the tables that store the collected MySQL metadata. You can click the name of a table to view the table details.
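
Before you run the connectivity test in the wizard, it can help to confirm locally that the JDBC connection string of the data source points to a host and port that accept connections. The following Python sketch is not part of DataWorks and does not reproduce its connectivity test. It only parses a connection string in the jdbc:mysql://host:port/database form and attempts a plain TCP connection. The function names parse_mysql_jdbc_url and can_reach are hypothetical and chosen for illustration, and the port 3306 fallback assumes the standard MySQL default port.

```python
import socket
from urllib.parse import urlsplit


def parse_mysql_jdbc_url(jdbc_url):
    """Split a jdbc:mysql:// connection string into (host, port, database)."""
    prefix = "jdbc:"
    if not jdbc_url.startswith(prefix + "mysql://"):
        raise ValueError("expected a string that starts with jdbc:mysql://")
    # Drop the "jdbc:" prefix so the remainder parses as an ordinary URL.
    parts = urlsplit(jdbc_url[len(prefix):])
    if not parts.hostname:
        raise ValueError("no host found in connection string")
    # Assumption: fall back to 3306, the default MySQL port, if none is given.
    port = parts.port or 3306
    # The path is "/database"; query parameters are ignored in this sketch.
    database = parts.path.lstrip("/")
    return parts.hostname, port, database


def can_reach(host, port, timeout=3.0):
    """Return True if a TCP connection to host:port opens within the timeout."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, timeouts, and DNS resolution failures.
        return False
```

For example, parse_mysql_jdbc_url("jdbc:mysql://db.example.com:3306/sales") returns the host, port, and database name, which you can then pass to can_reach(host, port). The host db.example.com is a placeholder. A successful TCP connection only shows that the port is open from your machine. It does not verify credentials or guarantee that the crawler can reach the data source.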