This topic describes how to collect metadata and view table information in Data Map.
- Go to the Data Discovery page.
  - Log on to the DataWorks console.
  - In the left-side navigation pane, click Workspaces.
  - In the top navigation bar, select a region, find the workspace in which you want to create a crawler, and then click Data Analytics in the Actions column.
  - On the DataStudio page, click the icon in the upper-left corner and open Data Map from the menu. The homepage of Data Map appears.
  - In the top navigation bar, click Data Discovery.
- In the left-side navigation pane, click E-MapReduce. On the Obtain Metadata from E-MapReduce page, click Create Crawler.
- In the Create Crawler dialog box, select an engine instance from the Select Engine Instance drop-down list and click Authorize.
- On the page that appears, click the Metadata tab and click Enable.
- In the Confirm Operation message, click OK.
- Return to the Create Crawler dialog box on the Data Map page and click Refresh.
- After the authorization status changes to Authorized, click Commit.
- On the Obtain Metadata from E-MapReduce page, find the created crawler and click Obtain All.
Click Refresh in the upper-right corner of the page and verify that the running status of the E-MapReduce data collection instance changes to Collected successfully.
Note: After full metadata is collected from E-MapReduce, the system automatically collects incremental metadata and synchronizes new metadata from E-MapReduce tables.
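Clicking Refresh until the status changes is essentially a polling loop. The following Python sketch illustrates that idea only; it does not call any DataWorks API. The `fetch_status` callable, the "Collecting" value, and the "Collection failed" value are assumptions made for illustration. Only the "Collected successfully" text comes from this topic.

```python
import time
from typing import Callable

SUCCESS_STATUS = "Collected successfully"  # status text shown on the page (from this topic)
FAILED_STATUS = "Collection failed"        # hypothetical failure text, assumed for illustration


def wait_for_collection(
    fetch_status: Callable[[], str],
    interval_seconds: float = 5.0,
    max_attempts: int = 60,
    sleep: Callable[[float], None] = time.sleep,
) -> bool:
    """Poll fetch_status until it reports success, failure, or attempts run out.

    fetch_status is a hypothetical stand-in for clicking Refresh and reading
    the running status; it is not a DataWorks API.
    """
    for attempt in range(max_attempts):
        status = fetch_status()
        if status == SUCCESS_STATUS:
            return True
        if status == FAILED_STATUS:
            return False
        # Wait before the next check, except after the final attempt.
        if attempt < max_attempts - 1:
            sleep(interval_seconds)
    return False


if __name__ == "__main__":
    # Simulated status sequence standing in for repeated Refresh clicks.
    simulated = iter(["Collecting", "Collecting", SUCCESS_STATUS])
    print(wait_for_collection(lambda: next(simulated), sleep=lambda _: None))
```

The bounded number of attempts keeps the loop from waiting indefinitely if the status never reaches a terminal value.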
View table information
- In the top navigation bar of the current page, click All Data.
- On the All Data page, click the E-MapReduce tab.
- On the E-MapReduce tab, click the rpt_user_info_d table to view its details.
You can also enter a keyword in the search box at the top to search for an E-MapReduce table and view the details of the table.
- Click the Lineage tab to view the lineage of the table.
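Table lineage can be thought of as a directed graph in which each table points to the tables it is produced from. The following Python sketch walks such a graph to list every upstream table. It is a conceptual illustration, not how Data Map stores or exposes lineage. Apart from rpt_user_info_d, all table names are hypothetical.

```python
from collections import deque
from typing import Dict, List

# Hypothetical lineage edges: each key is a table, each value lists the tables
# it is produced from. Only rpt_user_info_d comes from this topic; the other
# names are made up for illustration.
UPSTREAM: Dict[str, List[str]] = {
    "rpt_user_info_d": ["example_mid_table"],
    "example_mid_table": ["example_source_a", "example_source_b"],
}


def all_upstream(table: str, upstream: Dict[str, List[str]]) -> List[str]:
    """Return every direct and indirect upstream table, in breadth-first order."""
    seen = set()
    order: List[str] = []
    queue = deque(upstream.get(table, []))
    while queue:
        current = queue.popleft()
        if current in seen:
            continue  # skip tables already visited (shared parents or cycles)
        seen.add(current)
        order.append(current)
        queue.extend(upstream.get(current, []))
    return order


if __name__ == "__main__":
    print(all_upstream("rpt_user_info_d", UPSTREAM))
```

Breadth-first order lists direct parents before more distant ancestors, which is roughly how a lineage view reads from the selected table outward.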
What to do next
You have learned how to collect metadata and view table information in Data Map. You can now proceed to the next tutorial to learn how to monitor the quality of the data that is generated by the nodes that you developed. For more information, see Configure rules to monitor data quality.