E-MapReduce (EMR) Data Platform entered the maintenance state at 21:00 on February 21, 2022 (UTC+8). The data development features in EMR Data Platform are no longer updated. If you still use EMR Data Platform for data development, migrate to DataWorks at the earliest opportunity. This topic describes the impact of this change and how to complete the migration.
What stopped and what still works
As of 21:00 on February 21, 2022 (UTC+8), the following data development operations are no longer supported in EMR Data Platform:
Creating projects
Creating and running jobs
Scheduling workflows
Operations and maintenance (O&M) for data development
Projects created before this date are not affected. In those existing projects, you can continue to run jobs and schedule workflows.
Capabilities available after migrating to DataWorks
DataWorks is an end-to-end data development and governance platform built on the big data development methodology that Alibaba Group has refined over more than 10 years. EMR is deeply integrated with DataWorks.
After migrating, you get access to the following DataWorks capabilities:
Data lake ingestion
Data modeling
Data development
Data scheduling
Data governance
Data security assurance
Migration process
The Alibaba Cloud DataWorks on EMR team provides comprehensive technical support throughout the migration.
| Phase | Operation | Participant | Estimated duration |
|---|---|---|---|
| 1. Preparations | Review the Getting started with DataWorks on EMR topic and the Migrate EMR projects to DataWorks topic. Contact the DataWorks on EMR team if you have questions. | DataWorks on EMR team and customers | 1 day |
| 2. Migration | Create a DataLake cluster (recommended) or a custom cluster, then associate it with a DataWorks workspace. Migrate your EMR projects to DataWorks. The DataWorks on EMR team can perform the migration automatically on your behalf. For step-by-step instructions, see Getting started with DataWorks on EMR. For project migration details, see Migrate EMR projects to DataWorks. | DataWorks on EMR team and customers | 2-4 days |
| 3. Check | Update job configurations such as data paths. Verify that jobs run as expected in the new environment. | DataWorks on EMR team and customers | 2-4 weeks |
| 4. Completion | Suspend workflows in EMR Data Platform and use DataWorks for scheduling. Switch data development to the new cluster and suspend the original cluster. | DataWorks on EMR team and customers | 1 week |
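The path update in the Check phase can be sketched as a small script. This is a minimal illustration only, not a DataWorks or EMR API: it assumes job configurations have been exported as plain dictionaries, and the field names, the `hdfs://old-cluster/` prefix, and the `oss://example-bucket/` prefix are all hypothetical placeholders for your own values.

```python
from typing import Any, List, Mapping

# Hypothetical mapping from storage prefixes on the original cluster
# to the corresponding prefixes in the new environment.
PATH_PREFIX_MAP = {
    "hdfs://old-cluster/warehouse/": "oss://example-bucket/warehouse/",
}


def rewrite_paths(value: Any, prefix_map: Mapping[str, str]) -> Any:
    """Return a copy of a job configuration with old path prefixes replaced.

    Walks nested dicts and lists; strings have every old prefix replaced,
    and all other values are returned unchanged.
    """
    if isinstance(value, str):
        for old, new in prefix_map.items():
            value = value.replace(old, new)
        return value
    if isinstance(value, dict):
        return {k: rewrite_paths(v, prefix_map) for k, v in value.items()}
    if isinstance(value, list):
        return [rewrite_paths(v, prefix_map) for v in value]
    return value


def find_stale_paths(value: Any, old_markers: List[str]) -> List[str]:
    """Collect strings that still contain any marker of the old environment."""
    if isinstance(value, str):
        return [value] if any(m in value for m in old_markers) else []
    if isinstance(value, dict):
        return [s for v in value.values() for s in find_stale_paths(v, old_markers)]
    if isinstance(value, list):
        return [s for v in value for s in find_stale_paths(v, old_markers)]
    return []


# Hypothetical exported job configuration.
job = {
    "name": "daily_etl",
    "args": ["--input", "hdfs://old-cluster/warehouse/logs"],
    "env": {"OUTPUT_DIR": "hdfs://old-cluster/warehouse/out"},
    "retries": 3,
}

migrated = rewrite_paths(job, PATH_PREFIX_MAP)
stale = find_stale_paths(migrated, ["hdfs://old-cluster/"])
```

Checking for leftover references after the rewrite gives a quick list of paths that the prefix mapping did not cover, which you can review by hand before you verify the jobs in the new environment.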
Get migration support
To get help with your migration, search for group ID 16970006464 in DingTalk and join the support group. An Alibaba Cloud engineer will contact you to plan your migration.