New Features

Platform for AI (PAI) - DataJuicer on DLC officially released

DLC now supports submitting DataJuicer framework jobs, enabling efficient cleaning, filtering, transformation, and augmentation of large-scale datasets through over 100 operators, multi-node scalability (single or distributed), and high availability with self-healing. This enhances text and multimodal data processing capabilities for large model workloads.
Content

Target audience: Large and medium-sized internet companies, AI firms, and academic research institutions. New Features/Specifications: DLC supports submitting DataJuicer jobs with over 100 operators (including aggregator, de-duplicator, filter, formatter, grouper, mapper, and selector), flexible scaling (single-node or multi-node), and self-healing for high availability. This enables efficient data cleaning, filtering, transformation, and augmentation, enhancing text and multimodal data processing for large model applications.

Help Document

https://www.alibabacloud.com/help/zh/pai/user-guide/quickly-submit-a-datajuicer-task

7th Gen ECS Is Now Available

Increase instance computing power by up to 40% and Fully equipped with TPM chips.
Powered by Third-generation Intel® Xeon® Scalable processors (Ice Lake).

  • Sales Support

    1 on 1 presale consultation

  • After-Sales Support

    24/7 Technical Support 6 Free Tickets per Quarter Faster Response

  • Alibaba Cloud offers highly flexible support services tailored to meet your exact needs.