DataWorks supports the ODPS Spark node type. This topic describes how to create and configure an ODPS Spark node, using three examples: WordCount, Python, and Lenet (BigDL).

WordCount

  1. In Data Analytics, right-click Workflow and select Create Workflow.
  2. Right-click Resource, choose Create Resource > JAR, and upload the compiled JAR package.

    For more information about the WordCount sample code, see WordCount samples.

  3. Right-click Data Analytics under Workflow and choose Create Data Analytics Node > ODPS Spark to create an ODPS Spark node.

  4. In the ODPS Spark dialog box that appears, configure the node information.

  5. After the ODPS Spark node is configured, publish and run the node.

Python

  1. Prepare and upload the Python resource.
  2. Create and configure an ODPS Spark node.

  3. After the ODPS Spark node is configured, publish and run the node.
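
As an illustration of what the Python resource in step 1 might contain, the following is a minimal PySpark word-count sketch. It is not the official sample: the file name wordcount_sketch.py, the function names, and the input path argument are all assumptions made for illustration, and a job running on MaxCompute may well read from a table rather than a text file.

```python
import sys
from operator import add


def tokenize(line):
    """Split one line of text into lowercase words."""
    return line.lower().split()


def count_words(lines_rdd):
    """Word count over an RDD of lines: flatMap -> (word, 1) -> reduceByKey."""
    return (
        lines_rdd.flatMap(tokenize)
        .map(lambda word: (word, 1))
        .reduceByKey(add)
    )


def main(input_path):
    # Imported lazily so the helpers above can be checked without Spark installed.
    from pyspark.sql import SparkSession

    spark = SparkSession.builder.appName("wordcount_sketch").getOrCreate()
    try:
        # input_path is a hypothetical text file location passed as an argument.
        lines = spark.sparkContext.textFile(input_path)
        for word, count in sorted(count_words(lines).collect()):
            print(word, count)
    finally:
        spark.stop()


if __name__ == "__main__":
    if len(sys.argv) == 2:
        main(sys.argv[1])
    else:
        print("usage: wordcount_sketch.py <input_path>", file=sys.stderr)
```

Keeping the tokenizing logic in a plain function means it can be checked locally before the script is uploaded, since only main depends on a Spark runtime.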

Lenet (BigDL)

  1. Upload the JAR package and the data file (mnist.zip, uploaded as a resource of the archive type).
  2. Create and configure an ODPS Spark node.

  3. After the ODPS Spark node is configured, publish and run the node.
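
Step 1 assumes that the data is already packaged as mnist.zip. If the data is still a local folder, the archive could be built with a short script such as the sketch below. The function name build_archive and the folder layout are hypothetical, and the layout the Lenet job actually expects inside the archive is not specified here, so check it against the sample before uploading.

```python
import zipfile
from pathlib import Path


def build_archive(data_dir, archive_path="mnist.zip"):
    """Zip every file under data_dir, storing paths relative to data_dir."""
    data_dir = Path(data_dir)
    archive_path = Path(archive_path)
    with zipfile.ZipFile(archive_path, "w", zipfile.ZIP_DEFLATED) as zf:
        for path in sorted(data_dir.rglob("*")):
            # Skip directories, and skip the archive itself if it lives in data_dir.
            if path.is_file() and path.resolve() != archive_path.resolve():
                zf.write(path, arcname=path.relative_to(data_dir).as_posix())
    return archive_path
```

For example, build_archive("mnist") would write mnist.zip to the current directory, with entries named relative to the mnist folder.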