MaxCompute Studio allows you to develop Python user defined functions (UDFs). This topic describes how to develop, test, and publish a Python UDF.

Prerequisites

The following operations are complete:

Develop a Python UDF

  1. In the Project section, right-click scripts under the MaxCompute script module and choose New > MaxCompute Python.
  2. In the Create new MaxCompute python class dialog box, enter a class name in the Name field, select python UDF from the Kind drop-down list, and then click OK.
  3. Write the UDF code in the code editor.

Test the Python UDF

After the UDF is developed, you must test whether the UDF code works as expected. MaxCompute Studio supports local runs for you to locally test code. You can download some sample data from a table to your local computer, run the code on the sample data, and debug the code.

  1. Right-click the developed Python UDF and select RUN.
  2. In the Edit configuration dialog box, set the parameters and click OK.
    **
    • MaxCompute project: the MaxCompute project that is used to run the UDF. For a local run, select local.
    • MaxCompute table: the name of the MaxCompute table that is used to run the UDF.
    • Table columns: the columns in the MaxCompute table that are used to run the UDF.
    • Download Record limit: the maximum number of records that can be downloaded from the table. Default value: 100.
    Note
    • If the specified data is already downloaded, MaxCompute Studio does not download the data again. To download the data again, run the tunnel command on the MaxCompute client.
    • By default, 100 records are downloaded. To download more sample data, run the tunnel command on the MaxCompute client or use the table download feature of MaxCompute Studio.
    • After the sample data is downloaded, you can view the data in the data file of the table under the warehouse directory.
  3. MaxCompute Studio obtains the sample data from the data file based on the columns you specified and calls the UDF to locally run it on the sample data.
    Note Local runs are implemented by using the pyou script that the MaxCompute SDK for Python (PyODPS) provides. The command for starting a local run is pyou hello.Plus<data. After you install PyODPS, you can run the corresponding command to check whether the pyou script exists:
    • For Windows, run the ${python}/../Scripts/pyou command.
    • For macOS, run the ${python}/../pyou command.
  4. View the output on the Console tab, check whether the UDF works as expected, and debug the code if required.

Publish the Python UDF

After the Python UDF is tested, you can publish it to the production environment. For more information, see Package, upload, and register a Java program.