All Products
Search
Document Center

Platform For AI:Build a model to predict the output power of a power plant

Last Updated:Dec 11, 2023

This topic describes how to use a preset template in Machine Learning Designer to build a model to predict the output power of a power plant.

Background information

In the following sample pipeline, the power generation data of a combined cycle power plant is used to build a model to predict the output power. This example also shows the application of machine learning in industrial production. The output power of a wind energy converter determines the electrical energy that one generator can produce. If you can accurately predict the output power of the generator, you can evaluate and implement the power production plan to reduce resource waste.

Prerequisites

Datasets

The dataset of the combined cycle power plant that is used in the following sample pipeline is a dataset that University of California, Irvine (UCI) provides in Machine Learning Repository. For more information, see Combined Cycle Power Plant Data Set. The dataset contains 9,568 data entries. Each data entry includes the AT, V, AP, RH, and PE fields, which indicate the measurements of the temperature, pressure, humidity, pressure intensity, and output power in sequence. The following figure shows the sample data that is used in the pipeline. 数据集

Procedure

  1. Go to the Machine Learning Designer page.

    1. Log on to the PAI console.

    2. In the left-side navigation pane, click Workspaces. On the Workspaces page, click the name of the workspace that you want to manage.

    3. In the left-side navigation pane, choose Model Training > Visualized Modeling (Designer) to go to the Machine Learning Designer page.

  2. Create a pipeline.

    1. On the Visualized Modeling (Designer) page, click the Preset Templates tab.

    2. Find the Power Plant Output Forecast template and click Create.

    3. In the Create Pipeline dialog box, configure the required parameters. You can use the default values.

      The value specified for the Pipeline Data Path parameter is the Object Storage Service (OSS) bucket path of the temporary data and models generated during the runtime of the pipeline.

    4. Click OK.

      It takes about 10 seconds to create the pipeline.

    5. On the Pipelines tab, double-click the created Power Plant Output Forecast pipeline to open the pipeline.

    6. View the components of the pipeline on the canvas as shown in the following figure. The system automatically creates the pipeline based on the preset template.

      发电厂工作流

      Area

      Description

      The Corrcoef component measures the impact of each feature on the output power. After you run the pipeline, you can right-click the Corrcoef component on the canvas and select Visual Analysis to view the impact of each feature on the output power.

      The split component divides the dataset into a training dataset and a prediction dataset by a ratio of 8 to 2.

      The Linear Regression component performs regression modeling.

      The Prediction component predicts output power based on the prediction dataset. The Evaluation component evaluates the prediction accuracy of the model.

  3. Run the pipeline and view the results.

    1. In the upper-left corner of the canvas, click the Run icon.

    2. After you run the pipeline, right-click the Correcoef component on the canvas and select Visual Analysis.

    3. In the Corrcoef section, view the impact of each feature on the output power.

      系数矩阵结果The preceding figure shows that the temperature has the greatest impact on the output power, followed by the pressure, the humidity, and the pressure intensity.

    4. Right-click the Linear Regression component on the canvas and choose View Data > Output Model Evaluation Table to view the model evaluation results.

    5. Right-click the Evaluation component on the canvas and choose View Data > Output Port to view the results that indicate the model performance.