By Garvin Li
This article implements an ad click-through rate (CTR) prediction scenario. Ad CTR prediction is a typical application in the advertising industry. By using history data to train the prediction model, this prediction method predicts daily increment data, and finds and advertises samples that meet the ad CTR standard.
The detailed fields are as follows:
Because data shown in the following screenshot is randomly generated by using the random algorithm, this experiment doesn't evaluate results, and mainly describes the experiment establishment and the use and scheduling of DataWorks. History data of 20160919 and 20160920 is used to predict 20160921 data. The MaxCompute partition table is used.
The following diagram shows the experiment process.
The experiment can be roughly divided into four modules: data source importing (ads), data pre-processing (normalization), model training (binary logistic regression), and predicting (prediction).
The intermediate process includes two steps: data normalization and model training. Model training is to use history data to train the generated prediction model. (For more principle details, please see Heart disease prediction case)
The list of prediction results is "ad_result-1", as shown below.
Go to the homepage of the console, click DataWorks to access the Data IDE workspace.
DataWorks and the machine learning platform share the same set of projects. Select the project where the experiment to be scheduled for is located, and click Start Data Modeling.
Click New and select New Task
In the configuration section of the created task, select Node Task for Task Type and Machine Learning for Type.
After the node task has been created, select the machine learning task to be scheduled for and select scheduling time in the configuration bar on the right side. In this experiment, we choose to perform training and push information at 00:00 each day.
Click Submit. Submitted jobs will be effective next day.
After the scheduling task has been submitted, click Maintain to view logs
To learn more about Alibaba Cloud Machine Learning Platform for Artificial Intelligence (PAI), visit www.alibabacloud.com/product/machine-learning
Alibaba Clouder - October 18, 2019
Apache Flink Community China - March 29, 2021
Alibaba Container Service - February 12, 2021
Alibaba Cloud MaxCompute - March 3, 2020
Alibaba Clouder - April 1, 2021
Apache Flink Community China - September 27, 2020
ET Brain is Alibaba Cloud’s ultra-intelligent AI Platform for solving complex business and social problemsLearn More
Alibaba Cloud provides big data consulting services to help enterprises leverage advanced data technology.Learn More
Alibaba Cloud experts provide retailers with a lightweight and customized big data consulting service to help you assess your big data maturity and plan your big data journey.Learn More
An end-to-end platform that provides various machine learning algorithms to meet your data mining and analysis requirements.Learn More
More Posts by GarvinLi