By Garvin Li
This article implements an ad click-through rate (CTR) prediction scenario. Ad CTR prediction is a typical application in the advertising industry. By using history data to train the prediction model, this prediction method predicts daily increment data, and finds and advertises samples that meet the ad CTR standard.
The detailed fields are as follows:
Because data shown in the following screenshot is randomly generated by using the random algorithm, this experiment doesn't evaluate results, and mainly describes the experiment establishment and the use and scheduling of DataWorks. History data of 20160919 and 20160920 is used to predict 20160921 data. The MaxCompute partition table is used.
The following diagram shows the experiment process.
The experiment can be roughly divided into four modules: data source importing (ads), data pre-processing (normalization), model training (binary logistic regression), and predicting (prediction).
The intermediate process includes two steps: data normalization and model training. Model training is to use history data to train the generated prediction model. (For more principle details, please see Heart disease prediction case)
The list of prediction results is "ad_result-1", as shown below.
Go to the homepage of the console, click DataWorks to access the Data IDE workspace.
DataWorks and the machine learning platform share the same set of projects. Select the project where the experiment to be scheduled for is located, and click Start Data Modeling.
Click New and select New Task
In the configuration section of the created task, select Node Task for Task Type and Machine Learning for Type.
After the node task has been created, select the machine learning task to be scheduled for and select scheduling time in the configuration bar on the right side. In this experiment, we choose to perform training and push information at 00:00 each day.
Click Submit. Submitted jobs will be effective next day.
After the scheduling task has been submitted, click Maintain to view logs
To learn more about Alibaba Cloud Machine Learning Platform for Artificial Intelligence (PAI), visit www.alibabacloud.com/product/machine-learning
GarvinLi - December 27, 2018
Alibaba Clouder - November 13, 2018
Alibaba Container Service - April 11, 2019
Alibaba Clouder - March 12, 2019
Alibaba Clouder - September 29, 2017
Alibaba Container Service - July 16, 2019
An end-to-end platform that provides various machine learning algorithms to meet your data mining and analysis requirements.Learn More
A secure solution to migrate TB-level or PB-level data to Alibaba Cloud.Learn More
A premium, serverless, and interactive analytics serviceLearn More
Data Integration is an all-in-one data synchronization platform. The platform supports online real-time and offline data exchange between all data sources, networks, and locations.Learn More
More Posts by GarvinLi