This topic describes how to use population census data to build a statistical model. You can use the model to analyze the impact of academic degree on income based on attributes such as age, job type, and education level.
Datasets
The experiment described in this topic uses an open source dataset from the Machine Learning Repository of the University of California, Irvine (UCI). For more information, see Adult Data Set. The dataset is the population census data of a region and contains 32,561 data records in total. The following table describes the fields in the dataset.
Field | Description | Data type |
---|---|---|
age | The age of the person. | DOUBLE |
workclass | The job type of the person. | STRING |
fnlwgt | The ID of the person. | STRING |
education | The education level of the person. | STRING |
education_num | The years of education that the person receives. | DOUBLE |
maritial_status | The marital status of the person. | STRING |
occupation | The job of the person. | STRING |
relationship | The family relationship of the person. | STRING |
capital_gain | The capital gain of the person. | STRING |
capital_loss | The capital loss of the person. | STRING |
hours_per_week | The weekly working hours of the person. | DOUBLE |
native_country | The nationality of the person. | STRING |
income | The income of the person. | STRING |
Procedure
- Go to the Machine Learning Designer page.
- Log on to the Machine Learning Platform for AI console.
- In the left-side navigation pane, click Workspaces. On the Workspaces page, click the name of the workspace that you want to manage.
- In the left-side navigation pane, choose to go to the Machine Learning Designer page.
- In the upper-right corner of the Visualized Modeling (Machine Learning Designer) page, click Go to Studio (Old Version).
- Create an experiment.
- Run the experiment and view the results.