About

We're leaders in cloud computing and digital transformation, teaming up once more to offer top-notch training. Our collaboration focuses on the importance of data in today's business world. Join Batch 2 of our program to gain valuable skills and certifications in Big Data. With a proven track record from our first batch, this program offers a dynamic learning experience with an advanced curriculum, expert guidance, and hands-on practice. Prepare to innovate, grow your organization, and lead in data analytics. Welcome to a journey of learning and success!

Self-paced Learning and Self Test

Stage I: Big Data Fundamentals

Module

Course Name

Objective

Big Data Basics

Big Data Basic Concepts

In this video, we introduce the core components of Alibaba Cloud’s Big Data ecosystem. Learn how tools such as DataWorks, MaxCompute, QuickBI, and DataV fit together, and see how each tool can be used to manage and process large datasets more effectively.

Big Data Basics

Alibaba Cloud Big Data Products Overview

In this video, we introduce the core components of Alibaba Cloud’s Big Data ecosystem. Learn how tools such as DataWorks, MaxCompute, QuickBI, and DataV fit together, and see how each tool can be used to manage and process large datasets more effectively.

Big Data Basics

Scenario in Which Products Would be Used

This this video, we examine some real data processing scenarios, and look at how Ailbaba Cloud services can be tied together to address a large number of different storage and analysis requirements.

DataWorks

DataWorks Overview

This short video introduces the topics which will be covered in the following several video lectures, such as Data Acquisition, Processing, and Quality Monitoring with DataWorks.

DataWorks

DataWorks Introduction

In this video lecture, you'll learn what DataWorks is, and how it fits into the Alibaba Cloud Big Data ecosystem. Develop a fuller understanding of the capabilities of DataWorks with analyses of common use cases.

DataWorks

Demo of DataWorks - Data Acquisition

Learn how to create a new DataWorks Workspace, and import log data into MaxCompute using DataWorks Data Integration.

DataWorks

Demo of DataWorks - Data Processing

Learn how DataWorks can be used to create "Business Flows" (Workflows) for automatic, scheduled data processing. Create a multi-stage data processing workflow and run it from the DataWorks console. See how User Defined Functions (UDFs) can be used to add custom features to MaxCompute SQL.

DataWorks

Demo of DataWorks - Data Quality Monitoring

See how DataWorks Data Quality monitoring can be used to ensure consistent data quality each time data import and processing tasks are run. See how Data Quality metrics are selected and applied to MaxCompute tables.

DataWorks

DataWorks Introduction Summary and Review

In this video, we quickly summarize the DataWorks concepts covered in the previous few sections.

DataWorks

More Details Of DataWorks

Learn more about DataWorks by visiting the DataWorks documentation. Here, you can learn about the specific features offered by different DataWorks editions, as well as watch useful video walkthroughs, access the DataWorks FAQ page, and see specific step-by-step guides to using core features like Operation Center or DataService Studio.

Big Data Blog

Setting up a MySQL data source

Learn how to create, configure, and load sample e-commerce data into an RDS MySQL database. This database is used in the next section as a data source for a DataWorks project.

Big Data Blog

Importing and Processing MySQL data using DataWorks

Learn how to import data from MySQL into MaxCompute using DataWorks, how to process that data by creating a "Business Flow" (Workflow), and learn how to export your processed data from MaxCompute back into your MySQL database.

Big Data Basics
Big Data Basic Concepts
In this video, we introduce the core components of Alibaba Cloud’s Big Data ecosystem. Learn how tools such as DataWorks, MaxCompute, QuickBI, and DataV fit together, and see how each tool can be used to manage and process large datasets more effectively.
Big Data Basics
Alibaba Cloud Big Data Products Overview
In this video, we introduce the core components of Alibaba Cloud’s Big Data ecosystem. Learn how tools such as DataWorks, MaxCompute, QuickBI, and DataV fit together, and see how each tool can be used to manage and process large datasets more effectively.
Big Data Basics
Scenario in Which Products Would be Used
This this video, we examine some real data processing scenarios, and look at how Ailbaba Cloud services can be tied together to address a large number of different storage and analysis requirements.
DataWorks
DataWorks Overview
This short video introduces the topics which will be covered in the following several video lectures, such as Data Acquisition, Processing, and Quality Monitoring with DataWorks.
DataWorks
DataWorks Introduction
In this video lecture, you'll learn what DataWorks is, and how it fits into the Alibaba Cloud Big Data ecosystem. Develop a fuller understanding of the capabilities of DataWorks with analyses of common use cases.
DataWorks
Demo of DataWorks - Data Acquisition
Learn how to create a new DataWorks Workspace, and import log data into MaxCompute using DataWorks Data Integration.
DataWorks
Demo of DataWorks - Data Processing
Learn how DataWorks can be used to create "Business Flows" (Workflows) for automatic, scheduled data processing. Create a multi-stage data processing workflow and run it from the DataWorks console. See how User Defined Functions (UDFs) can be used to add custom features to MaxCompute SQL.
DataWorks
Demo of DataWorks - Data Quality Monitoring
See how DataWorks Data Quality monitoring can be used to ensure consistent data quality each time data import and processing tasks are run. See how Data Quality metrics are selected and applied to MaxCompute tables.
DataWorks
DataWorks Introduction Summary and Review
In this video, we quickly summarize the DataWorks concepts covered in the previous few sections.
DataWorks
More Details Of DataWorks
Learn more about DataWorks by visiting the DataWorks documentation. Here, you can learn about the specific features offered by different DataWorks editions, as well as watch useful video walkthroughs, access the DataWorks FAQ page, and see specific step-by-step guides to using core features like Operation Center or DataService Studio.
Big Data Blog
Setting up a MySQL data source
Learn how to create, configure, and load sample e-commerce data into an RDS MySQL database. This database is used in the next section as a data source for a DataWorks project.
Big Data Blog
Importing and Processing MySQL data using DataWorks
Learn how to import data from MySQL into MaxCompute using DataWorks, how to process that data by creating a "Business Flow" (Workflow), and learn how to export your processed data from MaxCompute back into your MySQL database.
Stage II: Data Warehousing and Data Processing

Module

Course Name

Objectives

Course Video

Python Structured Data Processing

Introduction to Python Pandas and business scenarios

Learn how to install the Anaconda Python distribution and open a new Jupyter notebook, which can be used for processing data with Pandas.

Python Structured Data Processing

Loading data from different data types

In this video lecture, you'll learn how to import CSV or JSON data into Pandas. The video also demonstrates how to access all or part of the data once it is loaded into Pandas.

Python Structured Data Processing

Problems of the raw data

Learn how to identify common issues with raw datasets such as missing (null) values.

Python Structured Data Processing

Data scrubbing

Learn how Pandas can be used to clean your raw data by replacing NaNs and NULL values, and by standardizing data types.

Python Structured Data Processing

Data analysis and virtualization

Learn how to filter, sort, and group data in Pandas, as well as generate plots which allow you to gain insights by visualizing data instribution, trends, and correlations.

Python Structured Data Processing

Deal with more than one data set

In this video lecture, you'll learn how to work with multiple datasets in Pandas at the same time, as well as how to establish connections between datasets (merge and join) and plot data relationships.

SQL for Beginners

SQL For Beginners Course Objectives

This video briefly introduces the SQL content discussed in the following sections.

SQL for Beginners

Select Statement Basic

Learn how to query datasets with the SQL SELECT statement. See how SELECT can be used to filter, group, and sort data. This section also discusses more advanced usage, such as using the DISTINCT keyword on tables which contain NULL values.

SQL for Beginners

SELECT statement with WHERE

Learn advanced filtering techniques with SQL SELECT using the WHERE keyword. Learn how WHERE can be used with logical keywords (AND, OR, XOR, NOT) as well as arithmetic operators and comparison operators such as IN, IS, NULL, and LIKE.

SQL for Beginners

SELECT with ORDER BY and Tips

Learn advanced filtering techniques with SQL SELECT using the ORDER BY keyword. See how ORDER BY can be used to sort results in ascending or descending order.

SQL for Beginners

Table Join

Learn how SQL JOIN statements work. See how inner, outer, left, and right joins are used to combine multiple tables in SQL.

SQL for Beginners

Troubleshooting

In this section, learn some of the basic skills needed to read and understand SQL errors and warnings

SQL for Beginners

SQL Models for Syntax Checking

In this video lecture, you'll learn how MySQL databases allow you to change SQL modes, allowing you to take advantage of different SQL features for different use-cases.

MaxCompute Basic

MaxCompute Course Content Briefing

In this video, we provide a brief outline of the following sections, which focus on Alibaba Cloud's data warehousing tool, MaxCompute.

MaxCompute Basic

Introduction of MaxCompute

In this video lecture, you'll learn about MaxCompute, Alibaba Cloud's distributed data storage and processing tool. Learn how MaxCompute works, what it can do, and how it is used at Alibaba Group to store and process petabytes of data.

MaxCompute Basic

MaxCompute Architecture

In this video, we provide a brief outline of the following sections, which focus on Alibaba Cloud's data warehousing tool, MaxCompute.

MaxCompute Basic

Basic Concepts of MaxCompute

MaxCompute In this video, we explore key concepts in MaxCompute, including Projects, Tables, Partitions, and Resources.

MaxCompute Basic

How to Use MaxCompute

In this section, we review the interfaces that allow you to interact with MaxCompute, including the MaxCompute CLI, the MaxCompute Studio IDE plugin, and DataWorks.

MaxCompute Basic

Quick Start Guide of MaxCompute

Learn how to import data from OSS into MaxCompute using DataWorks, and see how the data can be manipulated and modified using the DataWorks console.

MaxCompute Basic

MaxCompute Users And Roles

Learn how MaxCompute users and roles can be used to control data access privilages within MaxCompute projects.

Python Structured Data Processing
Introduction to Python Pandas and business scenarios
Learn how to install the Anaconda Python distribution and open a new Jupyter notebook, which can be used for processing data with Pandas.
Python Structured Data Processing
Loading data from different data types
In this video lecture, you'll learn how to import CSV or JSON data into Pandas. The video also demonstrates how to access all or part of the data once it is loaded into Pandas.
Python Structured Data Processing
Problems of the raw data
Learn how to identify common issues with raw datasets such as missing (null) values.
Python Structured Data Processing
Data scrubbing
Learn how Pandas can be used to clean your raw data by replacing NaNs and NULL values, and by standardizing data types.
Python Structured Data Processing
Data analysis and virtualization
Learn how to filter, sort, and group data in Pandas, as well as generate plots which allow you to gain insights by visualizing data instribution, trends, and correlations.
Python Structured Data Processing
Deal with more than one data set
In this video lecture, you'll learn how to work with multiple datasets in Pandas at the same time, as well as how to establish connections between datasets (merge and join) and plot data relationships.
SQL for Beginners
SQL For Beginners Course Objectives
This video briefly introduces the SQL content discussed in the following sections.
SQL for Beginners
Select Statement Basic
Learn how to query datasets with the SQL SELECT statement. See how SELECT can be used to filter, group, and sort data. This section also discusses more advanced usage, such as using the DISTINCT keyword on tables which contain NULL values.
SQL for Beginners
SELECT statement with WHERE
Learn advanced filtering techniques with SQL SELECT using the WHERE keyword. Learn how WHERE can be used with logical keywords (AND, OR, XOR, NOT) as well as arithmetic operators and comparison operators such as IN, IS, NULL, and LIKE.
SQL for Beginners
SELECT with ORDER BY and Tips
Learn advanced filtering techniques with SQL SELECT using the ORDER BY keyword. See how ORDER BY can be used to sort results in ascending or descending order.
SQL for Beginners
Table Join
Learn how SQL JOIN statements work. See how inner, outer, left, and right joins are used to combine multiple tables in SQL.
SQL for Beginners
Troubleshooting
In this section, learn some of the basic skills needed to read and understand SQL errors and warnings
SQL for Beginners
SQL Models for Syntax Checking
In this video lecture, you'll learn how MySQL databases allow you to change SQL modes, allowing you to take advantage of different SQL features for different use-cases.
MaxCompute Basic
MaxCompute Course Content Briefing
In this video, we provide a brief outline of the following sections, which focus on Alibaba Cloud's data warehousing tool, MaxCompute.
MaxCompute Basic
Introduction of MaxCompute
In this video lecture, you'll learn about MaxCompute, Alibaba Cloud's distributed data storage and processing tool. Learn how MaxCompute works, what it can do, and how it is used at Alibaba Group to store and process petabytes of data.
MaxCompute Basic
MaxCompute Architecture
In this video, we provide a brief outline of the following sections, which focus on Alibaba Cloud's data warehousing tool, MaxCompute.
MaxCompute Basic
Basic Concepts of MaxCompute
MaxCompute In this video, we explore key concepts in MaxCompute, including Projects, Tables, Partitions, and Resources.
MaxCompute Basic
How to Use MaxCompute
In this section, we review the interfaces that allow you to interact with MaxCompute, including the MaxCompute CLI, the MaxCompute Studio IDE plugin, and DataWorks.
MaxCompute Basic
Quick Start Guide of MaxCompute
Learn how to import data from OSS into MaxCompute using DataWorks, and see how the data can be manipulated and modified using the DataWorks console.
MaxCompute Basic
MaxCompute Users And Roles
Learn how MaxCompute users and roles can be used to control data access privilages within MaxCompute projects.
Stage III: Advanced Data Processing Tools and Techniques

Module

Course Name

Objective

MaxCompute SQL Development

MaxCompute SQL Overview

Gain a basic understanding of MaxCompute's SQL dialect, and how it differs from common SQL dialects such as MySQL's SQL.

MaxCompute SQL Development

Data Definition Language (DDL)

Develop an understanding of the basic MaxCompute SQL DDL operations (creating, deleting, and modifying tables).

MaxCompute SQL Development

Data Manipulation Language (DML)

Develop an understanding MaxCompute's SQL DML operations (selecting and inserting records).

MaxCompute SQL Development

Built-In Function (Part 1)

In this video, we review some of MaxCompute SQL's built-in functions, including mathematical functions like MAX, ABS, and RAND.

MaxCompute SQL Development

Built-In Function (Part 2)

In this video, we review some of MaxCompute SQL's built-in functions, including time manipulation functions such as GETDATE, DATEPART, and WEEKDAY.

MaxCompute SQL Development

Built-In Function (Part 3)

In this video, you will learn how SQL functions are used in general, along with common use-cases for common SQL functions in most SQL dialects.

MaxCompute SQL Development

MaxCompute SQL Development Summary And Review

In this video lecture, we review the MaxCompute concepts discussed in the previous sections, with a focus on MaxCompute SQL development.

MaxCompute User Define Function

Introduction Of UDF

Learn the basics of MaxCompute's User Defined Functions (UDFs), which allow you to add your own new SQL functionality to MaxCompute's SQL language. Learn the differences between the major UDF types: UDF, UDAF, and UDTF.

MaxCompute User Define Function

The Implement Logical Of UDF

Learn how to create your own MaxCompute UDF functions in Java.

MaxCompute User Define Function

UDF Development Process

In this video lecture, we explain the UDF development process in detail, including the steps needed to create and compile a Java UDF in a local IDE.

MaxCompute User Define Function

UDF Summary And Review

In this section, we briefly review User Defined Functions, as covered in the previous three sections.

MaxCompute SQL Development
MaxCompute SQL Overview
Gain a basic understanding of MaxCompute's SQL dialect, and how it differs from common SQL dialects such as MySQL's SQL.
MaxCompute SQL Development
Data Definition Language (DDL)
Develop an understanding of the basic MaxCompute SQL DDL operations (creating, deleting, and modifying tables).
MaxCompute SQL Development
Data Manipulation Language (DML)
Develop an understanding MaxCompute's SQL DML operations (selecting and inserting records).
MaxCompute SQL Development
Built-In Function (Part 1)
In this video, we review some of MaxCompute SQL's built-in functions, including mathematical functions like MAX, ABS, and RAND.
MaxCompute SQL Development
Built-In Function (Part 2)
In this video, we review some of MaxCompute SQL's built-in functions, including time manipulation functions such as GETDATE, DATEPART, and WEEKDAY.
MaxCompute SQL Development
Built-In Function (Part 3)
In this video, you will learn how SQL functions are used in general, along with common use-cases for common SQL functions in most SQL dialects.
MaxCompute SQL Development
MaxCompute SQL Development Summary And Review
In this video lecture, we review the MaxCompute concepts discussed in the previous sections, with a focus on MaxCompute SQL development.
MaxCompute User Define Function
Introduction Of UDF
Learn the basics of MaxCompute's User Defined Functions (UDFs), which allow you to add your own new SQL functionality to MaxCompute's SQL language. Learn the differences between the major UDF types: UDF, UDAF, and UDTF.
MaxCompute User Define Function
The Implement Logical Of UDF
Learn how to create your own MaxCompute UDF functions in Java.
MaxCompute User Define Function
UDF Development Process
In this video lecture, we explain the UDF development process in detail, including the steps needed to create and compile a Java UDF in a local IDE.
MaxCompute User Define Function
UDF Summary And Review
In this section, we briefly review User Defined Functions, as covered in the previous three sections.
Stage IV: Visualization, Machine Learning, and AI

Module

Course Name

Objectives

Course Video

QuickBI

QuickBI Course Content Briefing

This video outlines the topics covered in the following several video lectures, including basic concepts in Alibaba Cloud QuickBI, proper chart selection, and construction of QuickBI dashboards.

QuickBI

Background Introduction Of BI

Learn how to choose the right type of charts to display your data, and how different types of charts are best suited to different roles (comparison, relationship, distribution, composition).

QuickBI

Features Highlights Of QuickBI

This section introduces QuickBI and explains how QuickBI fits into the Alibaba Cloud Big Data ecosystem.

QuickBI

Commonly Used Charts Introduction Part One

In this video lecture, we give an in-depth explanation of different chart types, and examine scenarios in which each chart type is a good (or bad) choice.

QuickBI

Demo Of Commonly Used Charts Part One

See a hands-on demo of the QuickBI console, and learn how to construct a basic data dashboard.

QuickBI

Commonly Used Charts Introduction Part Two

Learn about additional useful chart types including the funnel chart, tree chart, tree map, and conversion chart.

QuickBI

Demo Of Commonly Used Charts Part Two

See a hands-on demo of more advanced charts in QuickBI, such as the conversion chart.

QuickBI

Commonly Used Charts Introduction Part Three

In this section, we examine some additional dashboard elements in QuickBI such as the card, word cloud, and map (geo-chart) elements.

QuickBI

Demo Of Commonly Used Charts Part Three

See a hands-on demo of some of QuickBI's additional dashboard elements such as the card and word cloud.

QuickBI

Summary Of All Charts In Categories

In this video lecture, we summarize all the chart types discussed previously, tying each chart back to its best use-cases.

QuickBI

End To End Demo Introduction

Here we introduce the scenario for the end-to-end QuickBI dashboard construction demo carried out in the next video.

Machine Learning Platform for AI

Introduction Of PAI

Gain a basic understanding of Platform for AI (PAI), Alibaba Cloud's Machine Learning platform. See how a simple PAI environment can be quickly and easily created using the Alibaba Cloud web console.

Machine Learning Platform for AI

Quick Start And Architecture Overview

See a demonstration of PAI's low-code Machine Learning tool, PAI Studio. Learn how this interface can be used to prepare data, train and test machine learning models, and evaluate model accuracy.

Machine Learning Platform for AI

Demo: Best Practice Of PAI

Learn how PAI's low-code Studio tool can be used to train a machine learning model to detect financial fraud. This end-to-end demo explains how fraud rings are identified and shows the steps needed to train a working machine learning model on the provided sample data.

Machine Learning Platform for AI

User Cases

In this section, we take a look at some of PAI's current users, and discuss the ways in which they are utilizing PAI to improve a variety of different business processes.

QuickBI
QuickBI Course Content Briefing
This video outlines the topics covered in the following several video lectures, including basic concepts in Alibaba Cloud QuickBI, proper chart selection, and construction of QuickBI dashboards.
QuickBI
Background Introduction Of BI
Learn how to choose the right type of charts to display your data, and how different types of charts are best suited to different roles (comparison, relationship, distribution, composition).
QuickBI
Features Highlights Of QuickBI
This section introduces QuickBI and explains how QuickBI fits into the Alibaba Cloud Big Data ecosystem.
QuickBI
Commonly Used Charts Introduction Part One
In this video lecture, we give an in-depth explanation of different chart types, and examine scenarios in which each chart type is a good (or bad) choice.
QuickBI
Demo Of Commonly Used Charts Part One
See a hands-on demo of the QuickBI console, and learn how to construct a basic data dashboard.
QuickBI
Commonly Used Charts Introduction Part Two
Learn about additional useful chart types including the funnel chart, tree chart, tree map, and conversion chart.
QuickBI
Demo Of Commonly Used Charts Part Two
See a hands-on demo of more advanced charts in QuickBI, such as the conversion chart.
QuickBI
Commonly Used Charts Introduction Part Three
In this section, we examine some additional dashboard elements in QuickBI such as the card, word cloud, and map (geo-chart) elements.
QuickBI
Demo Of Commonly Used Charts Part Three
See a hands-on demo of some of QuickBI's additional dashboard elements such as the card and word cloud.
QuickBI
Summary Of All Charts In Categories
In this video lecture, we summarize all the chart types discussed previously, tying each chart back to its best use-cases.
QuickBI
End To End Demo Introduction
Here we introduce the scenario for the end-to-end QuickBI dashboard construction demo carried out in the next video.
Machine Learning Platform for AI
Introduction Of PAI
Gain a basic understanding of Platform for AI (PAI), Alibaba Cloud's Machine Learning platform. See how a simple PAI environment can be quickly and easily created using the Alibaba Cloud web console.
Machine Learning Platform for AI
Quick Start And Architecture Overview
See a demonstration of PAI's low-code Machine Learning tool, PAI Studio. Learn how this interface can be used to prepare data, train and test machine learning models, and evaluate model accuracy.
Machine Learning Platform for AI
Demo: Best Practice Of PAI
Learn how PAI's low-code Studio tool can be used to train a machine learning model to detect financial fraud. This end-to-end demo explains how fraud rings are identified and shows the steps needed to train a working machine learning model on the provided sample data.
Machine Learning Platform for AI
User Cases
In this section, we take a look at some of PAI's current users, and discuss the ways in which they are utilizing PAI to improve a variety of different business processes.