This topic describes how to use MaxCompute to build a petabyte-grade data warehouse based on Object Storage Service (OSS). MaxCompute can analyze the large amounts of data stored in OSS in a fast and efficient manner, which allows you to explore data value within several minutes and at low cost.
Prerequisites
- OSS is activated. One or more buckets are created.
- For more information about how to activate OSS, see Activate OSS.
- For more information about how to create a bucket, see Create buckets.
- You have activated MaxCompute and authorized MaxCompute to access OSS.
- For more information about how to activate MaxCompute, see Activate MaxCompute and DataWorks.
- You need to authorize the account used for running MaxCompute jobs to access OSS data. After you log on with an Alibaba Cloud account, go to the authorization page to authorize the account.
Background information
Internet finance applications need to store large amounts of financial data exchange objects in OSS every day and perform structured analysis of large text files. MaxCompute provides the OSS external table query function, which allows you to use external tables to load large OSS objects into MaxCompute for analysis. This method can improve the efficiency of the entire process.