MaxCompute is a fast and fully managed one-stop data warehouse solution developed by Alibaba. MaxCompute can process terabytes or petabytes of data. This topic describes open-source features of MaxCompute.

SDK

MaxCompute provides Java SDK and Python SDK interfaces to create, view, and delete MaxCompute tables. You can use the SDKs to manage MaxCompute by editing code. MaxCompute provides the following SDKs:

MaxCompute RODPS

RODPS is a plug-in that MaxCompute provides for R. For more information, see ODPS Plugin for R on GitHub.

How to obtain service support: Leave a message or create an issue in ODPS Plugin for R on GitHub.

MaxCompute JDBC

MaxCompute JDBC is an official Java Database Connectivity (JDBC) driver provided by MaxCompute. MaxCompute JDBC provides a set of interfaces for Java programs to execute SQL tasks. The project is hosted in ODPS JDBC on GitHub.

How to obtain service support: Leave a message or create an issue in ODPS JDBC on GitHub.

Data collector

MaxCompute provides a set of open-source data collectors.

MaxCompute provides data collectors for the following services:
  • Flume
  • Oracle GoldenGate (OGG)
  • Sqoop
  • Kettle
  • Hive Data Transfer UDTF

    The Flume and OGG data collectors are implemented based on the DataHub SDK, whereas the data collectors for Sqoop, Kettle, and Hive Data Transfer UDTF are implemented based on the Tunnel SDK. DataHub is a real-time data transfer channel, and Tunnel is a batch data transfer channel. The Flume and OGG data collectors are used to transfer data in real time. The data collectors for Sqoop, Kettle, and Hive Data Transfer UDTF are used to transfer data in batches in offline mode.

For more information about the source code, see Aliyun MaxCompute Data Collectors on GitHub. For more information about data collectors, see wiki.

How to obtain service support: Leave a message or create an issue in Aliyun MaxCompute Data Collectors on GitHub.