MaxCompute, alibaba Cloud big data computing service

Ali Cloud big data computing service MaxCompute usage tutorial

MaxCompute (formerly ODPS) is a big data computing service that provides fast, fully managed PB-level data warehouse solutions that enable you to analyze and process massive amounts of data economically and efficiently.

If you are a beginner to MaxCompute

If you are a beginner, we suggest you start with the following modules:

Introduction: An overview of the MaxCompute product and its included features. Reading this section gives you a general idea of MaxCompute. Quick Start: This section provides examples for you to apply for an account, install a client, create a table, authorize authorization, import and export data, run SQL tasks, and run the UDF or Mapreduce program. Basic introduction: Describes the basic concepts of MaxCompute and common commands. You can become more familiar with how MaxCompute works. Tools: Before analyzing data, you need to know how to download, configure, and use the MaxCompute tools. We provide the following Client tools: Client: You can use this tool to perform operations on MaxCompute.

It is recommended that after you are familiar with the above modules, you should further study other modules.

If you are a data analyst

If you are a data analyst, it is recommended that you familiarize yourself with SQL modules:

SQL: You can query and analyze massive data stored in MaxCompute. Includes the following functions: Supports DDL statements. You can use Create, Drop, and Alter to manage tables and partitions. You can Select several records in the table by using Select. You can use the Where statement to view records that meet the conditions to implement filtering. You can associate two tables with an equivalent Join Join. You can implement aggregation By Group By on certain columns. You can Insert the resulting record into another table by Insert Overwrite/INTO. You can implement a range of calculations using built-in functions and custom functions (UDFs).

If you have some development experience

If you have some development experience and understand distributed concepts, and some data analysis may not be implemented in SQL, it is recommended that you learn more advanced functional modules of MaxCompute. As follows:

MapReduce: Java MapReduce programming model provided by MaxCompute. You can use the Java API provided by MapReduce to write MapReduce programs that process data in MaxCompute. Graph: A set of iteration-oriented Graph computing and processing framework. The model is modeled using graphs, which consist of points and edges that contain weights. The graph is edited and evolved through iteration, and the final result is obtained. Eclipse Plugin: Make it easy to develop using MapReduce, UDF, and Graph’s Java SDK. Tunnel: You can use the Tunnel service to batch upload offline data to MaxCompute or download offline data from MaxCompute. SDK: Java SDK: Provides Java interfaces to developers. Python SDK: Provides Python interfaces to developers.

Note:

MapReduce and Graph are still in public beta. If you want to use these features, you can submit an application through the work order system. Please specify your project space name when applying, we will process it within 7 working days.

If you are the project Owner or administrator

If you are the Owner or administrator of a project space, you should be familiar with the following modules:

Security Guide: This section describes how to authorize users, share resources across project Spaces, set data protection functions for project Spaces, and authorize policies. MaxCompute Charging Guide: describes the charging mode of MaxCompute. And some commands that only the Owner of the project space can use, such as SetProject operations for other operations in common commands.

Content introduction

Lesson 1 introduction to client Configuration and Basic Commands 03:09

Class 2: Introduction to BASIC SQL operations 03:42

课时3: introduction to tunnel command operation 21:04

课时4: basic syntax difference between MaxCompute and mainstream relational database DDl

课时5: MaxCompute Studio installation configuration

Official website of Ali Yun University (Official website of Ali Yun University, Innovative Talent Workshop under cloud Ecology)