1, the Order by

Global ordering, but with only one Reducer

2, Sort by

Intra-zonal order

3, Distribute by

Similar to Partition in MapReudce, used in combination with Sort by

4, Cluster by

If the Distribute BY and Sort BY fields are the same, you can use the Cluster by mode. In addition to being Distribute by, Cluster BY also functions as Sort BY. However, the sort can only be in ascending order and cannot be specified as ASC or DESC