What is the difference between SORT BY and ORDER BY in Hive?

In Hive we use, SORT BY to sort data in each Reducer. We can use multiple Reducers in SORT BY clause.

We can use ORDER BY to sort all the data that passes through one Reducer. So ORDER BY option can be use only with one Reducer.

ORDER BY guarantees total order in the output. SORT BY guarantees ordering only within the data of one reducer.

Leave a Reply

Your email address will not be published. Required fields are marked *