What is the difference between SORT BY and ORDER BY in Hive?

In Hive we use, SORT BY to sort data in each Reducer. We can use multiple Reducers in SORT BY clause.

We can use ORDER BY to sort all the data that passes through one Reducer. So ORDER BY option can be use only with one Reducer.

ORDER BY guarantees total order in the output. SORT BY guarantees ordering only within the data of one reducer.

Read the full book at www.amazon.com
Posted in Hive, Hive Interview Questions

Leave a Reply

Your email address will not be published. Required fields are marked *

*