I recently faced some questions in an interview with a company. Can anyone help me understand the basics of Hadoop as I am new to it?
Questions:
1. Difference between "Sort By" and "Group by" in Hive. How do they work?
2. If we use the "Limit 1" in any SQL query in Hive, will Reducer work or not?
3. How to optimize Hive Performance?
4. Difference between "Internal Table" and "External Table"
5. What is the main difference between Hive and SQL
6. The relation book (title, price) contains the titles and prices of different books. Assuming that no two books have the same price, what does the ```following SQL query list?
select title
from book as B
where (select count(*)
from book as T
where T.price > B.price) < 5
Last question reference:
https://www.interviewbit.com/sql-query-interview-questions/
Can you suggest me some helpful resources so that I can improve my learning? Thank you.