HIVE Map Join is nothing but the extended version of Hash Join of SQL Server - just extending Hash Join into Distributed System. SMB(Sort Merge Bucket) Join is also similar to the SQL Server Merge Join mechnism - just extending it into Distributed S…
1. Project data with SELECT The most common use case for Hive is to query data in Hadoop. To achieve this, we need to write and execute a SELECT statement. The typical work done by the SELECT statement is to project the whole row (with SELECT * ) or…