Hive Tutorial 2 : Why Use Hive

Hadoop is the future of enterprise data management.
Apache Hive is the gateway for business intelligence and visualisation tools integrated with Apache Hadoop.
Learning Apache Hive puts developers on the path to innovate in new data architecture projects and new business applications. I think it’s always more exciting to write code for something that’s ramping up than to maintain mature systems.

Learning Hive is easy for those who already know SQL.

Facebook originally created Hive because it had a pressing need to analyse petabytes of data at Internet scale, but it did not have enough time to teach all of its data analysts to write Java programs that would kick off MapReduce jobs.

Those analysts already knew how to write SQL queries, so Facebook created Hive as a tool they could use with their existing SQL skills. After Facebook contributed the code to the Apache Software Foundation, the open-source community continued developing Hive along these same lines. So the same is true today as it was in the beginning: developers already familiar with SQL can pick up Hive quickly and then take part in all of the new opportunities promised by Hadoop v2.0.
