Hive Tutorial 2 : Why to use hive
Hadoop is the future of enterprise data management.
Apache Hive is the gateway for
business intelligence and visualisation tools integrated with Apache Hadoop.
Learning
Apache Hive puts developers on the path to innovate on new data
architecture projects and new business applications. I think it’s
always more exciting to write code for something that’s ramping up, rather than
for maintaining mature systems.
Learning Hive is easy for those who already
know SQL.
Facebook
originally created Hive because they had a pressing need to analyse their
petabytes of data at Internet scale. But they did not have enough time to teach
all of their data analysts to write Java programs that would kick off Map Reduce jobs.
Their analysts
already knew how to write SQL queries, so Facebook created Hive as a tool those
analysts could use with their existing SQL skills. After Facebook contributed
their code to the Apache Foundation, the open community continued developing
Hive along these same lines. So the same is true today as it was in the
beginning: developers already familiar with SQL
and then take part in all of the new opportunities promised by Hadoop v2.0.
Comments
Post a Comment