Comments:"Hadoop Landscape Review 2013"
URL:http://www.dataintoresults.com/2013/04/hadoop-landscape-review-2013/
Hadoop landscape review 2013
I’ve spent some time lately to dig into the Hadoop ecosystem both from a product survey and some hands on. Here is some remarks about the state of Hadoop in April 2013. I’ve played with Greenplum HD 1.2 and CDH4.2 and read a lot of stuff about Hadoop and peripherical products.
1. Map reduce is dead
I’m not talking about HDFS here. I found HDFS convenient to use. I don’t know its technical limits but it seems sound ground.
2. Impala is serious business
3. The ecosystem rocks : Apache Oozie and Cloudera Hue
They are both currently immature, but they provide an invaluable framework around Hadoop. Oozie is an ETL tool embedded in Hadoop while Hue is a workbench where you can use Hive, Impala, … well every Hadoop tools. The big momentum around Hadoop means that most issues you face in your daily life will be solved. It’s a bit like Java, no matter what you want there is a library for that. Same thing happen with Hadoop.
In conclusion, Hadoop will be the of common use soon.
Real-time SQL is getting real, the stack is getting quite complete (Oozie, Hue), the momentum is huge and it’s free. The mix is getting better and better. It’s way too early to do something serious with it, as I can see better alternatives for most usage (excluding the living archive one, not my area anyway). Major change will happen this year and it’s the time to look closely at Hadoop.
Possible related posts:
Hadoop is dead thanks to EMC, long live to Hadoop Book review : Competing on analytics Book review : Marketing calculator Big data and mobile BI : New hype but same old issue Data Manipulation Part 1 : SQL