The Big 5 of Big Data

by Rudy P
The article I chose to blog about this week is “Big Data Right Now: Five Trendy Open Source Technologies” by Mr. Tim Gasper. The article starts off by saying that Big Data is on everyone’s mind and that companies “will have spent $4.3 billion on Big Data technologies by the end of 2012” (Gasper, 2012). However, the author believes this is just the tip of the iceberg and states that these initial investments will set off a chain reaction of upwards of $34 billion in spending through 2013. The field is so expansive, and there are so many players in Big Data (with more to come), that the author provided a picture to show just how big it is.

With so many choices and so much involved in the field of Big Data, the author gives his opinion on the five new technologies that are making the most noise.

1. Storm and Kafka: seen as the future of stream processing. They are currently used by many companies, including Groupon, Alibaba, and The Weather Channel.

2. Drill and Dremel: “make large-scale, ad-hoc querying of data possible, with radically lower latencies that are especially apt for data exploration” (Gasper, 2012).

3. R: an extremely powerful open source statistical programming language. It is quickly becoming the standard for statistics.

4. Gremlin and Giraph: help empower graph analysis and are often coupled with graph databases.

5. SAP Hana: an in-memory analytics platform that “includes an in-memory database and a suite of tools and software for creating analytical processes and moving data in and out, in the right formats” (Gasper, 2012).


This article interested me because at first it seemed as though it would portray Big Data and NoSQL in a negative light. In some respects it does, as shown in the image: there are so many types of databases and different technologies in Big Data that people may not know which is the best fit. One-size-fits-all solutions are not for everyone, so simply choosing a big name in Big Data, or choosing Big Data itself, may not be the right choice for every company.

This article was a real eye-opener for me because it showed just how many players there are in Big Data. One of the things that makes Big Data so interesting is how much noise it is making throughout the industry (and the classroom, for that matter). It seems as though most people are blogging about Big Data or NoSQL, and this shows why: it is getting an incredible amount of money and attention, and there are so many technologies and vendors to choose from.

Gasper, T. (2012, October 27). Big Data Right Now: Five Trendy Open Source Technologies. Retrieved October 28, 2012, from

3 thoughts on “The Big 5 of Big Data”

  • October 28, 2012 at 5:42 pm

    Great read. The article opened my eyes to the types of databases and varied technologies in Big Data that best fit people’s needs. The picture was complicated for me at first, but it did make me say “Wow” after spending a few minutes on it. I love the way it classified everything in big parts as relational and non-relational, and then classified everything again more specifically.

  • October 28, 2012 at 6:41 pm

    Nice post. The image really helps me understand what is meant by “One size doesn’t fit all”. When dealing with larger data sets, companies face challenges in creating, manipulating, and managing Big Data. Big Data is a particular problem in business analytics because standard tools and procedures are not designed to search and analyze massive data sets.

  • October 28, 2012 at 10:47 pm

    Big Data is so complex that it becomes difficult to process using on-hand database management tools. I guess that is why the field is expanding and there is so much opportunity for companies. Companies are looking for the best Big Data technologies to help turn all that data into useful information, so they can make the most of it and help their company advance in the business. Good post.
