Tuesday, February 12, 2013

The Rise of the Column Store

**** Looking BACK to 2013 ****



NoSQL and related column-oriented databases are offered as a solution for datasets with requirements for high volume, velocity, variety and veracity.

The underlying principles are sound. Database design follows function, which it s say, data is organized optimally for problems at hand.

Perhaps the greatest advantage: Data is left 'as-is' - or at least - minimally processed.  Contrary to conventional wisdom, this uncategorized and partially cleansed data become usable in a wider variety of applications.

The downside: Data is not always properly normalized meaning records can overlap, be duplicated and are missing altogether.

In the next post, I will examine the NoSQL behaviors that make this type of datasource a winner for Big Data analysis.

1 comment:

  1. Agree with the potential advantages of data being left as-is. Obvious benefits there are faster time to analysis and more analysis freedom!

    ReplyDelete