**** Looking BACK to 2013 ****
NoSQL and related column-oriented databases are offered as a solution for datasets with requirements for high volume, velocity, variety and veracity.
The underlying principles are sound. Database design follows function, which it s say, data is organized optimally for problems at hand.
Perhaps the greatest advantage: Data is left 'as-is' - or at least - minimally processed. Contrary to conventional wisdom, this uncategorized and partially cleansed data become usable in a wider variety of applications.
The downside: Data is not always properly normalized meaning records can overlap, be duplicated and are missing altogether.
In the next post, I will examine the NoSQL behaviors that make this type of datasource a winner for Big Data analysis.
Agree with the potential advantages of data being left as-is. Obvious benefits there are faster time to analysis and more analysis freedom!
ReplyDelete