Thursday, November 22, 2012

Big Data Types

**** Looking BACK to 2012 ****


We intuitively know Big Data is about new data sources.  It is data that comes from new places, in many formats.

Here are the types of information often mentioned:
  • Transactional - HIGH VOLUME operational data, often in its pre-warehoused state.  This data runs the business but doesn't necessarily roll up into summary form without a fair bit of help (i.e., lots of cleansing, normalizing and correlating).
  • Machine Data - FAST MOVING real-time data from automated sources.  Often very messy.  Can be difficult to relate to warehouse data without plenty of extra semantic muscle.
  • Social Data - The INFINITELY VARIABLE source of all knowledge - and often the source of nothing at all.  Finding the relevant needles in this giant haystack is challenging.
  • Enterprise - THE VALUABLE STUFF THAT RUNS YOUR BUSINESS.  Many of my colleagues say this data has 'veracity'.  I just say, 'This is the data that business trusts'.  
Next week I want to start talking about data volume, velocity, variety, and value.  I'll explode a few myths - and have some fun too.

Tomorrow will be (Off) Topic Friday - where I raise an unrelated but hopefully enlightening topic to celebrate TGIF.



