**** Looking BACK to 2012 ****
We intuitively know Big Data is about new data sources. It is data that comes from new places, in many formats.
Here are the types of information often mentioned:
- Transactional - HIGH VOLUME operational data, often in its pre-warehoused state. This data runs the business but doesn't necessarily roll up into summary form without a fair bit of help (i.e., lots of cleansing, normalizing and correlating).
- Machine Data - FAST MOVING real-time data from automated sources. Often very messy. Can be difficult to relate to warehouse data without plenty of extra semantic muscle.
- Social Data - The INFINITELY VARIABLE source of all knowledge - and often the source of nothing at all. Finding the relevant needles in this giant haystack is challenging.
- Enterprise - THE VALUABLE STUFF THAT RUNS YOUR BUSINESS. Many of my colleagues say this data has 'veracity'. I just say, 'This is the data that business trusts'.
Next week I want to start talking about data volume, velocity, variety, and value. I'll explode a few myths - and have some fun too.
Tomorrow will be (Off) Topic Friday - where I raise an unrelated but hopefully enlightening topic to celebrate TGIF.