Cloud Computing Services - Cloudurable Introduction to Kafka – Published 2017 What is Kafka? Kafka is a platform for building a fault tolerant, distributed publish-subscribe messaging system engineered for processing of high-volume data streams, which is able to handle hundreds of thousands of messages. This 79 page eBook provides a comprehensive overview of topics such as:
Dark Data Analytics - By Complexity Labs - Published February 2018
This video highlights that the dark data explosion of data is far outstripping our capacities to use it. A small fraction is in a traditional structure form that is easily accessible and usable by organizations, a larger section of big data is unstructured but at least somewhat accessible, while the vast majority is simply hidden all together going unseen and unused, this, we can call dark data.
Data may be considered dark for a number of different reasons because it is unstructured, because it is behind a firewall on the internet, it may be dark because of speed or volume, or because people simply have not made the connections between the different datasets.
In many organizations, large collections of both structured and unstructured data sit idle. On the structured side, it's typically because connections haven't been easy to make between disparate data sets that may have meaning- especially information that lives outside of a given system, business unit or function.