Apache Hadoop didn’t disrupt the datacenter, the data did.
Shortly after Corporate IT functions within enterprises adopted large scale systems to manage data then the Enterprise Data Warehouse (EDW) emerged as the logical home of all enterprise data. Today, almost every enterprise has a Data Warehouse that serves to model and capture the essence of the business from their enterprise systems.
The emergence and explosion of new types of data in recent years has put tremendous pressure on all of the data systems within the enterprise. These new types of data stem from ‘systems of engagement’ such as websites, or from the growth in connected devices.
The data from these sources has a number of features that make it a challenge for a data warehouse:
Exponential Growth. 85% of this data growth is expected to come from new types like Clickstream data, Server Logs, Geolocation, Machine and Sensor data,..
Varied Nature. The incoming data can have little or no structure, or structure that changes too frequently for reliable schema creation at time of ingest.
Value at High Volumes. The incoming data can have little or no value as individual, or small groups of records. But high volumes and longer historical perspectives can be inspected for patterns and used for advanced analytic applications.
Read the white paper
Challenges of capture and storage aside, the blending of existing enterprise data with the value found within these new types of data is being proven by many enterprises across many industries from Retail to Healthcare, from Advertising to Energy.
The maturation of Apache Hadoop in recent years has broadened its capabilities from simple data processing of large data sets to a fully-fledged data platform with the necessary services for the enterprise from Security to Operational Management and more.
From an architectural perspective, the use of Hadoop as a complement to existing data systems is extremely compelling: an open source technology designed to run on large numbers of commodity servers. Hadoop provides a low cost scale-out approach to data storage and processing and is proven to scale to the needs of the very largest web properties in the world.Contact us