MODERN DATA WAREHOUSING
Business Objective
- The client's mission is to proactively secure relevant external data sets, organize internal and external information, and make it intelligent, useful, and accessible to decision makers, decisioning processes, and the wider ecosystem through the use of scalable technologies.
Challenges Faced by the Client
- Data Storage - Structured, semi-structured, and unstructured data all need to be stored in the data lake
- Data Velocity - Data ingestion in both batch and real time, with no data loss
- Ease of Access - Data readily accessible for descriptive, predictive, and prescriptive analysis
- Data Availability - Data delivered from the data lake to downstream systems (reporting, web applications, analytics, etc.) within the defined SLAs
- Data Modelling - Optimized data models and data marts for the various lines of business (LOBs)
- Fast and easy querying, with historical data storage and analysis
- Data Governance - Data security and metadata management
Solution(s) Proposed
- Designed and implemented a big data Lambda architecture on the Cloudera platform (CDH 5.13)
- Data lake - HDFS used as the underlying data store
- Operational data store (ODS) - HBase proposed to enable real-time data updates
- Data marts and visualization layer - Hive used to build the data marts and serve data to the visualization layer
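
As a rough illustration of the Lambda pattern named above, the following Python sketch shows how a batch path and a real-time path can be merged at query time. It is not the client's implementation: the class `LambdaPipeline`, its method names, and the account/amount events are hypothetical, and plain in-memory structures stand in for the roles that HDFS (raw data), Hive (batch views), and HBase (real-time updates) play in the proposed solution.

```python
from collections import defaultdict


class LambdaPipeline:
    """Toy Lambda architecture; in-memory stand-ins only (assumption)."""

    def __init__(self):
        # Append-only raw events: the role HDFS plays as the data lake.
        self.master_dataset = []
        # Precomputed totals from the last batch run: the role of Hive data marts.
        self.batch_view = {}
        # Incremental totals since the last batch run: the role of the HBase ODS.
        self.realtime_view = defaultdict(float)

    def ingest(self, account, amount):
        """Write each event to both paths so that nothing is lost."""
        self.master_dataset.append((account, amount))
        self.realtime_view[account] += amount

    def run_batch(self):
        """Recompute the batch view from all raw data, then reset the speed layer."""
        totals = defaultdict(float)
        for account, amount in self.master_dataset:
            totals[account] += amount
        self.batch_view = dict(totals)
        self.realtime_view.clear()

    def query(self, account):
        """Serving layer: merge the batch view with the real-time increments."""
        return self.batch_view.get(account, 0.0) + self.realtime_view.get(account, 0.0)


pipeline = LambdaPipeline()
pipeline.ingest("A", 100.0)
pipeline.ingest("B", 40.0)
pipeline.run_batch()          # batch view now covers the first two events
pipeline.ingest("A", 25.0)    # arrives after the batch run, held in the speed layer
print(pipeline.query("A"))    # 125.0
```

The point of the design is that queries stay current between batch runs: the batch view is periodically recomputed from the full history, while the real-time view covers only the events that arrived since the last run.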
Similar Case Studies