Back to Case Studies

MODERN DATA WAREHOUSING

Business Objective
  • Clients mission is to proactively secure relevant external data sets, organize internal/external information, make it intelligent, useful and accessible to decision makers/decisioning processes/wider ecosystem through use of scalable technologies.
Challenges Faced by the client
  • Data Storage - Structured, semi-structured and unstructured data to be stored in data lake
  • Data velocity - Enabling data ingestion at batch / real time with no data loss
  • Ease Access - For descriptive, prescriptive, predictive data analysis
  • Data Availability - Data from data lake to downstream systems – such as reporting, web application, analytics etc. as per defined SLA
  • Data Modelling – Optimized data models and data marts creation for various LOB
  • Fast and easy querying, Historical data storage and analysis
  • Data Governance – Data Security, metadata management
Solution(s) Proposed
  • Designed and implemented big data - lambda architecture using platform from cloudera - CDH5.13
  • For data lake HDFS to be used as object store
  • For ODS to enable real time data update - suggested to use HBASE
  • For datamarts and providing data to visualization layer used Hive



Download Full Case Study