This involves building a data lake. Data sources use
Hadoop tools to transfer data to and from HDFS and some of the sources, were
imported using sqoop, Then storing the raw data into HIVE tables in ORC format
in order to facilitate the data scientists to perform analytics using HIVE. New
use cases were developed and dumped into a NOSQL database (Hbase) for