Provides
a 360-degree view of the customer so that a salesperson is aware of all
the relevant facts when talking to the customer. This gives a much better
chance of closing the deal.
The project involved building a data lake. Hadoop tools were used to
transfer data between the source systems and HDFS, and some of the sources
were imported using Sqoop. The raw data was then stored in Hive tables in ORC
format so that data scientists could perform analytics using Hive. New
use cases were developed and loaded into a NoSQL database (HBase) for
further analytics.
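The import-and-store flow described above can be sketched roughly as follows. This is only an illustration, not the project's actual scripts: the helper names, JDBC URL, user, paths, table names and columns are all hypothetical, and it assumes a Hive staging table already exists over the directory that Sqoop writes to.

```python
import shlex


def build_sqoop_import(jdbc_url, username, password_file, table, target_dir, mappers=4):
    """Return the argument list for a Sqoop import of one Oracle table into HDFS."""
    return [
        "sqoop", "import",
        "--connect", jdbc_url,
        "--username", username,
        "--password-file", password_file,  # file holding the password, not the password itself
        "--table", table,
        "--target-dir", target_dir,        # HDFS directory that receives the raw files
        "--num-mappers", str(mappers),     # number of parallel import tasks
    ]


def build_orc_load(raw_table, staging_table, columns):
    """Return HiveQL that creates an ORC table and fills it from a staging table.

    Assumption: staging_table already exists and exposes the imported files.
    """
    cols = ",\n  ".join(f"{name} {hive_type}" for name, hive_type in columns)
    return (
        f"CREATE TABLE IF NOT EXISTS {raw_table} (\n  {cols}\n)\nSTORED AS ORC;\n"
        f"INSERT OVERWRITE TABLE {raw_table}\nSELECT * FROM {staging_table};"
    )


if __name__ == "__main__":
    # Hypothetical values, for illustration only.
    cmd = build_sqoop_import(
        "jdbc:oracle:thin:@//db-host:1521/ORCL",
        "etl_user",
        "/user/etl/.oracle_pw",
        "CUSTOMERS",
        "/data/raw/customers",
    )
    print(shlex.join(cmd))
    print(build_orc_load(
        "customers_raw",
        "customers_staging",
        [("customer_id", "BIGINT"), ("name", "STRING")],
    ))
```

The printed command could be run from a shell or scheduler, and the printed HiveQL could be saved to a file and executed with the Hive CLI or Beeline.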
Responsibilities:
• Developed Sqoop scripts to import the source data from
an Oracle database into HDFS for further processing.
• Developed Hive scripts to store raw data in ORC format.
• Took part in requirements gathering, design,
development, and testing.
• Generated reports using Hive for business requirements
received on an ad hoc basis.
• Environment: Cloudera CDH 5.4.4
• Tech Stack: Hadoop, HDFS, Hive, Sqoop,
HBase.
Posted Feb 24, 2023
I worked with a range of big data tools to build a hassle-free platform for the client, which was required to host customer information.