Customer Data Hub-Hadoop: Express

Dil Gurung

Data Engineer

Created and updated ETL scripts for the Customer Data Hub (CDH) as part of a project named Loyalty Relaunch, which tracked customers' loyalty status.

The client wanted a customer loyalty data pipeline added to their ETL architecture for reporting purposes.

Data sources dropped customer-related files on an SFTP server, where a Bash ETL process read them. The process first loaded the data into the Hive staging area, applied the necessary transformations (data cleansing, repair, lookup, deduplication, etc.), and then populated the gold and smith layers of Hive on Google Cloud in turn. A script then exported the resulting data as a file to an SFTP location, from which it was read and loaded into Teradata. Operational reports were built on the Teradata warehouse.
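As a rough illustration of the transformation stage described above (cleansing, lookup, and deduplication), here is a minimal Python sketch. It is not the project's actual code, which ran as Bash scripts against Hive. The field names (`customer_id`, `email`, `tier_code`, `updated_at`), the `TIER_LOOKUP` table, and the "keep the latest record per customer" rule are all assumptions made for the example.

```python
from typing import Dict, Iterable, List

# Hypothetical lookup table (tier code -> tier name); not taken from the project.
TIER_LOOKUP = {"B": "Basic", "S": "Silver", "G": "Gold"}


def cleanse(record: Dict[str, str]) -> Dict[str, str]:
    """Trim whitespace from every value and lower-case the email."""
    cleaned = {key: (value or "").strip() for key, value in record.items()}
    cleaned["email"] = cleaned.get("email", "").lower()
    return cleaned


def apply_lookup(record: Dict[str, str]) -> Dict[str, str]:
    """Add a tier_name column by looking up the tier code."""
    enriched = dict(record)
    enriched["tier_name"] = TIER_LOOKUP.get(record.get("tier_code", ""), "Unknown")
    return enriched


def deduplicate(records: Iterable[Dict[str, str]]) -> List[Dict[str, str]]:
    """Keep one record per customer_id: the one with the latest updated_at."""
    latest: Dict[str, Dict[str, str]] = {}
    for record in records:
        key = record["customer_id"]
        # ISO-8601 date strings compare correctly as plain strings.
        if key not in latest or record["updated_at"] > latest[key]["updated_at"]:
            latest[key] = record
    return list(latest.values())


def transform(staged: Iterable[Dict[str, str]]) -> List[Dict[str, str]]:
    """Run the staged rows through cleanse -> lookup -> deduplicate."""
    return deduplicate(apply_lookup(cleanse(row)) for row in staged)


# Made-up staged rows, standing in for data loaded into the staging area.
staged_rows = [
    {"customer_id": "1", "email": " A@Example.com ", "tier_code": "G", "updated_at": "2021-01-01"},
    {"customer_id": "1", "email": "a@example.com", "tier_code": "S", "updated_at": "2021-03-01"},
    {"customer_id": "2", "email": "b@example.com", "tier_code": "X", "updated_at": "2021-02-01"},
]
result = transform(staged_rows)
```

With these sample rows, the two records for customer 1 collapse into the more recent one, and the unrecognised tier code for customer 2 falls back to "Unknown". In a Hive-based pipeline the same steps would more typically be written as SQL over the staging tables; the sketch only shows the order of operations.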

Handled JIRA tickets (bug fixes) during development.

Ran batches in Control-M to test the ETL process.

Technologies used: BigQuery, Teradata, Scala, Jira, Bash Scripting.

Posted Nov 21, 2021

Clients

Express

Challenges