Data Engineer

Bazaarvoice API

1. Developed the python and pyspark scripts to import API data to AWS s3 bucket in json formats.

Data Scientist
Data Analyst
Data Engineer
AWS
GitHub
Python

Cloud Data Platform (CDP)

Project involved building AWS Pipelines for loading data from Hana or Oracle into Redshift using Python and Pyspark for processing data as DataFrames.

Data Scientist
Data Analyst
Data Engineer
AWS
GitHub
Python

Enterprise Data Lake Migration

Led the creation of AWS pipelines to facilitate the loading of data from Oracle, PostgreSQL, Hana databases into Redshift. Employed Python PySpark scripts.

Data Scientist
Data Analyst
Data Engineer