Cloud Data Platform (CDP)

Mohammed Ahamad Basha

Data Scientist

Data Analyst

Data Engineer

AWS

GitHub

Python

The project involved building AWS pipelines that load data from SAP HANA or Oracle into Redshift, using Python and
PySpark to process the data as DataFrames.
Ensured that data was populated correctly and cleansed.
Designed pipelines based on the given requirements.
Performed performance tuning, scheduled pipelines, and developed Glue jobs and Lambda scripts.
Maintained code versioning through a CI/CD pipeline.
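
As a rough illustration of the extraction step described above, the sketch below builds the connection options that Spark's generic JDBC reader expects for either source system. It is not the project's actual code: the helper name `jdbc_read_options`, the `JDBC_SOURCES` table, and all hosts, ports, and table names are hypothetical, and it assumes the standard SAP HANA and Oracle thin JDBC drivers are available on the cluster.

```python
# Hypothetical lookup of JDBC settings per source system (assumed, not
# taken from the project). URL formats follow the standard SAP HANA and
# Oracle thin driver conventions.
JDBC_SOURCES = {
    "hana": {
        "url": "jdbc:sap://{host}:{port}",
        "driver": "com.sap.db.jdbc.Driver",
    },
    "oracle": {
        "url": "jdbc:oracle:thin:@//{host}:{port}/{service}",
        "driver": "oracle.jdbc.OracleDriver",
    },
}


def jdbc_read_options(source, host, port, table, user, password, service=""):
    """Return an options dict for Spark's JDBC reader.

    `source` is "hana" or "oracle" (case-insensitive); `service` is only
    used for Oracle and is ignored for HANA.
    """
    try:
        template = JDBC_SOURCES[source.lower()]
    except KeyError:
        raise ValueError(f"unsupported source: {source!r}") from None
    return {
        "url": template["url"].format(host=host, port=port, service=service),
        "driver": template["driver"],
        "dbtable": table,
        "user": user,
        "password": password,
    }
```

In a Glue or PySpark job, a dict like this could be passed as `spark.read.format("jdbc").options(**opts).load()` to obtain a DataFrame, which would then be cleansed and written to Redshift. Keeping the per-source settings in one table means the same pipeline code can serve both HANA and Oracle.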

Posted Nov 6, 2023

Clients

Signet Jewelers
