Data Scientist at Sony PlayStation by Muhammad HaseebData Scientist at Sony PlayStation by Muhammad Haseeb

Data Scientist at Sony PlayStation

Muhammad Haseeb

Muhammad Haseeb

Data Scientist at Sony PlayStation

I spent 4 years as a Data Scientist at Sony PlayStation, where I focused on fixing and scaling their internal service data systems.

The Problem

The internal teams were struggling badly. They had to process over 500,000 invoices every cycle, but the old ETL pipeline was painfully slow. People who needed the data for decisions were constantly waiting. The system simply couldn’t keep up with the volume, and it was slowing everyone down.

What I Did

I completely rewrote the ETL pipeline from scratch using Java. The new version ran 120 times faster and handled all 500,000+ invoices smoothly.
On top of that, I built:
Interactive visualizations so the data was actually easy to understand
Custom web apps tailored for different teams
Real-time dashboards that gave people the insights they needed without waiting
I worked closely with the internal teams throughout — listening to their actual pain points, making sure everything fit into their existing workflows, and continuously improving based on their feedback.

Why It Was Challenging

Massive scale (hundreds of thousands of invoices)
Extremely poor performance in the old system
Had to integrate cleanly without breaking anything already in place
Managing stakeholders across multiple teams with different priorities
Building something that would last and could evolve as the business grew

Technologies Used

Java, ETL pipeline development, custom web applications, data visualization, interactive dashboards, large-scale data processing.

The Outcome

The 120x speed improvement was a game changer. Teams stopped wasting hours waiting for data and could finally make decisions quickly. It was a tough project, but one of the most impactful ones I’ve done.
Like this project

Posted Jun 4, 2024

At Sony PlayStation for 4 years. Processed 500k+ invoices, built dashboards/web apps, rewrote ETL pipeline in Java (120x faster). Complex, rewarding project.