Hybrid Data Platform and Integration Hub Development

Gautam Gupte

The client, a leading EU private equity and wealth management firm, needed a modern data platform to address legacy silos, manual processes, and reporting inefficiencies. A hybrid data warehouse using SQL Server (on-prem) and Snowflake (cloud) was built to support legacy compatibility and cloud-native scalability. Additionally, a centralized integration hub on AWS to unify disparate enterprise systems, replacing point-to-point workflows with scalable, event-driven architecture. Leveraged API Gateway, Lambda, Step Functions, SQS, EventBridge, and Cognito etc. For DG use case refer to details

Client Description:

Client is a well-known name in private equity fund administration and wealth management services in the EU region.

Data platform project description:

Inorganic growth over the years had resulted in siloed legacy systems and fragmented reporting, leading to high processing overheads and error-prone manual tasks. To address this, a modern data platform was envisioned to consolidate data sources, streamline reporting, and to enable monetization of data through enhanced offerings like Platform-as-a-Service, Data-as-a-Service, and Data Warehouse-as-a-Service.
A hybrid data warehouse solution was developed using MS SQL Server (on-premise) and Snowflake (cloud). This architecture ensured backward compatibility with existing reporting while providing flexibility for clients to integrate via on-prem or cloud technologies. Snowflake features like tasks, streams, stored procedures, and transient tables were leveraged for cloud-based efficiencies.

DG use case: Data Governance Enablement Using Collibra

A parallel governance stream initiated using Collibra, needed to complimented with the new data platform details. The objective was to ensure data discoverability, ownership, and compliance standards across the organization.

Key Contributions:

Technical Sub Communities were created for representing the data platforms, reporting groups
Domains such as Logical Models, DW Policy, Data Quality Dimensions and Tech Asset Domain were structured to organize assets meaningfully.
Datasets such as Client Data, Transaction Data, and Performance Metrics were structured to organize assets meaningfully.
A unified Data Catalog was established by ingesting metadata from Snowflake and SQL Server, improving data accessibility.
Policies, business glossary terms and Data Quality Rules were defined and linked to corresponding datasets.
Published Collibra assets included:
Datasets
Reports
Glossary Terms
Stewardship Roles and Responsibilities
Supporting WFs

Integration hub project description:

Additionally, an integration hub was developed to serve as a unified interface between disparate systems. It eliminated point-to-point integrations, reduced turnaround time for new workflows, and supported a microservices architecture. This hub was built using AWS cloud services including API Gateway, Lambda, Step functions, SQS, Event Bridge, and KDS, enabling both batch and streaming, synchronous/asynchronous integrations.

Technical Environment:

Snowflake Enterprise Edition, MS SQL Server 2016, Mosaic ETL Tool, Mosaic Catalog, Azure Cloud Services, Jira, AWS Services

Role:

Delivery Manager overseeing technical implementation
Managed scope, timeline, cost, communication, and quality
Liaised with stakeholders and ensured successful end-to-end delivery
Like this project

Posted Jun 11, 2025

Developed a hybrid data platform and integration hub for a leading EU private equity firm.

Datalake & Real-Time Analytics for Media Streaming Platform
Datalake & Real-Time Analytics for Media Streaming Platform
Healthcare Informatics: Adv. analytics platform for pharma firm
Healthcare Informatics: Adv. analytics platform for pharma firm

Join 50k+ companies and 1M+ independents

Contra Logo

© 2025 Contra.Work Inc