Privacy-Preserving Data Platform for Disability Services
Sunnyfield — Disability Services
Identity model runtime: days → ~6h
Source systems unified: 8
Environments: dev / rc / prod
Challenge
Sunnyfield runs on eight separate enterprise systems spanning HR, payroll, rostering, learning, finance and care management. Data arrived through an unpredictable mix of SQL, SFTP, API and Excel feeds, with constant schema drift. Strict privacy obligations around participant and workforce PII meant any analytics layer had to keep that data protected at every stage. Leadership had no reliable single view of the workforce or participants, and no scalable way to build BI or dashboards on top.
Approach
We designed and built a two-layer production platform. A Python and Prefect orchestrator extracts from every source, applies deterministic PII hashing and generates master entity IDs. A dbt analytics layer sits on top with a four-stage dimensional model (raw, interim, staging, analytics), using SCD Type 2 temporal modelling and comprehensive data quality tests. The platform runs on Azure Synapse and SQL Server across three isolated environments, with row-level security (RLS) enforced at the warehouse and column-level encryption on the identity store. Power BI reads directly from the warehouse, with RLS preserved end to end.
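To illustrate what deterministic PII hashing and master entity ID generation can look like, here is a minimal sketch. It is not the production implementation: the keyed HMAC-SHA256 scheme, the normalisation rules, the `ME-` prefix, the choice of name plus date of birth as matching attributes, and every function name are assumptions made for illustration. A real identity model would typically also need fuzzy matching and survivorship rules, which this sketch leaves out.

```python
import hashlib
import hmac
import unicodedata

# Hypothetical key for illustration only; a real deployment would load
# this from a secrets store rather than hard-coding it.
SECRET_KEY = b"example-key-not-for-production"


def normalise(value: str) -> str:
    """Lowercase, strip accents and collapse whitespace so that
    equivalent spellings of a value produce identical hashes."""
    text = unicodedata.normalize("NFKD", value)
    text = "".join(ch for ch in text if not unicodedata.combining(ch))
    return " ".join(text.lower().split())


def hash_pii(value: str, key: bytes = SECRET_KEY) -> str:
    """Deterministic keyed hash: the same input and key always give the
    same digest, so hashed columns can still be joined across sources."""
    message = normalise(value).encode("utf-8")
    return hmac.new(key, message, hashlib.sha256).hexdigest()


def master_entity_id(given_name: str, family_name: str,
                     date_of_birth: str, key: bytes = SECRET_KEY) -> str:
    """Derive a stable ID from a composite of identifying attributes
    (assumed here: names plus an ISO-formatted date of birth)."""
    parts = (given_name, family_name, date_of_birth)
    composite = "|".join(normalise(p) for p in parts)
    digest = hmac.new(key, composite.encode("utf-8"), hashlib.sha256)
    return "ME-" + digest.hexdigest()[:16]
```

Under these assumptions, `hash_pii(" Jane  DOE ")` and `hash_pii("jane doe")` return the same digest, and the same person arriving from two source systems with slightly different formatting receives the same master entity ID. Using a keyed hash rather than a plain one means the digests cannot be reproduced by someone who knows the input values but not the key.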
Results
Identity resolution runtime dropped from days to around six hours, unblocking downstream analytics. The pipeline is now resilient to real-world source drift (filename changes, header typos, format shifts) and has run in steady state for months. Ongoing data-engineering burden dropped to a fraction of one FTE. The platform now underpins executive dashboards and a second product line built on top of it.
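One way a pipeline can tolerate header typos is to map incoming column names onto a known canonical set before loading. The sketch below shows that idea using Python's standard `difflib`. It is an assumption about technique rather than a description of the platform's actual logic, and the column names, the `0.8` similarity cutoff and the function names are all hypothetical.

```python
import difflib
import re

# Hypothetical canonical schema for one source file.
CANONICAL_COLUMNS = ["employee_id", "first_name", "last_name", "start_date"]


def canonical_key(header: str) -> str:
    """Reduce a raw header to lowercase snake_case for comparison."""
    return re.sub(r"[^a-z0-9]+", "_", header.strip().lower()).strip("_")


def map_headers(raw_headers, canonical=CANONICAL_COLUMNS, cutoff=0.8):
    """Map each raw header to its closest canonical column.

    Raises ValueError when a header has no sufficiently close match or
    when two headers resolve to the same column, so drift that cannot
    be resolved safely fails loudly instead of loading bad data.
    """
    mapping = {}
    for raw in raw_headers:
        key = canonical_key(raw)
        matches = difflib.get_close_matches(key, canonical, n=1, cutoff=cutoff)
        if not matches:
            raise ValueError(f"Unrecognised column header: {raw!r}")
        if matches[0] in mapping.values():
            raise ValueError(f"Duplicate mapping for column: {matches[0]!r}")
        mapping[raw] = matches[0]
    return mapping
```

With this sketch, a file whose headers read `Employee ID`, `Frist Name`, `last-name ` and `Start Date` resolves to the four canonical columns, while an unexpected header such as `Favourite Colour` raises an error for review. The cutoff is a trade-off: set too low, it silently maps unrelated columns; set too high, it rejects harmless typos.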
Related Case Studies
Xavier: AI-Powered Transformer Architecture Visualisation
Xephyr
We built Xavier — an interactive 3D visualisation of transformer architecture — to make complex AI concepts tangible for clients and prospects.
Financial Services
Real-Time Risk Data Platform for a European Fintech
Confidential — European Fintech
Rebuilt a legacy batch risk pipeline into a real-time lakehouse platform processing 4M events per day with sub-second latency.
Healthcare
Patient Outcome Analytics Platform for NHS Trust
Confidential — NHS Trust
Built a decision-centric analytics platform enabling clinical leads to identify at-risk patient cohorts 72 hours earlier than previous methods.
Retail Media
Group Retail Media Network Across Four Brands
Wesfarmers (OneDigital)
Designed the data operating model, identity architecture and business case that moved a multi-brand group retail media network from exploratory concept to board-approved execution across Bunnings, Officeworks, Priceline and Kmart.