Project Info

Project Description

Data is not the key to the future.
Data does not define success.
The key to the future, what defines success, is people.

We are a data integration platform tailored for the healthcare industry, standardising the way health data is shared and incentivising collaboration between hospitals, researchers, and service providers.

Data Story

Integrating data from different sources is often estimated to take up to 80% of a data scientist's time. We will unlock and connect these sources (including AIHW hospital records, ABS (SA3) demographic data and geospatial data), converting their many formats into a single spatially aware format such as GeoJSON.
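As a rough illustration of the conversion step described above, the sketch below turns flat tabular rows into a GeoJSON FeatureCollection using only the Python standard library. The function name records_to_geojson, the column names (hospital, sa3_code, longitude, latitude) and the sample rows are assumptions made for this example; they are not the platform's actual schema or real AIHW data.

```python
import json


def records_to_geojson(records, lon_key="longitude", lat_key="latitude"):
    """Convert flat dict records into a GeoJSON FeatureCollection of Points."""
    features = []
    for record in records:
        # GeoJSON positions are ordered [longitude, latitude].
        coordinates = [float(record[lon_key]), float(record[lat_key])]
        # Every non-coordinate column becomes a feature property.
        properties = {
            key: value
            for key, value in record.items()
            if key not in (lon_key, lat_key)
        }
        features.append({
            "type": "Feature",
            "geometry": {"type": "Point", "coordinates": coordinates},
            "properties": properties,
        })
    return {"type": "FeatureCollection", "features": features}


# Made-up rows for illustration only (hypothetical names and codes).
sample_rows = [
    {"hospital": "Example Hospital A", "sa3_code": "00001",
     "longitude": "151.2", "latitude": "-33.9"},
    {"hospital": "Example Hospital B", "sa3_code": "00002",
     "longitude": "144.9", "latitude": "-37.8"},
]

collection = records_to_geojson(sample_rows)
print(json.dumps(collection, indent=2))
```

Keeping every non-coordinate column as a property means later joins and analysis can work from the GeoJSON alone, without going back to the source tables.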

This enables users to focus on finding answers from the data: establishing new correlations, making more informed predictions for resource allocation, and better combating broader challenges such as antibiotic resistance and disease spread. Through this, users are incentivised to add their own data to the platform and collaborate.

In our proof of concept, we illustrate the significant improvement achieved through data integration, and the statistical analysis, such as multinomial logistic regression, that it makes possible.

Evidence of Work



Team DataSets

Australian Institute of Health and Welfare

Description of Use: This data provided the fundamental spatial dataset of hospital locations and health-related data Australia-wide. We joined the datasets to assist with visualising the data. By joining attribute data using the hospital name as the common linking key, we are able to visualise trends spatially rather than in the current tabular form.
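The name-based join described above could look roughly like the following sketch. It is a minimal left join in plain Python, assuming each row carries a hospital column; the helper names, the beds attribute and the sample rows are hypothetical. Names are normalised for case and whitespace before matching, since free-text hospital names rarely agree exactly across sources.

```python
def normalise_name(name):
    """Collapse case and whitespace so minor spelling differences still match."""
    return " ".join(name.casefold().split())


def join_by_hospital_name(locations, attributes, name_key="hospital"):
    """Left-join attribute rows onto location rows using the hospital name."""
    lookup = {normalise_name(row[name_key]): row for row in attributes}
    joined, unmatched = [], []
    for location in locations:
        match = lookup.get(normalise_name(location[name_key]))
        if match is None:
            # Keep the location, but record that no attributes were found.
            unmatched.append(location[name_key])
            joined.append(dict(location))
        else:
            merged = dict(match)
            merged.update(location)  # location fields win on conflicts
            joined.append(merged)
    return joined, unmatched


# Made-up rows for illustration only.
locations = [
    {"hospital": "Example Hospital A", "longitude": 151.2, "latitude": -33.9},
    {"hospital": "Example Hospital B", "longitude": 144.9, "latitude": -37.8},
]
attributes = [
    {"hospital": "example hospital  a", "beds": 120},
]

joined, unmatched = join_by_hospital_name(locations, attributes)
print(joined)
print("No attribute match for:", unmatched)
```

Returning the unmatched names alongside the joined rows makes it easy to spot where the linking key fails, which is the main weakness of joining on a name rather than a stable identifier.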

Data Set

ABS - Australian Statistical Geography

Description of Use: These boundaries were used to aggregate demographic data with health statistics from other data sources. The ABS statistical boundaries let us integrate various data sources within a spatial context.
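One plausible reading of the boundary-based aggregation above is to assign each point record to the statistical area that contains it and then sum a value per area. The sketch below does this with a simple ray-casting point-in-polygon test. It assumes each region is a single ring without holes, which real ABS boundaries are not, and the region codes, coordinates and beds values are invented for illustration.

```python
def point_in_polygon(lon, lat, ring):
    """Ray-casting test: True if the point lies inside the polygon ring."""
    inside = False
    n = len(ring)
    for i in range(n):
        x1, y1 = ring[i]
        x2, y2 = ring[(i + 1) % n]
        # Only edges that straddle the point's latitude can be crossed.
        if (y1 > lat) != (y2 > lat):
            x_cross = x1 + (lat - y1) * (x2 - x1) / (y2 - y1)
            if lon < x_cross:
                inside = not inside
    return inside


def aggregate_by_region(points, regions, value_key):
    """Sum value_key over the points that fall inside each region."""
    totals = {code: 0 for code in regions}
    for point in points:
        for code, ring in regions.items():
            if point_in_polygon(point["longitude"], point["latitude"], ring):
                totals[code] += point[value_key]
                break  # each point is assigned to at most one region
    return totals


# Two hypothetical square regions and made-up point records.
regions = {
    "R1": [(0, 0), (10, 0), (10, 10), (0, 10)],
    "R2": [(10, 0), (20, 0), (20, 10), (10, 10)],
}
points = [
    {"longitude": 5, "latitude": 5, "beds": 100},
    {"longitude": 15, "latitude": 5, "beds": 50},
    {"longitude": 2, "latitude": 8, "beds": 30},
    {"longitude": 25, "latitude": 5, "beds": 10},  # outside both regions
]

totals = aggregate_by_region(points, regions, "beds")
print(totals)
```

For production-scale boundaries, a geospatial library with spatial indexing would be a more realistic choice than this hand-rolled test.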

Data Set

ABS - Statistical Demographic Data

Description of Use: This data was used to join demographic data with local government areas. The aggregation serves as a proof of concept that multiple input datasets can be combined and visualised on our platform.

Data Set


