Insolvency, Facts vs Spin

Project Info

Altis Canberra thumbnail

Project Description


We trained random forest and neural network classification models to predict cases of insolvency non-compliance. From these models we were also able to extract useful indicators of potential non-compliance. We then combined the AFSA insolvency dataset with 7 other datasets to explore a range of potential additional correlations.


Data Story


When we began this project, we found it relatively easy to mash together a few datasets and generate visualisations that suggest one thing or another. However, we realised there was not necessarily any real basis for such claims, so we took a step back and adopted a more scientific approach.
Using random forests and a deep learning model built on a three-layer neural network, we trained our system to recognise the factors most strongly correlated with non-compliance, along with potential causal factors for personal insolvency.
Only once we had found these correlated factors did we choose the linking fields associated with them to look for relationships with other datasets. We are not claiming to have found definitive causes of non-compliance or insolvency, but we have built a robust system for identifying where potential causality could lie, to help identify avenues for further research.
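As a rough illustration of how a random forest can surface candidate indicators, here is a minimal sketch using scikit-learn. It is not our actual pipeline: the data is synthetic, and the feature names (`unsecured_debt`, `income`, `noise`) are hypothetical placeholders rather than fields from the AFSA dataset.

```python
import numpy as np
from sklearn.ensemble import RandomForestClassifier

rng = np.random.default_rng(0)
n_samples = 500

# Hypothetical feature names, for illustration only (not AFSA field names).
feature_names = ["unsecured_debt", "income", "noise"]
X = rng.normal(size=(n_samples, len(feature_names)))

# Synthetic "non-compliance" label, driven mostly by the first feature,
# weakly by the second, and not at all by the third.
signal = X[:, 0] + 0.3 * X[:, 1] + 0.1 * rng.normal(size=n_samples)
y = (signal > 0).astype(int)

model = RandomForestClassifier(n_estimators=200, random_state=0)
model.fit(X, y)

# Impurity-based importances sum to 1; higher means the feature was
# more useful for splitting. This indicates correlation, not causation.
ranked = sorted(
    zip(feature_names, model.feature_importances_),
    key=lambda pair: pair[1],
    reverse=True,
)
for name, score in ranked:
    print(f"{name}: {score:.3f}")
```

A ranking like this only tells us which fields the model leaned on, which is why we treated the top-ranked factors as starting points for linking other datasets rather than as conclusions.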


Evidence of Work

Video

Homepage

High-Res Image

Team DataSets

(National) Non-compliance in personal insolvencies

Data Set

Melbourne Housing Prices

Description of Use: Used to compute the average housing price per postcode, so that insolvency statistics in more expensive and less expensive suburbs could be compared

Data Set
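The per-postcode averaging described for the Melbourne housing data could look something like the sketch below. The records, field names, and prices are made up for illustration and are not taken from the dataset.

```python
from collections import defaultdict
from statistics import mean

# Hypothetical sale records; the real dataset has many more columns.
sales = [
    {"postcode": "3000", "price": 500_000},
    {"postcode": "3000", "price": 700_000},
    {"postcode": "3121", "price": 1_200_000},
]

# Group sale prices by postcode.
prices_by_postcode = defaultdict(list)
for sale in sales:
    prices_by_postcode[sale["postcode"]].append(sale["price"])

# Average price per postcode, ready to join against postcode-level data.
average_price = {
    postcode: mean(prices) for postcode, prices in prices_by_postcode.items()
}
print(average_price)
```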

Population Growth by SA3

Description of Use: Used to correlate population and population density with insolvency statistics

Data Set
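To make "correlate" concrete, the sketch below computes a Pearson correlation coefficient between two region-level series. The numbers are invented, and Pearson is our assumption here for illustration; other correlation measures would work equally well.

```python
import math

def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / math.sqrt(var_x * var_y)

# Hypothetical values, one entry per SA3 region (not real data).
population_density = [120.0, 450.0, 800.0, 1500.0, 2300.0]
insolvencies_per_1000 = [1.1, 1.9, 1.6, 2.8, 3.0]

r = pearson(population_density, insolvencies_per_1000)
print(f"Pearson r = {r:.2f}")
```

A value near 1 or -1 suggests a strong linear relationship and a value near 0 suggests little or none, but none of these values says anything about cause.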

ACT Crime Statistics

Description of Use: Used to generate crime statistics by SA3, by mapping each suburb to its postcode and then each postcode to its SA3

Data Set

Crime Stats Agency

Description of Use: Used to generate crime levels for Victorian SA3 regions, to correlate with insolvency statistics

Data Set

ASGS Geographic Correspondences (2016)

Description of Use: Used to map between postcode-based data and SA3-based data

Data Set
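Mapping postcode-level counts onto SA3 regions can be sketched as below. We assume each correspondence row gives the share of a postcode that falls within an SA3 (the `ratio` field here); the codes, ratios, and counts are hypothetical, and the real correspondence file should be checked for its exact column names.

```python
from collections import defaultdict

# Hypothetical correspondence rows: (postcode, sa3_code, ratio), where
# ratio is the assumed share of the postcode lying within that SA3.
correspondence = [
    ("2600", "80105", 0.75),
    ("2600", "80106", 0.25),
    ("2601", "80105", 1.0),
]

# Hypothetical postcode-level counts (e.g. incidents or insolvencies).
count_by_postcode = {"2600": 100, "2601": 40}

# Allocate each postcode's count to SA3 regions in proportion to the ratio.
count_by_sa3 = defaultdict(float)
for postcode, sa3_code, ratio in correspondence:
    count_by_sa3[sa3_code] += count_by_postcode.get(postcode, 0) * ratio

print(dict(count_by_sa3))
```

Splitting by ratio assumes the counts are spread evenly across each postcode, which is a simplification worth keeping in mind when reading any SA3-level result.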

Socio-economic Index for Individuals (SEIFI) 2006 - Index of Relative Socio-economic Disadvantage 4 groups

Description of Use: Used to correlate with insolvency statistics

Data Set

Personal Insolvency Statistics

Description of Use: Used to help compensate for the limited granularity of the insolvency dataset

Data Set

Challenges

Bounty: Is seeing truly believing?

Region: Australia

Challenge

Story telling with data

Region: Australian Capital Territory

Challenge

To bankruptcy or not to bankruptcy, keeping the process real.

Region: Australia

Challenge