We used the National Skills Commission Skills Classifications dataset, which includes a text description for each occupation title and ANZSCO code.
We used the Natural Language Toolkit (NLTK) and Term Frequency–Inverse Document Frequency (TF-IDF) to take in the ANZSCO titles and descriptions, break them down into keywords, and build the model.
When the user enters a text description, we repeat the same process to match it to the closest ANZSCO title or role description. This lets us classify the user against the existing codes and find where they sit in the model, which we display in a graphical format.
Dataset showing title and description against codes
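The matching step can be sketched roughly as below. This is a minimal, dependency-free illustration rather than our actual code: a regex tokenizer and a tiny stopword list stand in for NLTK, the smoothed IDF formula and cosine similarity are assumptions about how "closest" is measured, and the sample descriptions and helper names (build_model, closest_code) are invented for the example.

```python
import math
import re
from collections import Counter

# Illustrative rows only: the descriptions are invented, not taken from the dataset.
OCCUPATIONS = {
    "261313": "Software Engineer designs develops tests and maintains software applications and systems",
    "411411": "Enrolled Nurse provides nursing care to patients in hospitals and clinics",
    "351311": "Chef plans menus and prepares and cooks food in restaurants and kitchens",
}

# Tiny stand-in for a real stopword list such as the one shipped with NLTK.
STOPWORDS = {"a", "an", "and", "for", "i", "in", "of", "the", "to", "with"}


def tokenize(text):
    """Lowercase, keep alphabetic tokens, drop stopwords."""
    return [t for t in re.findall(r"[a-z]+", text.lower()) if t not in STOPWORDS]


def vectorize(tokens, idf):
    """Map tokens to {term: tf * idf}, ignoring terms the model has never seen."""
    counts = Counter(t for t in tokens if t in idf)
    return {term: count * idf[term] for term, count in counts.items()}


def build_model(docs):
    """Return (idf, vectors), where vectors maps each code to its TF-IDF vector."""
    tokens = {code: tokenize(text) for code, text in docs.items()}
    n_docs = len(tokens)
    doc_freq = Counter(term for toks in tokens.values() for term in set(toks))
    # Smoothed inverse document frequency (an assumed variant).
    idf = {term: math.log((1 + n_docs) / (1 + df)) + 1 for term, df in doc_freq.items()}
    vectors = {code: vectorize(toks, idf) for code, toks in tokens.items()}
    return idf, vectors


def cosine(a, b):
    """Cosine similarity between two sparse vectors stored as dicts."""
    dot = sum(weight * b.get(term, 0.0) for term, weight in a.items())
    norm_a = math.sqrt(sum(w * w for w in a.values()))
    norm_b = math.sqrt(sum(w * w for w in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def closest_code(user_text, idf, vectors):
    """Return the code whose description is most similar to the user's text."""
    query = vectorize(tokenize(user_text), idf)
    return max(vectors, key=lambda code: cosine(query, vectors[code]))


idf, vectors = build_model(OCCUPATIONS)
print(closest_code("I develop and test software systems", idf, vectors))
```

Because this sketch does no stemming, "develop" does not match "develops"; a stemmer such as NLTK's PorterStemmer would close that gap.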
This program uses another component of the National Skills Commission Skills Classifications dataset, which scores each role on its relative proficiency level across 10 core competencies. We use these scores as features to map out the roles and to display the relative retraining required along the x-axis of the map.
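As a rough illustration of how proficiency scores could become a single retraining value, the sketch below computes the per-competency gap between two roles and then a Euclidean distance over those gaps. The distance measure is an assumption made for this example, not necessarily what the program uses, and the scores and names (CURRENT_ROLE, TARGET_ROLE, retraining_distance) are invented; the real dataset carries 10 competencies per role.

```python
import math

# Invented scores for three competencies; the real data has 10 per ANZSCO code.
CURRENT_ROLE = {"Numeracy": 5, "Teamwork": 7, "Writing": 6}
TARGET_ROLE = {"Numeracy": 8, "Teamwork": 6, "Writing": 7}


def competency_gaps(current, target):
    """Per-competency difference: positive means the target role demands more."""
    return {name: target[name] - current[name] for name in current}


def retraining_distance(current, target):
    """Euclidean distance across all competency gaps (assumed measure)."""
    gaps = competency_gaps(current, target)
    return math.sqrt(sum(gap * gap for gap in gaps.values()))


print(competency_gaps(CURRENT_ROLE, TARGET_ROLE))
print(round(retraining_distance(CURRENT_ROLE, TARGET_ROLE), 2))
```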
The y-axis is the logarithm of the job vacancy count from the Labour Market Information Portal Job Vacancies Data, matched to the x-axis data using ANZSCO_Code as the key. This lets us plot the job map for the user, and the graph below shows the difference in core competencies between the two roles.
The Labour Market Information Portal - IVI Job Vacancies dataset counts online job advertisements on Seek, CareerOne and Australian Jobsearch and catalogues them by state and territory.
X-axis dataset showing proficiency ratings against codes
IVI dataset showing ANZSCO codes against job vacancies for y-axis
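Joining the two datasets on ANZSCO_Code and taking the logarithm of the vacancy count might look like the sketch below. The base of the logarithm is not stated above, so base 10 is an assumption here, and the x values, vacancy counts and function name (job_map_points) are hypothetical.

```python
import math

# Hypothetical values keyed by ANZSCO_Code.
RETRAINING_X = {"261313": 0.0, "262113": 2.4, "351311": 6.1}
VACANCIES = {"261313": 1000, "262113": 100, "411411": 250}


def job_map_points(x_by_code, vacancies_by_code):
    """Return {code: (x, log10(vacancies))} for codes present in both datasets."""
    points = {}
    for code in x_by_code.keys() & vacancies_by_code.keys():
        count = vacancies_by_code[code]
        if count > 0:  # the logarithm is undefined for zero vacancies
            points[code] = (x_by_code[code], math.log10(count))
    return points


for code, (x, y) in sorted(job_map_points(RETRAINING_X, VACANCIES).items()):
    print(code, x, y)
```

Codes that appear in only one dataset are dropped, which is the behaviour of an inner join; roles with zero vacancies are skipped rather than plotted.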
We wanted to incorporate the ABS API portal to show insights about the roles the user would look into. In this case we worked with the IT Specialist role, pulling in the data and then linking it up with the keys provided by the mentors in the Slack channel.
We used this to display a heatmap of IT Specialists by industry and company size.
ABS API Raw Data pull
ABS API Data Keys provided from Mentors on Slack
Heatmap displaying insights obtained from dataset.
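The reshaping behind a heatmap like this can be sketched as a pivot from flat records into an industry-by-size grid. The records, category labels and function name (pivot_counts) below are hypothetical and do not reflect the actual shape of the ABS API response.

```python
from collections import defaultdict

# Hypothetical (industry, company size, headcount) records.
RECORDS = [
    ("Finance", "Small", 120),
    ("Finance", "Large", 340),
    ("Health", "Small", 80),
    ("Health", "Large", 60),
    ("Finance", "Small", 30),
]


def pivot_counts(records):
    """Sum counts per (industry, size) and return row labels, column labels, grid."""
    totals = defaultdict(int)
    for industry, size, count in records:
        totals[(industry, size)] += count
    industries = sorted({industry for industry, _, _ in records})
    sizes = sorted({size for _, size, _ in records})
    grid = [[totals.get((industry, size), 0) for size in sizes] for industry in industries]
    return industries, sizes, grid


industries, sizes, grid = pivot_counts(RECORDS)
for industry, row in zip(industries, grid):
    print(industry, dict(zip(sizes, row)))
```

The resulting grid can be handed to a plotting library, for example matplotlib's imshow, with the two label lists used for the axis ticks.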
Source code for ABS API Exploration at Colab
Full write-up and more at Website