R-Ladies Meet-up

Teaching Computers to See Scatterplots with Scagnostics

Harriet Mason

Package Co-Authors: Di Cook, Ursula Laa, Stuart Lee

2023-11-08

Overview

Big Data - Scagnostics - Cassowaryr - AFLW

Hi everyone, I’m Harriet Mason, a PhD student at Monash University.
Today I’m going to be talking about scagnostics and the package that calculates them, cassowaryr.
What are scagnostics, you may be thinking? It is pretty likely you have never come across the term before.
They are a group of measures that evaluate the visual features of a scatterplot.
Scatterplots are particularly useful for examining all kinds of association between variables,
and we assess that association by looking at the shape made by the points in a scatterplot, that is, its visual features.
Unfortunately, big data has too many variables to plot them all.
So what if, instead of looking at every pairwise plot, we picked out an interesting subset and only looked at those? That is the main idea behind scagnostics.
In this presentation I’m first going to explain how scagnostics work, then I’m going to explain the structure of the cassowaryr package that calculates the scagnostics, and finally I’ll show how you can use the package yourself by going through an example using Australian football league statistics.
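
To make the "score every pair, then only look at the interesting ones" idea concrete, here is a minimal sketch in plain Python. It is not the cassowaryr API and not how the package is implemented. It assumes a single illustrative measure, the squared Spearman rank correlation (used here as a stand-in for a "monotonic" style score), and the names `monotonic_score`, `rank_pairs`, and the `toy` data are hypothetical.

```python
from itertools import combinations


def ranks(values):
    """Return 1-based ranks; tied values share their average rank."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    out = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values tied with position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            out[order[k]] = mean_rank
        i = j + 1
    return out


def pearson(x, y):
    """Pearson correlation; returns 0.0 if either variable is constant."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        return 0.0
    return sxy / (sxx * syy) ** 0.5


def monotonic_score(x, y):
    """Illustrative measure (assumption): squared Spearman rank correlation."""
    return pearson(ranks(x), ranks(y)) ** 2


def rank_pairs(data, score=monotonic_score):
    """Score every pair of variables and sort from most to least interesting."""
    scored = [
        (score(data[a], data[b]), a, b)
        for a, b in combinations(sorted(data), 2)
    ]
    return sorted(scored, reverse=True)


# Hypothetical toy data: b increases with a, c is a shuffled sequence.
toy = {
    "a": [1, 2, 3, 4, 5, 6],
    "b": [1, 4, 9, 16, 25, 36],
    "c": [2, 6, 1, 5, 3, 4],
}

for value, first, second in rank_pairs(toy):
    print(f"{first} vs {second}: {value:.3f}")
```

With many variables you would plot only the first few pairs in the sorted list rather than every pairwise scatterplot. A real set of scagnostics uses several measures at once, so a pair can be interesting for different visual reasons.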
