Topological Data Analysis Ball Mapper for the Social Sciences Workshop
Just Added

Topological Data Analysis Ball Mapper for the Social Sciences Workshop

By methods@manchester

Overview

This workshop introduces the Topological Data Analysis Ball Mapper algorithm as a model free tool for visualising data. By Dr Simon Rudkin.

This workshop introduces the Topological Data Analysis Ball Mapper algorithm as a model free tool for visualising data and is led by Dr Simon Rudkin. Dr Rudkin is a member of the University of Manchester's Cathie Marsh Institute for Social Research (CMI) and his work has pioneered application of Ball Mapper in the social sciences. The workshop will showcase the strengths of the algorithm and the research opportunities available for exploitation.


The importance of data visualisation in statistical analysis is understood from Anscombe’s Quartet (Anscombe, 1973). In the example, four datasets with identical linear regression fits are plotted as scatter plots. The independent variable is on the horizontal axis and the dependent variable is on the vertical axis. By seeing the data, it is very clear that the linear regression is only appropriate for one of the four cases. Moreover, the datasets have identical means, standard deviations and correlation. The cautionary tale provided is often neglected in empirical work. The scatter plots that Anscombe (1973) uses are two-dimensional, but the lessons apply when the dataset has more variables.


The BallMapper algorithm is a tool of Topological Data Analysis (TDA) proposed in the original working paper of Dlotko (2019). TDA considers data as points in a multidimensional space, a point cloud. The TDA toolkit is developed to analyse the point cloud. This workshop focuses on the use of TDA for data visualisation. Ball Mapper allows the user to understand the shape of their data in exactly the way the scatter plot allows the understanding of two-dimensional data. Because paper is inherently two-dimensional, it is necessary to apply a mapping to convert the multi-dimensional space into something that can be visualised in research. The Ball Mapper algorithm creates an abstract visualisation of multidimensional data. We will see how the visualisation is produced, and how the visualisations created by Ball Mapper are interpreted. Examples will be provided with UK Census data.


The strength of the Ball Mapper algorithm for visualising data in the social sciences can be seen in Rudkin and Webber (2024), Rudkin et al. (2024), Otway and Rudkin (2024), Rudkin and Dlotko (2024), and Tubadji and Rudkin (2025). Examples from Finance with extensions of the visualisation algorithm include Qiu et al. (2020), Dlotko et al. (2024) and Rudkin et al. (2025). These examples give a flavour of the additional benefits the Ball Mapper algorithm can bring to your data.


The workshop will be conducted using R and Python. Data and code files will be shared with participants via GitHub. R users will need the package BallMapper on their computer. Python users will need the PyBallMapper library. For those who would like to look at their own data, support will be given during the practical elements of the workshop.


Not sure if your data is suitable?

We have a one-hour virtual pre-meeting for participants who would like to know more about the suitability of their data for visualising with Ball Mapper.

The basic guide is that if you would be happy to draw a scatter plot with your variables, the construction of the point cloud makes sense. As a rule, the axis variables for the cloud should be ordinal and have sufficiently many different values. Email enquiries to Simon.Rudkin@manchester.ac.uk on suitability of data are also welcomed.

Category: Science & Tech, Science

Speakers

Good to know

Highlights

  • 6 hours
  • In person

Location

The University of Manchester

Oxford Road

Manchester M13 9PL United Kingdom

How do you want to get there?

Agenda
10:00 - 12:00

Session 1 - Introduction

Session 1 will focus on implementing the BallMapper algorithm. This will be geared for those applying the code for the first time.

12:00 - 14:00

Lunch

Vegetarian/vegan lunch will be provided.

14:00 - 16:00

Session 2 - Inference

Building on session 1, attendees will focus on the inference that can be drawn from BallMapper algorithm output. Participants who feel confident with the basic implementation may choose to attend only Session 2.

Frequently asked questions

Organised by

methods@manchester

Followers

--

Events

--

Hosting

--

Free
Nov 18 · 10:00 GMT