Pilot of collecting depersonalised WiFi data reveals how information could be used to reduce crowding and prioritise transport investment

"This pilot has revealed useful insights into how people criss-cross London using the Tube, and the potential benefits this depersonalised data could unlock, from providing better customer data to helping address overcrowding, are enormous"

Data reveals 18 different routes taken by customers between King's Cross St Pancras and Waterloo - and many people don't use the shortest route changing Tube lines

TfL now working with the Information Commissioner's Office, privacy campaigners and consumer groups about how the data collection could be undertaken on a permanent basis

Tube journeys could be improved through TfL harnessing WiFi data to make more information available to customers as they move around London, new research has shown.

The four-week TfL pilot, which ran between November and December last year, studied how depersonalised WiFi connection data from customers' mobile devices could be used to better understand how people navigate the London Underground network, allowing TfL to improve the experience for customers.

The pilot focused on 54 stations within Zones 1-4 and saw more than 509 million depersonalised 'probing requests', or pieces of data, collected from 5.6 million mobile devices making around 42 million journeys. The data collected was depersonalised, so that no individuals could be identified, and no browsing data was collected from devices. No data collected through the trial was made available to any third parties, and the pilot included clear communication with customers about how to opt out should they wish to do so.

These journeys were analysed by TfL's in-house analytics team and broken into different aggregated 'movement types' to help understand what customers were doing at particular points of their journeys - such as entering or exiting a station, changing between lines or just passing through the station while on a train.

By using this data, TfL was able to get a much more accurate understanding of how people move through stations, interchange between services and how crowding develops.

The pilot revealed a number of results that could not have been detected from ticketing data or paper-based surveys. For example, analysis showed that customers travelling between King's Cross St Pancras and Waterloo take at least 18 different routes, with around 40% of customers observed not taking one of the two most popular routes.

The data collected through the WiFi pilot could have a number of benefits for TfL and its customers, including:

Allowing staff to better inform customers of the best way to avoid disruption or unnecessary crowding

Helping customers plan the route that best suits them - whether based on travel time, crowding or walking distance

Enabling greater sophistication in proving real-time information to customers as they travel across London

Helping further prioritise transport investment to improve services and address regular congestion points - ensuring the maximum benefits to customers

Providing a better insight on customer flows which could increase commercial revenue from companies which advertise or rent retail units on the transport network

While the usual ticketing data for major interchange stations such as Oxford Circus can show the levels of people entering and exiting the stations, it cannot show the huge numbers of people interchanging during peak hours, or precise local areas where crowding occurs on platforms or around escalators, whereas WiFi data can.

TfL has now begun discussions with key stakeholders, including the Information Commissioner's Office, privacy campaigners and consumer groups about how this data collection could be undertaken on a permanent basis, possibly across the full Tube network.

Val Shawcross, Deputy Mayor for Transport, said: "We're determined to use the latest technology to improve the experience of every passenger using transport in London, and I'm delighted the trial has been a success. The analysis of secure, depersonalised WiFi data could enable us to map the journey patterns of millions of passengers and understand in much greater detail how people move around our transport network. It will provide real benefits helping TfL tackle overcrowding, provide more information for passengers about their best journey route, and help us prioritise new investment where it's most needed."

Lauren Sager Weinstein, Chief Data Officer at Transport for London, said: "Technology is transforming our lives, from how we work and enjoy our leisure time to the way we travel. This pilot has revealed useful insights into how people criss-cross London using the Tube, and the potential benefits this depersonalised data could unlock, from providing better customer data to helping address overcrowding, are enormous.

"We are now working closely with key stakeholders to examine our next steps and, as with the pilot, will keep our customers informed while also respecting their privacy and offering a way to opt-out should they wish."

Sue Daley, Head of Programme for Cloud, Data, Analytics and AI, techUK, said: "The TfL Wi-Fi pilot is a powerful example of how data collection and analysis can make a real difference to our everyday lives. By applying big data analytics and machine learning technologies, TfL gained real-time understanding of how people are using the Capital's transport system and these insights will help reduce overcrowding, improve service efficiency and customise information for travellers.

"The transparency and openness shown by TfL is to be applauded. The steps taken to make customers aware of the data collection and its purpose should be seen as a blueprint for others. If UK organisations are to realise the full potential of real-time data-driven decisions, it is vital that we bring the public on this journey by building a culture of data trust and confidence."

Dr Hannah Fry from the Centre for Advanced Spatial Analysis at University College London, said: "By doing this study, TfL have demonstrated the very real way that big data can benefit us all. Using WiFi to understand how people are moving through underground stations gives us the chance to choose what we want from our journeys. Prefer a less busy train and don't mind waiting an extra five minutes? Have a massive bag you need to carry through the station and looking for the emptiest route between platforms? By knowing where people are, TfL can offer those alternatives and, in turn, change our experience of using the Tube.

"The WiFi data offers a completely new way to view what's happening underground. It exposes the pinch-points in the network and can help TfL to understand how and why overcrowding happens - an essential step to making the Tube as safe and efficient as possible. As a Londoner whose journeys are no doubt included in the data collected, I was impressed by how far TfL have gone to take how we feel about our privacy seriously, at every stage they have preserved our anonymity, been transparent about the way the data is used and offered us the option to opt out. Their study serves as an exemplary model of how to treat your customers in the era of big data."

For more information, and to download the full report, please visit tfl.gov.uk/corporate/publications-and-reports/wifi-data-collection

Notes to editors: