Alex Pentland was once named one of the seven most influential data scientists in the world. Five years ago, his ground-breaking book Social Physics described how ideas flow through social networks and are ultimately transformed into behaviours.
It demonstrated the value of âdigital breadcrumbsâ â vast quantities of small pieces of data created by the widespread use of mobile phones, credit cards and social media. This information shows what people really do with their lives, not what they say they do when answering surveys.
Professor Pentland, based at the globally-renowned Massachusetts Institute of Technology, believes collecting and analysing this data could help tackle an enormous range of global problems.
This is because the information is genuine, real-time and paints a clear picture of interactions (social physics) at a neighbourhood level. He relates this specifically to COVID-19: âYou can do a good job managing public health using census data, but you need neighbourhood level data to see disparities in health outcomes. You cannot answer âWhy are these people dying and those people not?â until you make local-level maps.â
He cites one specific recent example: âOur COVID-19 work in New York showed many people were getting infected in grocery stores, so they changed the policy and brought orders outside rather than letting people in. âYou can also use data to identify local patterns, like more people suddenly going to hospital, to predict where outbreaks will happen. Israel uses that operationally to identify COVID-19 outbreaks as early as possible.â
Professor Pentland says greater use of live data is crucial: âWe have seen it deployed recently to identify appropriate social distancing and the use of smart thermometers.
âWe must start using real-time data globally to understand things as they happen and act in a timely way. We see governments doing more but old habits die hard; it will take time.
âItâs frustrating to look at the potential of what data can deliver and what it actually delivers, but this agenda involves re-training enormous numbers of people and massive investment. Itâs a huge endeavour; society and big systems donât change quickly.â
Professor Pentland is fascinated by the tension between personal privacy and using big data for public good; what he calls, in Social Physics, the âconsensus around trade-offs between personal and social valuesâ.
He relates this to the use of personal phone data to help track the movement of infected people and trace those they came into contact with, to slow the spread of COVID-19.
South Korea was lauded for using a government app to flatten its Covid-curve, but Professor Pentland warns that couldnât happen in most democratic societies. âTo be effective, you need a small number of cases and for everybody to use the app. âSouth Korea could do that, but in the UK or US, privacy concerns arise. The âBig Brotherâ aspect comes up, but you can ameliorate that.
âIf you are sick in hospital and asked who you had contact with, itâs hard to remember. But you might be happy for the medics to check your phone because you are protecting your friends. Where itâs a problem is big national databases.â
"To avoid data being concentrated in the hands of powerful corporations and governments, we need data unions (or data co-operatives) to empower communities to understand data and control its use"
So how far have we come in moving to a consensus around social good and personal privacy since Social Physics was written?
âThere was relatively little conversation in 2015, but after the Cambridge Analytica scandal in 2018, we heard lots about evil surveillance. That conversation forgot about the good side of data â spotting crop failures, highlighting corruption, caring for under-privileged communities.
âI think the pandemic has brought the good side of data to the fore in terms of helping tackle huge health problems. I think a consensus is developing that data, particularly neighbourhood data, is a public good. You have to aggregate and anonymise the data to protect privacy, but this neighborhood data can let us start to see if governments are doing the right thing⊠or not.â
This is all part of the New Deal on Data, which Professor Pentland first proposed to the World Economic Forum in 2007: âSignificant figures around the world were involved back then and bought into the idea, which led to what became GDPR in Europe. Those ideas also led to big data being recognised as a way to measure UN Sustainable Development Goals.
âThatâs real change. These are important steps in using big data to forge a much more transparent world.â
However, there is a wider question about who data is for, Professor Pentland believes:
âWhen large organisations take control, individual citizens lose out. Large-scale industrialisation led to workers forming labour unions, while the power of the banks led to credit unions. To avoid data being concentrated in the hands of powerful corporations and governments, we need data unions (or data co-operatives) to empower communities to understand data and control its use.
âItâs about moving away from centralisation of data to distributed control. We have the community-level data now to allow that to happen.â
This article was originally published here for the The Data-Driven Innovation initiative. #DDIdiscussions.
The Data-Driven Innovation initiative is part of the Edinburgh and South East Scotland City Region Deal, and aims to help organisations and all our citizens benefit from the data revolution.
Working together to deliver the 10-year programme are the University of Edinburgh and Heriot-Watt University, whose researchers will collaborate with industry on data partnerships in the public, private and third sectors.