Scientific Reports

Machine learning in the prediction of human wellbeing

Scientific Reports

Authors: Prof. Jan-Emmanuel De Neve Prof. Andrew Clark Dr. Caspar Kaiser Dr. Ekaterina (Katya) Oparina


Ekaterina Oparina, Caspar Kaiser, Niccolò Gentile, Alexandre Tkatchenko, Andrew E. Clark, Jan-Emmanuel De Neve and Conchita D’Ambrosio

Abstract

Subjective wellbeing data are increasingly used across the social sciences. Yet, despite the widespread use of such data, the predictive power of approaches commonly used to model wellbeing is only limited. In response, we here use tree-based Machine Learning (ML) algorithms to provide a better understanding of respondents’ self-reported wellbeing. We analyse representative samples of more than one million respondents from Germany, the UK, and the United States, using data from 2010 to 2018. We make three contributions. First, we show that ML algorithms can indeed yield better predictive performance than standard approaches, and establish an upper bound on the predictability of wellbeing scores with survey data. Second, we use ML to identify the key drivers of evaluative wellbeing. We show that the variables emphasised in the earlier intuition- and theory-based literature also appear in ML analyses. Third, we illustrate how ML can be used to make a judgement about functional forms, including the existence of satiation points in the effects of income and the U-shaped relationship between age and wellbeing.