The CBR Leximetric Datasets are the product of work carried out at the Centre for Business Research (CBR) in Cambridge, beginning in 2005 when the Centre received funding from the Economic and Social Research Council to carry out a research project on law, development and finance. Further funding from the ESRC, the European Union’s FP5 and FP6 programmes, the Isaac Newton Trust, the Cambridge Political Economy Society and the International Labour Organization made it possible to expand the original datasets. Current work on the datasets is funded by the ESRC via its Digit Research Centre (Digital Futures at Work) and by the NORFACE consortium through its support for the POPBACK project (Populist Backlash, Democratic Backsliding, and the Future of the Rule of Law in Europe.
As of July 2016, there are 3 principal datasets, coding, respectively, for labour laws in 117 countries between 1970 and 2013 (the CBR Labour Regulation Index), shareholder protection in 30 countries between 1990 and 2013 (the CBR Extended Shareholder Protection Index), and creditor protection in 30 countries between 1990 and 2013 (the CBR Extended Creditor Protection Index). The coding of legal data is carried out using a so-called leximetric coding methodology developed in the CBR and more fully explained in the codebooks which accompany each of the datasets.
Taken together, the datasets provide a unique time series which enables researchers and other research users to track changes in labour, company and insolvency law over long periods of time for many countries. A distinguishing feature of these datasets is that all legal sources for the data coding are fully described in the relevant codebooks, thereby assisting transparency, external validity and replicability of results. The work of further developing the datasets on shareholder and creditor rights, so that they match the labour regulation index in terms of years and countries covered, is ongoing. A new version of the Labour Regulation Index, updated to the end of 2022, will be published in the course of 2023.
Each dataset takes the form of an Excel spreadsheet containing the data and a Codebook containing the sources of the coding and an explanation of the coding methodology.
Every effort has been made to ensure that the laws coded in the datasets are accurately sourced and coded, but the scale of the data collected as part of this exercise means that errors or omissions are possible. We are pleased to receive clarifications and corrections and will periodically report on any necessary updates to the datasets to reflect information received.
If you wish to provide a clarification, and/or have an inquiry relating to any of the datasets, please contact Simon Deakin.