The continuous sample of working lives: improving its representativeness

  1. Juan Manuel Pérez-Salamero González 1
  2. Marta Regúlez-Castillo 2
  3. Carlos Vidal-Meliá 1

  1. 1 Universitat de València, Valencia, Spain
  2. 2 Universidad del País Vasco/Euskal Herriko Unibertsitatea, Lejona, Spain

SERIEs : Journal of the Spanish Economic Association

ISSN: 1869-4195

Year of publication: 2017

Volume: 8

Issue: 1

Pages: 43-95

Type: Article

DOI: 10.1007/S13209-017-0154-0 (open access)


This paper studies the representativeness of the Continuous Sample of Working Lives (CSWL), a set of anonymized microdata containing information on individuals from Spanish Social Security records. We examine several CSWL waves (2005–2013) and show that the sample is not representative of the population receiving a pension income. We then develop a methodology for drawing a large dataset from the CSWL that is much more representative of the retired population in terms of pension type, gender and age. This procedure also lets users trade off goodness of fit against subsample size. To illustrate the practical significance of our methodology, the paper includes an application in which we generate a large subsample distribution from the 2010 CSWL. The results are striking: with a very small reduction in the size of the original CSWL, we significantly reduce errors in estimating pension expenditure for 2010, with a p value greater than or equal to 0.999.
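
The idea of trimming a sample so that its composition matches the population can be illustrated with a minimal sketch. This is not the authors' procedure: it is a simple proportional-trimming heuristic, and the strata, the counts, and the function names `chi_square_stat` and `proportional_subsample` are all hypothetical. It computes the Pearson chi-square statistic of stratum counts against population shares, then finds the largest subsample whose stratum counts follow those shares up to rounding.

```python
from fractions import Fraction
from math import floor


def chi_square_stat(counts, shares):
    """Pearson chi-square statistic of observed counts against population shares."""
    n = sum(counts.values())
    return float(sum(
        (counts[k] - n * shares[k]) ** 2 / (n * shares[k]) for k in shares
    ))


def proportional_subsample(counts, shares):
    """Largest per-stratum counts that follow the population shares (rounded down)."""
    # The scarcest stratum relative to its population share caps the subsample size.
    n_max = min(Fraction(counts[k]) / shares[k] for k in shares)
    return {k: floor(n_max * shares[k]) for k in shares}


# Hypothetical figures, invented for illustration only (not CSWL data).
population_counts = {"retirement": 6000, "disability": 1000, "widowhood": 3000}
sample_counts = {"retirement": 6500, "disability": 1000, "widowhood": 2500}

total = sum(population_counts.values())
shares = {k: Fraction(v, total) for k, v in population_counts.items()}

subsample = proportional_subsample(sample_counts, shares)
stat_before = chi_square_stat(sample_counts, shares)
stat_after = chi_square_stat(subsample, shares)
```

In this toy case the subsample keeps 8,333 of the 10,000 records and the statistic falls from 125 to nearly zero. Keeping more records than strict proportionality allows would raise the statistic, which is the kind of trade-off between goodness of fit and subsample size that the abstract mentions. A p value could be obtained from the chi-square distribution with (number of strata minus one) degrees of freedom.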

Funding information