This report assesses the quality and consistency of age and date reports in DHS surveys conducted since 2000 in 67 countries. It is the most recent of several reports on various aspects of DHS data quality. The first chapter describes the steps of editing and imputing during fieldwork and data processing. Great
care is taken to train and supervise the interviewers to obtain the best possible estimates of ages and dates. The eligibility of adults for the surveys of women and men depends on obtaining accurate values of age
near the lower and upper age boundaries within the household survey. The eligibility of young children for the detailed health questions depends on obtaining accurate estimates of when they were born within the
surveys of women. Age-specific fertility rates, under-five mortality rates, immunization rates, anthropometry scores, and many other DHS indicators depend on accurate estimates of age.
An appendix provides an inventory of all the locations in DHS surveys where the respondents are asked for ages and dates. This assessment focuses on just a few of those locations: the ages of all household members, provided by the household respondent during the household survey; the self-reported ages and birthdates of women and men in the surveys of women and men; women’s self-reports of age and date of first union
in the survey of women; the birthdates (and ages, if living) of children in the birth histories, provided by the mother; and the women’s and men’s estimates of their respective spouses’ ages in the surveys of women
The second chapter assesses the ages listed above, other than spousal estimates, in terms of three types of measures: incompleteness, heaping, and transfers. A total of 11 indicators are used. For each indicator, the
distribution across all surveys is described and the surveys with the most extreme levels are identified. All of these measures vary substantially. There are many surveys with values close to zero on all measures, and
others with very high values. There are some surveys in which month of birth is hardly ever given. Age/date transfers are sometimes large but in a direction opposite to what we would expect, particularly around age
15 or around the date for the health questions, clearly as a result of over-correction during training and supervision. Surveys with extreme values are listed. Summary indices of incompleteness, heaping, and
transfers are constructed and tracked over time. The indicators fluctuated substantially from 2000 to 2015 and did not show a systematic trend. A single composite index is constructed for each of the 67 countries.
The countries in the highest quintile (with the most problems) and the lowest quintile (with the fewest problems) are identified.
The third chapter investigates the quality and consistency of spousal age estimates compared to self-reports in the surveys that included interviews of men and where women and/or men were asked to estimate the age of their spouse(s). In the absence of an age gap between spouses, women tend to estimate that their husbands are older than their self-reported age and men tend to estimate that their wives are younger than their self-reported age. Evidence indicates that where there is an age difference between spouses, women tend to estimate in a way that reduces the gap: they underestimate the age of older husbands and overestimate the age of younger husbands. Men underestimate the age of wives who are older than they are, which reduces the gap, but they also tend to underestimate the age of wives who are younger, which increases the gap. In the vast majority of countries, there was more heaping for estimates of spouse’s age
than for self-reported age. There is evidence that displacement and heaping, in particular, can be reduced, through training and
supervision, but there is also evidence that too much focus on displacement of children or on heaping at final digit 0 can lead to over-correction. The biggest determinant of good age reporting is probably the
value, to the respondents, in everyday life, of knowing their ages or the ages of their children. This component of data quality varies from one setting to another and is outside the control of a survey operation.