The estimates of vaccination coverage are measured from administrative data and from population based survey. While both card-based and recall data are collected through population survey, and the recall is when the card is missing, the preferred estimates remain of the card-based due to limited validity of parental recalls. As there is a concern of missing cards in poor settings, the evidence on validity of parental recalls is limited and varied across vaccine types, and therefore timely and needed. We validated the recalls against card-based data based on population survey in Tanzania.
We used a cross-sectional survey of about 3000 households with women who delivered in the last 12 months prior to the interview in 2012 from three regions in Tanzania. Data on the vaccination status on four vaccine types were collected using two data sources, card and recall-based. We compared the level of agreement and identified the recall bias between the two data sources. We further computed the sensitivity and specificity of parental recalls, and used a multivariate logit model to identify the determinants of parental recall bias.
Most parents (85.4%) were able to present the vaccination cards during the survey, and these were used for analysis. Although the coverage levels were generally similar across data sources, the recall-based data slightly overestimated the coverage estimates. The level of agreement between the two data sources was high above 94%, with minimal recall bias of less than 6%. The recall bias due to over-reporting were slightly higher than that due to under-reporting. The sensitivity of parental recalls was generally high for all vaccine types, while the specificity was generally low across vaccine types except for measles. The minimal recall bias for DPT and measles were associated with the mother’s age, education level, health insurance status, region location and child age.
Parental recalls when compared to card-based data are hugely accurate with minimal recall bias in Tanzania. Our findings support the use of parental recall collected through surveys to identify the child vaccination status in the absence of vaccination cards. The use of recall data alongside card-based estimates also ensures more representative coverage estimates.
In an effort to achieve the sustainable development goal three (SDG 3) and further reduce under-five mortality and morbidity , effective provision of lifesaving childhood immunisations is urgently needed [2,3,4]. Immunisation is one of the most important public health interventions, in terms of potential health impact and cost effectiveness, and has been universally recommended . It is therefore important to track and monitor progress, and evaluate immunisation programs such as the Expanded Programme on Immunisation (EPI) and the Global Alliance for Vaccines and Immunisations (GAVI) by estimating vaccination coverage through routine and/or any other reliable measurements. Vaccination coverage estimates are important for planning, priority setting, and implementing interventions to improve population health [4, 6,7,8].
There are two main methods of estimating vaccination coverage [4, 9]: provider administrative records as routinely collected information during service delivery, and population-based household surveys such as Multiple Indicators Cluster Surveys (MICS)  and Demographic and Health Surveys (DHS) . Each method has pros and cons . In low- and middle- income countries (LMICs) for example, coverage based on administrative data has the advantage of being routinely available [12, 13], but its reliability has been questioned due to bias (inaccuracy in population denominators, registration, or reporting) [2, 4, 9, 14] and missing data on community outreach and national vaccination day campaigns. Vaccination coverage based on population surveys are based on valid population denominators [8, 15], but they cover only a sampled population and surveys are not carried out routinely due to resource constraints . Population surveys have also been used as an independent check on coverage levels estimated through routine administrative data [12, 16].
Vaccination coverage is commonly assessed within household surveys with reference to vaccination cards and/or parental recall [4, 16]. Although data from vaccination cards are considered more reliable than recall, both approaches are prone to selection bias (use of an inaccurate sampling frame) and information bias (inaccurate data from respondents) [8, 17].
The absence of vaccination cards within households is a frequent issue in LMIC settings. For example, more than half of mothers surveyed in India  and Sudan  did not have vaccination cards; about one-third of mothers in Bangladesh had no cards ; and almost one-third of households on average did not have cards according to population surveys across 101 countries, mostly in LMICs . Given this, in LMICs the estimated vaccination coverage through population surveys may be incomplete where cards are missing, and data from parental recall provides an alternative in such cases.
Evidence on the validity of parental recall for measuring vaccination coverage in relation to vaccination cards or routine administrative data is limited in LMICs, especially in Sub-Saharan Africa. For example, when compared to administrative data, parental recall was generally found to underestimate vaccination coverage across vaccine types [4, 7, 22, 23]. On the other hand, when compared to data from vaccination cards, parental recall was found to be relatively consistent overall with slight variation across vaccine types and settings [19, 20, 24,25,26]. However, these studies are dated, with no studies being carried out in the last 10 years. In addition, among the available studies only two were in Sub-Saharan Africa, and their focus has been on measles and diphtheria-tetanus-pertussis (DPT) vaccine types, with less attention to other vaccines. To date there has been no study from East Africa, although missing vaccination cards are a concern in these countries. Since population surveys are commonly conducted to estimate vaccination coverage in LMICs, it is important to assess whether parental recall is a valid measure of coverage that could supplement card-based estimates where cards are missing to provide coverage estimates. Further, there has been very limited attention to the determinants or factors related with recall bias for vaccination coverage, with few exceptions such as [19, 20, 26].
In this study, we assessed the validity of vaccination coverage data based on parental recall, as compared to data from vaccination cards, and identified the individual and household-level determinants of recall bias in Tanzania. We used data from a cross-sectional household survey (for households with a child of less than 12 months of age) capturing data from vaccination cards and from parental recall based on four vaccine types: Bacillus Calmette–Guérin (BCG) vaccine, oral polio vaccine (OPV), diphtheria-tetanus-pertussis (DTP) vaccine, and measles vaccine. The BCG vaccine and the first doses of OPV and DPT are administered after birth, while the first dose of measles vaccine is administered after nine months. Our household survey data were selected in preference to national DHS data as the latter only ask for parental recall when the vaccination card or its data are missing.
This study was conducted in 11 districts from three regions (i.e. Pwani, Morogoro and Lindi) out of 30 regions in Tanzania. The population of Pwani region is just above a million, over two million in Morogoro region, and less than a million in Lindi region . The data were derived from a baseline survey done as part of an impact evaluation of a payment for performance (P4P) programme in Pwani region [28, 29]. The survey was done in all districts of Pwani region and in four districts from Morogoro and Lindi region (Morogoro rural, Morogoro urban, Mvomero, and Kilwa). The four districts from Morogoro and Lindi region were selected for comparison purpose such that they were geographically contiguous to districts in Pwani and similar in relation to indicators of: poverty, literacy rates, rate of institutional deliveries, infant mortality, population per health facility, and the number of children under 1 year of age per capita . Comparison districts were also free from programmes to improve maternal and child health, which could confound the evaluation of P4P.
Data sources and sampling
Data were collected from a cross-sectional household survey across all 11 districts in the three regions that were survey. Health facilities were the primary sampling unit. We included all 6 hospitals and 16 health centres that were eligible for inclusion in P4P, and randomly sampled 53 dispensaries from Pwani region. An equivalent number of facilities in comparison districts were sampled by facility level of care. In total, we included equal number of facilities between study arms; i.e. 75 facilities from Pwani region (intervention arm) and 75 facilities from Morogoro and Lindi (comparison arm). We also randomly sampled 20 households of women aged (15–49 years) who had delivered in 12 months prior to the survey from the catchment area of each health facility . In total, we aimed to surveyed 3000 households with eligible women from across the three regions. The sampling strategy of facilities and households is detailed elsewhere .
The household survey was carried out in January and February 2012. Trained field researchers administered a structured questionnaire to the household head and to an eligible woman with a child of less than 12 months of age. The survey questionnaire was adapted from the World Bank Impact Evaluation Toolkit . The household questionnaire captured information on household background characteristics (e.g. ownership of assets) to assess the household’s socioeconomic status. The women’s questionnaire captured data on background characteristics of women, and data on service utilisation for maternal and child health services, including childhood vaccination status.
Measures of vaccination status
Four vaccination types were considered: BCG vaccine, OPV, DTP vaccine, and measles vaccine. Vaccination status was obtained by reviewing the vaccination card where available and through recording the appropriate vaccine type and number of doses received. All parents were also asked to recall the vaccination status of their child for any dose across vaccine types. However, the validity of parental recall was assessed among parents with both information on recall and vaccination card. Parental recall was assessed mainly in terms of vaccination coverage rather than number of doses.
We first calculated vaccination coverage levels from vaccination cards and parental recalls across all vaccine types considered. To test whether parental recall over- or under-estimated vaccination coverage relative to cards, we compared the level of agreement in coverage levels derived from each source, for those observations that contained estimates from both sources: recall and vaccination card.
We compared parental recall to card-based data, assuming the latter to be accurate. Recall bias was defined as the discrepancy in vaccination status between the two data sources (i.e. false positives and false negatives). We then disaggregated the recall bias into whether parental recall over-reported (recalled as vaccinated while not) or under-reported (recalled as not–vaccinated while vaccinated). We further computed the level of agreement between the estimates from the two data sources, and estimated the sensitivity and specificity of parental recall based on the two-by-two table (Table 1):
The level of agreement/concordance is the percentage of children whose parents accurately recalled the vaccination status of their children ([TP + TN]/Total). Sensitivity of recall is the percentage of children whose parents recalled they were vaccinated and they were vaccinated according to their vaccination cards (TP/[TP + FN]). Specificity of recall is the percentage of children whose parents recalled they were unvaccinated and they were not vaccinated according to their vaccination cards (TN/[FP + TN]).
We also calculated Kappa statistics for each type of vaccine as an alternative assessment of the degree of agreement between two data sources. The Kappa statistic is a measure of reliability that takes into account the agreement expected on the basis of chance [31, 32]. A Kappa statistic ≤0.20 shows slight to poor agreement, 0.21–0.40 shows a fair agreement, 0.41–0.60 moderate agreement, 0.61–0.80 substantial agreement, and 0.81–1.00 indicates almost perfect agreement . However, the Kappa statistic is commonly affected by the prevalence of an indicator and bias/ level of disagreement which leads to a trade-off paradox (high agreement but low Kappa) [33, 34]. In response, T Byrt, J Bishop and JB Carlin  proposed the use of Prevalence and Bias Adjusted Kappa (PABAK) to avoid the paradox. This study applied both approaches of estimating Kappa statistics.
To identify the determinants of parental recall bias, we applied a series of multivariate logit regression models by vaccine types. Our binary dependent variable for recall bias took the value of 1 if the two data sources disagreed and 0 otherwise for a given vaccine. A set of individual and household characteristics were included in the model as potential determinants of recall bias. These included child age (in months), maternal age (in years), marital status, health insurance status, religion, maternal education, occupation, number of births/parity, place of residence (rural/urban district), and household wealth status in quintiles. The wealth quintiles were generated from wealth scores derived by the principal component analysis based on 42 items of household characteristics and asset ownership [36, 37]. All analyses were performed using STATA version 13.
Although we sampled 3000 households, only 2882 were interviewed (i.e. 96% response rate). Out of 2882 households interviewed in 2012, the majority of them 2816 (97.7%) reported having received a child vaccination card from the health facility, and 2460 (85.4%) were able to present the vaccination card during the survey. The average age of children with vaccination cards was 8 months, and that of their mothers was 26 years (Table 2). Most mothers were married, uninsured, Muslim, with primary or above education, and located in rural districts.
Overall vaccination coverage levels from the two data sources were generally similar (Fig. 1). However, the recall data resulted in slightly higher coverage estimates relative to vaccination cards. Most children (> 97%) received BCG, at least one dose of OPV and DPT, and about two-thirds of children aged 9 months or more received measles vaccine. However, when all children under a year of age were considered, the coverage for measles vaccine was about 40% (95% CI: 38.1–41.9%) from recall and 39% (95% CI: 37.4–41.3%) from vaccination cards.
The level of agreement between parental recall compared to cards was above 94% across vaccine types (Table 3), with minimal parental recall bias. The highest level of recall bias was for measles, that is about 5.8% (Table 3). Recall bias due to over-reporting was slightly higher than that due to under-reporting. The sensitivity of parental recall was generally high for all vaccine types, while the specificity was generally low across vaccine types except for measles (Table 3). The unadjusted Kappa statistics revealed a trade-off paradox, because the Kappa values were low for vaccines with higher level of agreement and vice versa. In contrast, the adjusted Kappa values showed perfect agreement between the two data sources with values above 0.87 for all vaccine types . Adjusted Kappa values were also consistent with the prior level of agreement and recall bias, that is relatively lower for measles vaccine than other vaccine types (Table 3).
Parental recall bias for BCG and OPV was invariant across individual and household characteristics (Table 4). In contrast, recall bias for DPT was greater for parents with younger children, for older mothers, and for those with insurance. Those with low levels of education and from Pwani region had higher chance of recall bias for measles (Table 4). Specifically, recall bias for DPT reduced by 12% for each additional month of child age, and increased by 6% with each additional year of maternal age. Mothers who were insured were almost two times more likely to inaccurately recall DPT vaccination status than uninsured mothers. Furthermore, mothers with no education were almost three times more likely than educated mothers to inaccurately recall measles vaccination status. Recall errors for measles vaccine were twice as likely among mothers residing in Pwani region than those in other regions. The results in Table 4 remained unchanged when the regression analyses were adjusted for clustering at the facility level (data not shown).
This study assessed the validity of parental recall against vaccination cards for estimating vaccination coverage using a population survey in Tanzania. We used card-based data as the reference as these data are commonly preferred to recall data in population surveys such as the DHS . To this end, we compared the two data sources and quantified the level of agreement, recall bias, and identified the determinants of recall bias. Both sources of data estimated higher coverage levels for BCG, OPV and DPT, compared to measles vaccine, reflecting the age of our sample (average 8 months), with measles vaccine typically being administered from 9 months. Our study found that the discrepancy in coverage levels between the two data sources was minimal with limited recall bias (< 6%). Recall bias was greatest for measles vaccine. The sensitivity of parental recall was generally high whereas its specificity was low except for measles. Factors such as maternal age, education, location and age of the child were significantly associated with recall bias for DPT; while the level of education and region of residence significantly associated with recall bias for measles.
Our finding that vaccination coverage was slightly overestimated with a recall-based compared to a card-based approach is similar to findings from other studies [19, 24]. The overestimation through recall may possibly be explained by social desirability bias, especially when child vaccination practice is considered socially desirable . The high level of agreement between recall and card-based data is similar to that reported in Egypt , Sudan , Bangladesh  and India . In the above studies, the accuracy of recall was more than 80% for measles and ranged from 61 to 98% for BCG and DPT. In Costa Rica, the recall and card-based data correlated at 71% .
The high sensitivity and specificity of recall for measles vaccine could be due to the level of detail provided in the respective question in the survey. The survey question for measles vaccine was different from others, as respondents were told specifically that the measles vaccine is given at the age of nine months and above. Indeed, data collection techniques are known to affect levels of recall bias . There should be a need to explicitly specify age limits in all survey questions that seeks parental recall for vaccine types. Previous studies in LMICs have reported both high sensitivity and specificity of parental recall across vaccines of at least 80 and 67% respectively [19, 20]. The high sensitivity and specificity suggests that the recall can largely identify correctly the vaccination status of being vaccinated and of being unvaccinated respectively .
We also found that parents with younger children were more likely to demonstrate recall bias for DPT than those with older children, similar to that reported in Cost Rica . Our finding that younger mothers were less likely to be subject to recall bias is consistent with findings from Bangladesh . The findings based on age partly suggests the following: first, the survey which seeks parental recall on child vaccination should follow women with slightly older children to improve the accuracy in recall; and the sample of older women with experience in child bearing should be reduced to minimise the risk of recall bias. The finding of greater accuracy on recalling measles vaccine status among educated mothers is also consistent with findings from Bangladesh , but contrary to findings in Sudan . A study by SS Coughlin  revealed that factors like age, education and socioeconomic status of the respondents, affected the accuracy of recall.
Our findings have important implications for vaccination programmes in LMICs. In these settings, the quality of administrative data is typically poor, and the estimates from population household surveys are therefore being used for coverage estimation. As long as parental recall of vaccinations is accurate, it can be used to supplement card-based data in situations where cards are missing. Our findings support the use of recall data alongside card-based estimates, in populations where missing cards are widespread, to ensure more representative coverage estimates. Although parental recall could serve as an additional source of data, efforts are also needed to encourage high card-retention rate with complete and reliable information documented since we found almost 12.3% received the cards but were unavailable during the survey. To encourage high card-retention rate can be through educating mothers on the importance of keeping the vaccination cards or any other medical records. In the case of not being given any vaccination card, health providers could be encouraged to routinely record, document and handle medical cards to all attended clients. Moreover, more efforts are needed to strengthen the administrative data which are routinely collected and relatively less expensive and providing a much more complete time series compared to population based surveys which are only carried out every few years. This study also indicates that recall bias was generally random across the surveyed population, with few systematic associations between bias and parental education, age, health insurance status and child age.
This study has the following limitations. First, we assessed the validity of parental recall by assuming card-based data were the reference, but these data are sometimes incomplete or inaccurate. While studies in high-income countries have used administrative data as a benchmark to validate parental recalls , we were unable to use these data in our setting due to its limited completeness and reliability at the time of the study (i.e. inaccuracy in population denominators, registration, or reporting) [2, 4, 9, 14] compared to card-based data. Second, our assessment was based on a sample of children under one-year age as opposed to most vaccination studies which uses children aged 12–23 months. However, our main goal was to compare the accuracy on vaccination status between two data sources rather than assessing the coverage estimates, since the later might be underestimated when using younger children. Nevertheless, further validation among older children is important given that recall bias is potentially affected by a recall time interval . Third, we used household data from a random sample of 20 households per facility to assess the validity of parental recall. It is quite possible that the sample of households did not accurately represent the entire population of women with children eligible for vaccination in the catchment area. Fourth, we were unable to compare the appropriate number of doses for DPT and OPV between data sources because the recall data were unable to distinguish the number of doses. The recall and coverage of BCG and measles vaccines were relatively easier to obtain compared to OPV and DPT which typically include three doses. Lastly, we were unable to validate the recall of parents without cards (i.e. 12.6%) as we restricted our comparison to parents with data from both sources, card and recall. However, it is conceivable that those with cards have more accurate recall than those without. A further concern is if parents with cards are systematically different to those without cards and that these differences affect recall. To validate the recalls for parents without cards, it needs to be compared to administrative data for example, and this is an important area for future research in our setting. However, if the cards are missing at random, which was the case in our dataset, then it seems reasonable to extrapolate the findings to households without cards.
This study assessed the validity of parental recalls against card-based data in Tanzania, and found most of the children’s vaccination status across vaccine types were accurately identified through parental recall. The limited recall bias for DPT and measles were associated with the mother’s age, education level, health insurance status, region location and child age. Our findings support the use of parental recall collected through surveys to identify the child vaccination status in the absence of vaccination cards. However, further research is needed to validate these findings against routine administrative data.
Demographic and Health Surveys
Expanded Programme on Immunisation
Global Alliance for Vaccines and Immunisations
Low- and middle- income countries
Multiple Indicators Cluster Surveys
Oral Polio Vaccine
Prevalence and Bias Adjusted Kappa
Sustainable Development Goal three
United Nations. The Sustain Dev Goals Report 2016. New York: United Nations; 2016.
Lim SS, Stein DB, Charrow A, Murray CJ. Tracking progress towards universal childhood immunisation and the impact of global initiatives: a systematic analysis of three-dose diphtheria, tetanus, and pertussis immunisation coverage. Lancet. 2008;372(9655):2031–46.
Bos E, Batson A. Using immunization coverage rates for monitoring health sector performance. Washington, DC: The World Bank; 2000.
Burton A, Monasch R, Lautenbach B, Gacic-Dobo M, Neill M, Karimov R, Wolfson L, Jones G, Birmingham M. WHO and UNICEF estimates of national infant immunization coverage: methods and processes. Bull World Health Organ. 2009;87(7):535–41.
Haddad S, Bicaba A, Feletto M, Fournier P, Zunzunegui MV. Heterogeneity in the validity of administrative-based estimates of immunization coverage across health districts in Burkina Faso: implications for measurement, monitoring and planning. Health Policy Plan. 2010;25(5):393–405.
Eisele TP, Rhoda DA, Cutts FT, Keating J, Ren R, Barros AJ, Arnold F. Measuring coverage in MNCH: total survey error and the interpretation of intervention coverage estimates from household surveys. PLoS Med. 2013;10(5):e1001386.
Brown J, Monasch R, Bicego G, Burton A, Boerma JT. An assessment of the quality of national child immunization coverage estimates in population-based surveys, vol. 700. Chapel Hill (North Carolina): MEASURE Evaluation; 2002.
Binyaruka P, Patouillard E, Powell-Jackson T, Greco G, Maestad O, Borghi J. Effect of paying for performance on utilisation, quality, and user costs of health Services in Tanzania: a controlled before and after study. PLoS One. 2015;10(8):e0135013.
Borghi J, Mayumana I, Mashasi I, Binyaruka P, Patouillard E, Njau I, Maestad O, Abdulla S, Mamdani M. Protocol for the evaluation of a pay for performance programme in Pwani region in Tanzania: a controlled before and after study. Implement Sci. 2013;8:80.
We would like to thank the whole P4P evaluation research team, including data collectors and field coordinators. We also acknowledge the support from all field respondents, and various officials such as Regional Medical Officers, District Medical officers, Health facility in-charges, and P4P implementing agency.
The Government of Norway funded the data collection for the programme evaluation survey where this study was nested (grant numbers: TAN-3108 and TAN 13/0005 (http://www.norad.no/en/). The funding also includes the costs of data analysis and writing of the paper. The funding bodies had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
PB designed this sub-study and performed the data analysis. PB together with JB oversaw data collection. JB designed the impact evaluation study within which this sub-study is embedded. PB wrote the first draft of the manuscript. JB involved in data interpretation, and revision of the manuscript. Both authors read and approved the final manuscript.
This study was part of the large programme evaluation in Tanzania. Ethical approval for the evaluation study was obtained from the Institutional Review Board of the Ifakara Health Institute (approval number: 1BI1IRB/38) and the Ethics Review Board of the London School of Hygiene & Tropical Medicine. Study participants provided written consent to participate in this study, requiring them to sign a written consent form that was read out to them by the interviewers. This consent form was reviewed and approved by the ethics committees prior to the start of the research.
The authors declare that they have no competing interests.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.