education enhances comprehension of survey questions, ability to recall and record life events, and familiarity
with written documentation (Bollen et al., 2001; Zulu & Dodoo, 1998). Higher education may also be
associated with better interaction with enumerators and increased access to official health records, which
further reduces reporting errors.
Economic status was another robust determinant of reporting consistency. Women from the richest households
reported more accurately, with the lowest proportions of large mismatches, while women from poorer
households exhibited significantly greater discrepancies. This association mirrors findings from earlier NFHS
assessments and other large-scale surveys in low- and middle-income countries (LMICs), where poverty often
correlates with lower literacy, weaker access to health infrastructure, and greater barriers in communicating
with survey fieldworkers (Curtis & Blanc, 1997; Pullum, 2006).
The rural–urban divide also played a notable role. Urban women demonstrated higher reporting accuracy than
rural women, with the gap exceeding 5 percentage points in NFHS-V. This pattern may be linked to higher
literacy, stronger health system penetration, and wider availability of medical records in urban areas, as
reported in other evaluations of survey data quality in India (Chandrasekhar et al., 2017; Guilmoto, 2012). In
rural and remote areas, logistical constraints, interviewer workload, and cultural barriers may increase the
likelihood of incomplete or inconsistent reporting.
By contrast, religion and caste displayed weaker associations with reporting quality. While some variations
were observed, for example, slightly higher mismatches among Muslims and Scheduled Tribes, these effects
were relatively modest. The attenuation of religious effects between NFHS-IV and NFHS-V suggests that data
collection practices may have become more standardized across groups, reducing disparities. However, the
persistently higher mismatch rates among Scheduled Tribes point to challenges linked to geographical
isolation, language barriers, and enumeration difficulties, which align with broader discussions on survey
undercoverage of marginalized populations in India (Desai & Dubey, 2011; Borooah, 2005).
In addition to these sociodemographic determinants, important spatial patterns were evident. Mismatches were
more prevalent in central and southern India, while reporting accuracy was comparatively higher in the
Northeast. Several explanations may account for these regional differences. Central and southern states such as
Madhya Pradesh, Chhattisgarh, and Andhra Pradesh have larger rural populations, higher proportions of
Scheduled Castes and Tribes, and greater socioeconomic inequality, all factors linked to weaker reporting
accuracy. These regions also experienced historically higher levels of fertility and child mortality, which may
compound recall difficulties, particularly among older women. By contrast, the Northeast is characterized by
smaller populations, stronger community-based networks, and comparatively higher literacy rates, particularly
among women (Dutta, 2020). Tighter kinship structures and smaller family sizes in many northeastern states
may also facilitate more accurate recall of fertility histories. Moreover, survey implementation in smaller states
may allow fieldworkers to provide closer supervision and adapt more effectively to local contexts, thereby
reducing enumeration errors. These findings align with prior studies showing that regional heterogeneity in
survey quality often reflects differences in administrative capacity, demographic histories, and social
organization (Casterline & el-Zeini, 2014).
Overall, these results reaffirm that survey data quality is not only a technical issue but also a reflection of
broader social inequalities and regional disparities. The groups most vulnerable to inconsistent reporting, older,
less educated, rural, and economically disadvantaged women in central and southern states, are also those
often-facing structural disadvantages in health and social outcomes. This has important implications for both
research and policy. Inaccuracies in reporting fertility histories can bias estimates of demographic indicators
such as fertility, mortality, and population projections, which in turn inform program design and resource
allocation (United Nations Population Fund [UNFPA], 2019; United Nations, 2017).
To address these challenges, targeted survey strategies are essential. Enhanced interviewer training, culturally
adapted tools, and simplified questionnaires have been shown to improve data quality in complex survey
contexts (Mensch et al., 2014; Groves et al., 2009). For older and less educated respondents, incorporating
visual aids, calendar methods, or community-based verification may help reduce recall error. In addition,