Search
Advanced Search
Share this Article info
  • Facebook StumbleUpon Connotea CiteULike Bibliography

Open Access

Correspondence

Error and Bias in Geocoding School and Students’ Home Addresses

Eric A. Whitsel

Department of Epidemiology, School of Public Health, University of North Carolina, Chapel Hill, Chapel Hill, North Carolina, E-mail:ewhitsel@email.unc.edu

Zandbergen and Green (2007) recently described the effect of positional error on the distance between geocoded addresses and major roads, an often-used proxy for traffic-related exposures. They found a 200–500 m range of mean positional errors in their study of 126 Orange County, Florida, public school addresses, a somewhat higher range than that associated with geocodes assigned by four commercial vendors to a larger variety and number of street addresses in the 48 contiguous U.S. states (Whitsel et al. 2006). In both studies, however, the ranges exceeded commonly used thresholds for identifying those at greatest potential risk of traffic-related exposures, raising due cause for concern.

Zandbergen (2007) found that the use of such low thresholds to define traffic-related exposure surrogates leads to the consistent overestimation of the number of Orange County school children at risk. In this recent study (Zandbergen and Green 2007), the finding has been extended to the schools the children attend. To explain the overestimates, Zandbergen and Green illustrated the idiosyncratic positioning of schools and homes—both within land parcels and along street segments—and the uniformly higher percentage of false positive versus negative determinations of whether the geocoded locations were inside or outside the 50–1,000-m buffer radii examined in their studies.

The collective findings of Zandbergen and Green (2007) nonetheless differ from those based on a previously described 5% random sample of 2,608 street addresses from the Environmental Epidemiology of Arrhythmogenesis in WHI (EEAWHI) (Whitsel et al. 2006). In that study, we found that the fraction of participants’ addresses determined to be < 100 m from the nearest highway was relatively constant across mean positional errors of 150–600 m, a finding driven by the counterbalance of approximately equal false positive and negative rates over the same range. The sensitivity and specificity of the 100-m threshold tested in EEAWHI—one-fifth the minimum distance to schools deemed acceptable by Zandbergen and Green—were also around 90% at positional errors of 250–300 m. Moreover, even when the sensitivity and specificity of the 100-m threshold exceeded 90%, its strength of association with coronary heart disease was still underestimated, albeit in the absence of confounding and under the assumption of nondifferential misclassification.

It is tempting to generalize about the magnitude of error and direction of bias observed by Zanbergen and Green (2007)—to students’ school and home addresses outside Orange County, or more generally to epidemiologic measures of environmental exposure–health outcome association—but the most prudent course of action may be to wait until the external validity of their potentially important findings is established.

References Top

  1. Whitsel EA, Quibrera PM, Smith RL, Catellier DJ, Liao D, Henley AC, et al. 2006. Accuracy of commercial geocoding: assessment and implications Epidemiol Perspect Innov 3(8).10.1186/1742-5573-3-8 [Online 20 July 2006] Find this article online
  2. Zandbergen PA. 2007. Influence of geocoding quality on environmental exposure assessment of children living near high traffic roads BMC Public Health 7(37).10.1186/1471-2458-7-37 [Online 16 March 2007] Find this article online
  3. Zandbergen PA, Green JW. 2007. Error and bias in determining exposure potential of children at school locations using proximity-based GIS techniques Environ Health Perspect 115:1363–1370.17805429 Find this article online
Post Your Note (For Public Viewing)
Compose Your Note
 
Declare any competing interests.
Add a note to this text.
Please follow our guidelines for notes and comments and review our competing interests policy. Comments that do not conform to our guidelines will be promptly removed and the user account disabled. The following must be avoided:
  • Remarks that could be interpreted as allegations of misconduct
  • Unsupported assertions or statements
  • Inflammatory or insulting language
Add a note to this text.
You must be logged in to add a note to an article. You may log in by clicking here or cancel this note.
Add a note to this text.
You cannot annotate this area of the document. Close
Add a note to this text.
You cannot create an annotation that spans different sections of the document; please adjust your selection.
Close
Rate This Article
Please follow our guidelines for rating and review our competing interests policy. Comments that do not conform to our guidelines will be promptly removed and the user account disabled. The following must be avoided:
  1. Remarks that could be interpreted as allegations of misconduct
  2. Unsupported assertions or statements
  3. Inflammatory or insulting language
Compose Your Annotation
 
Declare any competing interests.