RESEARCH PAPER
Toward improved statistical methods for analyzing Cotinine-Biomarker health association data
Tulay Koru-Sengul 1, 2  
,  
John D. Clark 1, 3
,  
Lora E. Fleming 1, 4
,  
 
 
More details
Hide details
1
Department of Epidemiology and Public Health at Leonard Miller School of Medicine, University of Miami, Miami, Florida, USA
2
Sylvester Comprehensive Cancer Center at Leonard Miller School of Medicine, University of Miami, Miami, Florida, USA
3
Department of Internal Medicine, Kaiser Permanente, Los Angeles, California, USA
4
European Centre of Environment and Human Health (ECEHH), Peninsula College of Medicine and Dentistry, Truro, Cornwall, UK
CORRESPONDING AUTHOR
Tulay Koru-Sengul   

Department of Epidemiology and Public Health at Leonard Miller School of Medicine, University of Miami, Miami, Florida, USA
Publish date: 2011-10-03
 
Tobacco Induced Diseases 2011;9(October):11
KEYWORDS
ABSTRACT
Background:
Serum cotinine, a metabolite of nicotine, is frequently used in research as a biomarker of recent tobacco smoke exposure. Historically, secondhand smoke (SHS) research uses suboptimal statistical methods due to censored serum cotinine values, meaning a measurement below the limit of detection (LOD).

Methods:
We compared commonly used methods for analyzing censored serum cotinine data using parametric and non-parametric techniques employing data from the 1999-2004 National Health and Nutrition Examination Surveys (NHANES). To illustrate the differences in associations obtained by various analytic methods, we compared parameter estimates for the association between cotinine and the inflammatory marker homocysteine using complete case analysis, single and multiple imputation, “reverse” Kaplan-Meier, and logistic regression models.

Results:
Parameter estimates and statistical significance varied according to the statistical method used with censored serum cotinine values. Single imputation of censored values with either 0, LOD or LOD/√2 yielded similar estimates and significance; multiple imputation method yielded smaller estimates than the other methods and without statistical significance. Multiple regression modelling using the “reverse” Kaplan-Meier method yielded statistically significant estimates that were larger than those from parametric methods.

Conclusions:
Analyses of serum cotinine data with values below the LOD require special attention. “Reverse” Kaplan-Meier was the only method inherently able to deal with censored data with multiple LODs, and may be the most accurate since it avoids data manipulation needed for use with other commonly used statistical methods. Additional research is needed into the identification of optimal statistical methods for analysis of SHS biomarkers subject to a LOD.

 
REFERENCES (19)
1.
Schisterman EF, Little RJ: Opening the black box of biomarker measurement error. Epidemiology. 2010, 21 (Suppl 4): S1-S3.
 
2.
Helsel D: Nondetects and Data Analysis: Statistics for Censored Environmental Data. 2005, John Wiley & Sons.
 
3.
Helsel DR: Fabricating data: How substituting values for nondetects can ruin results, and what can be done about it. Chemosphere. 2006, 65: 2434-2439.
 
4.
Krishnamoorthy K, Mallick A, Mathew T: Model based imputation approach for data analysis in the presence of nondetectable values: Normal and Related Distributions. Annals of Occupational Hygiene. 2009, 59: 249-261.
 
5.
Armstrong BG: Effect of measurement error on epidemiological studies of environmental and occupational exposures. Occupational and Environmental Medicine. 1998, 55: 651-656.
 
6.
Rothman KJ: Methodologic frontiers in environmental epidemiology. Environmental Health Perspectives. 1993, 101 (Suppl 4): 19-21. 10.1289/ehp.93101s419.
 
7.
Clark JD, Wilkinson JD, LeBlanc WG, Dietz NA, Arheart KL, Fleming LE, Lee DJ: Inflammatory markers and secondhand tobacco smoke exposure among U.S. workers. American Journal of Industrial Medicine. 2008, 51: 626-632.
 
8.
Benowitz NL, Bernert JT, Caraballo RS, Holiday DB, Wang J: Optimal serum cotinine levels for distinguishing cigarette smokers and nonsmokers within different racial/ethnic groups in the United States between 1999 and 2004. American Journal of Epidemiology. 2009, 169: 236-248.
 
9.
NHANES, Laboratory Procedures Manual. 2001, [http://www.cdc.gov/nchs/data/n...].
 
10.
Little RJ, Rubin DB: Statistical Analysis with Missing Data. Wiley Series in Probability and Statistics. 2002, Second.
 
11.
Donders AR, van der Heijden GJ, Stijnen T, Moons KG: Review: a gentle introduction to imputation of missing values. Journal of Clinical Epidemiology. 2006, 59: 1087-1091.
 
12.
Barnard J, Meng XL: Applications of multiple imputation in medical studies: from AIDS to NHANES. Statistical Methods in Medical Research. 1999, 8: 17-36.
 
13.
Harel O, Zhou XH: Multiple imputation: review of theory, implementation and software. Statistics in Medicine. 2007, 26: 3057-3077.
 
14.
Rubin D: Multiple Imputation For Nonresponse In Surveys. 1987, John Wiley & Sons Inc.
 
15.
Schafer JL: Analysis of Incomplete Multivariate Data. CRC Monographs on Statistics & Applied Probability. 1997.
 
16.
Schafer J: Analyzing the NHANES III Multiply Imputed Data Set: Methods and examples. 2001, [http://www.nber.org/nhanes/nha...].
 
17.
Gillespie BW, Chen Q, Reichert H, Franzblau A, Hedgeman E, Lepkowski J, Adriaens P, Demond A, Luksemburg W, Garabrant DH: Estimating Population Distributions When Some Data Are Below a Limit of Detection by Using a Reverse Kaplan-Meier Estimator. Epidemiology. 2010, 21: S64-S70.
 
18.
Albert PS, Harel O, Perkins N, Browne R: Use of Multiple Assays Subject to Detection Limits With Regression Modeling in Assessing the Relationship Between Exposure and Outcome. Epidemiology. 2010, 21: S35-S43.
 
19.
The Third National Health and Nutrition Examination Survey, (NHANES III, 1988-1994). Multiply Imputed Data Set (Series 11, No. 7A). Accessed 7/7/11, [ftp://ftp.cdc.gov/pub/Health_Statist...].
 
 
CITATIONS (4):
1.
Phthalate and bisphenol A exposure among pregnant women in Canada — Results from the MIREC study
Tye E. Arbuckle, Karelyn Davis, Leonora Marro, Mandy Fisher, Melissa Legrand, Alain LeBlanc, Eric Gaudreau, Warren G. Foster, Voleak Choeurng, William D. Fraser
Environment International
 
2.
Spatial cytokine distribution following traumatic injury
Holly N. Currie, Matthew S. Loos, Julie A. Vrana, Kristen Dragan, Jonathan W. Boyd
Cytokine
 
3.
Self-reported smoking, serum cotinine, and blood DNA methylation
Yan Zhang, Ines Florath, Kai-Uwe Saum, Hermann Brenner
Environmental Research
 
4.
Serum cotinine as a biomarker of tobacco exposure and the association with treatment response in early rheumatoid arthritis
Leann B. Maska, Harlan R. Sayles, James R. O'Dell, Jeffrey R. Curtis, S. Louis Bridges, Larry W. Moreland, Stacey S. Cofield, Ted R. Mikuls
Arthritis Care & Research
 
eISSN:1617-9625