PT - JOURNAL ARTICLE AU - Hoggart, Clive AU - Brennan, Paul AU - Tjonneland, Anne AU - Vogel, Ulla AU - Overvad, Kim AU - Østergaard, Jane Nautrup AU - Kaaks, Rudolf AU - Canzian, Federico AU - Boeing, Heiner AU - Steffen, Annika AU - Trichopoulou, Antonia AU - Bamia, Christina AU - Trichopoulos, Dimitrios AU - Johansson, Mattias AU - Palli, Domenico AU - Krogh, Vittorio AU - Tumino, Rosario AU - Sacerdote, Carlotta AU - Panico, Salvatore AU - Boshuizen, Hendriek AU - Bueno-de-Mesquita, H. Bas AU - Peeters, Petra H.M. AU - Lund, Eiliv AU - Gram, Inger Torhild AU - Braaten, Tonje AU - Rodríguez, Laudina AU - Agudo, Antonio AU - Sánchez-Cantalejo, Emilio AU - Arriola, Larraitz AU - Chirlaque, Maria-Dolores AU - Barricarte, Aurelio AU - Rasmuson, Torgny AU - Khaw, Kay-Tee AU - Wareham, Nicholas AU - Allen, Naomi E. AU - Riboli, Elio AU - Vineis, Paolo TI - A Risk Model for Lung Cancer Incidence AID - 10.1158/1940-6207.CAPR-11-0237 DP - 2012 Jun 01 TA - Cancer Prevention Research PG - 834--846 VI - 5 IP - 6 4099 - http://cancerpreventionresearch.aacrjournals.org/content/5/6/834.short 4100 - http://cancerpreventionresearch.aacrjournals.org/content/5/6/834.full SO - Cancer Prev Res (Phila)2012 Jun 01; 5 AB - Risk models for lung cancer incidence would be useful for prioritizing individuals for screening and participation in clinical trials of chemoprevention. We present a risk model for lung cancer built using prospective cohort data from a general population which predicts individual incidence in a given time period. We build separate risk models for current and former smokers using 169,035 ever smokers from the multicenter European Prospective Investigation into Cancer and Nutrition (EPIC) and considered a model for never smokers. The data set was split into independent training and test sets. Lung cancer incidence was modeled using survival analysis, stratifying by age started smoking, and for former smokers, also smoking duration. Other risk factors considered were smoking intensity, 10 occupational/environmental exposures previously implicated with lung cancer, and single-nucleotide polymorphisms at two loci identified by genome-wide association studies of lung cancer. Individual risk in the test set was measured by the predicted probability of lung cancer incidence in the year preceding last follow-up time, predictive accuracy was measured by the area under the receiver operator characteristic curve (AUC). Using smoking information alone gave good predictive accuracy: the AUC and 95% confidence interval in ever smokers was 0.843 (0.810–0.875), the Bach model applied to the same data gave an AUC of 0.775 (0.737–0.813). Other risk factors had negligible effect on the AUC, including never smokers for whom prediction was poor. Our model is generalizable and straightforward to implement. Its accuracy can be attributed to its modeling of lifetime exposure to smoking. Cancer Prev Res; 5(6); 834–46. ©2012 AACR.