
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Research Articles |
Authors' Affiliations: Departments of 1 Bioinformatics and Computational Biology and 2 Thoracic/Head and Neck Medical Oncology, The University of Texas M. D. Anderson Cancer Center, Houston, Texas
Requests for reprints: Li Zhang, M. D. Anderson Cancer Center, Houston, TX 77030. E-mail: Lzhangli{at}mdanderson.org, or Li Mao, M. D. Anderson Cancer Center. E-mail: lmao{at}mdanderson.org.
| Abstract |
|---|
|
|
|---|
|
|
|---|
To understand why smokers and former smokers have differential responses to agents for preventing or treating lung cancer, we analyzed and compared global gene expression profiles in three independent cancer-free cohorts comprising current, former, and never smokers.
| Materials and Methods |
|---|
|
|
|---|
Bronchial brush processing and RNA extraction
For samples in MDACC-1 and MDACC-2, brushes were placed on bronchoscopy in 3 mL of plain DMEM culture (Life Technologies, Inc.) in sterile tissue culture tubes and stored at 4°C for processing the same day. The tubes were vortexed lightly to detach cells from the brushes. After removal of the brush from the tube, the cell suspension was centrifuged at 2,500 rpm for 5 min. Cell pellets were then washed with 2 mL of PBS twice, and an aliquot of material was saved at –80°C until RNA extraction. For the microarray analysis, cells from the six brushing sites of the same individual were pooled together for RNA extraction. We used TRIzol reagent (Invitrogen) for total RNA extraction according to the manufacturer's protocol, with a yield of 1 to 4 µg of total RNA per sample. Integrity of the RNA was confirmed by running it on an RNA 6000 Neno LabChip (Agilent Technologies). The samples in BMC data set were processed similarly as described (12), except that a single-round amplification protocol was used.
cRNA preparation and microarray hybridization
The first and second cDNA strands were synthesized as previously described (13). The first reverse transcription was done in the absence of biotin-labeled ribonucleotides, resulting in unlabeled cRNA, which was then used as starting material for the second cycle. In the second cycle, the first and second cDNA strands were synthesized. The second transcription was done in the presence of biotin-labeled-ribonucleotides, resulting in labeled cRNA. The cRNA was fragmented and checked by gel electrophoresis, as reported earlier (13). The Affymetrix GeneChip system was used for hybridization, staining, and imaging of the probe arrays. Hybridization cocktails of 300 µL, each containing 15 µg of cRNA and exogenous hybridization controls, were prepared as previously described and hybridized to U133A or U133A plus GeneChips (Affymetrix) overnight at 42°C. Hybridized fragments were detected with streptavidin linked to phycoerythrin (Molecular Probes). GeneChips were scanned and imaged using Affymetrix Microarray Analysis Suite version 5.0.
Microarray data normalization
There were two array types used in this study: U133A and U133 Plus 2. The U133A array contains
500,000 distinct probe features interrogating 18,400 human transcripts and variants, including 13,902 well-characterized genes. The U133 plus 2.0 array contains all probe features that are on the U133A array. In addition, there are 9,921 new probe sets representing 6,500 new genes. To facilitate straightforward comparison of the data, we used only the probes that are common to both array types. We also ignored data of MM probes. We used PM probes common to U133A and U133 Plus 2 arrays to perform quantile normalization on the probe level data (14). The procedure was done so that the distributions of the probe signal intensities of a sample are identical for all samples within a data set. Then, we used PDNN model (15) to quantify the gene expression values from the normalized probe signal intensity data. We then applied median-centering normalization on the probe set level data so that the median of expression values of a sample was made to be the same for all the samples in all of the data sets.
Identification of differentially expressed genes
Differential expression was identified to be similar to that described by Wang et al. (16). We used Z values (defined below) to assess differentially expressed genes between current and never or former smokers, in whom the magnitude of the Z values is assumed to represent the effect of smoking cessation. For a given data set containing nA and nB samples in groups A and B, respectively, we compute the following test statistic Z for each probe set:
![]() | (A) |
is the estimated standard deviation (SD) of D:
![]() | (B) |
A2 and
B2 are estimated variances of log expression values in groups A and B, respectively. These variances were estimated using Loess fit between the mean log expression values and the SD of the log expression values. The underlying assumption is that the mean and the SD are related by a smooth function, which allows the analysis method to treat the SD as if it were known.
Combining Z values from different data sets
The Z values obtained from the three data sets can be combined using the following formula:
![]() | (C) |
![]() | (D) |
1,
2, and
3 were calculated using Eq. B from MDACC-1, MDACC-2, and BMC data sets, respectively. The test statistic Z is supposed to form a T distribution if the log expression values are normally distributed. However, the observed data slightly deviate from the normal distribution because they contain more extreme values. Consequently, the significance of Z can be overestimated.
To alleviate the bias due to the assumption of normal distribution, we used permuted data to compute Z*. The expression values are randomly permuted for each probe set within each data set. The permutation was done 10 times to construct an empirical cumulative distribution function of Z*. This distribution was assumed to be the distribution of Z values under null hypothesis (i.e., no differential expression), and it was used to estimate the P values and the false discovery rate associated with Z values. The permutation was done within each of the data sets, but never across the data sets. Note that other than the permutation step, our method is the same as that described by Wang et al. (16).
| Results |
|---|
|
|
|---|
|
|
|
|
1,200 probe sets, which is our estimated number of genes that are differentially expressed between the former smokers and current smokers. Only a subset of these genes is identifiable, however. According to the BUM method (17), we found 345 probe sets that were differentially expressed at a P value of <0.01, for which the false discovery rate was estimated to be 32%. Of the 345 probe sets, 176 have a >2-fold difference in expression (details of these 176 probe sets are shown in Supplementary Table S2). These 176 probe sets represent 145 nonredundant significantly differentially expressed genes (>2-fold change; P < 0.01). These 145 genes include 9 genes (Table 2) with consistent and significant changes in each of the three data sets (P < 0.01; >2-fold change). Eight of the nine genes are down-regulated after smoking cessation; one is up-regulated. To test the general accuracy of our microarray measurements, we compared them with reverse transcription-PCR measurements of a selected panel of genes, finding that the reverse transcription-PCR and microarray measurements were highly correlated, 96% [e.g., in the case of ALDH3A1 (Supplementary Fig. S1), which is the gene with the largest change between former and current smokers (Table 2)]. Furthermore, although not calculated, the false discovery rate for the subset of 176 probe sets should be lower than that (32%) estimated for the 345 probe sets, and the false discovery rate for the nine changed genes that were validated across three data sets should be lower still because each subset adds new criteria that increase reliability.
|
11,000 probe sets. This number is >9 times greater than the number detected in the comparison between former and current smokers (Fig. 3). We found 591 nonredundant genes with statistically significant changes (>2-fold change and P < 0.01) in pooled data of the three data sets, a group that is >4 times larger than the group of such differentially expressed genes detected in the comparison between current and former smokers. Of the 145 significantly changed genes between current and former smokers, 77 are consistent with, and 68 are not consistent with, the 591 such genes between current and never smokers (Supplementary Table S2). The nine genes with consistent and significant changes between former and current smokers in each of the three data sets are in the subset of 77 common, significantly changed genes. Figure 4B compares former smokers with never smokers.
|
|
| Discussion |
|---|
|
|
|---|
Seven of the eight validated genes overexpressed in current smokers—CYP1B1, four AKRs, ALDH3A1, and NQO1 (Table 2)—are involved in drug and/or carcinogen metabolism (9, 10, 18–27). Polycyclic aromatic hydrocarbons in tobacco smoke are known to bind to and activate the aryl hydrocarbon receptor and thus induce CYP1B1 (10). CYP1B1 expression is of special interest because it may contribute both to increased drug metabolism and to carcinogenesis of the aerodigestive tract (1, 18–20). The metabolic clearance of docetaxel, tamoxifen, gefitinib, erlotinib, and other cancer prevention and therapy drugs is enhanced by YP1B1 (9, 21–23). Up-regulation of CYP1B1 and the six other validated overexpressed metabolizing genes by smoking is likely involved in the adverse interactions between smoking and drugs for lung cancer prevention and therapy; smoking cessation down-regulates these gene expressions and thus may reduce or eliminate the adverse drug interactions.
Four of the eight most-up-regulated genes we detected in current smokers (Table 2) are members of the AKR family (AKR1C1, AKR1C2, AKR1C3, and AKR1B10; ref. 28). AKR1B10 is overexpressed in non–small-cell lung cancer and squamous metaplasia in association with smoking (24, 26). AKR1C1, AKR1C2, and AKR1C3 are known to be involved in tobacco carcinogen and/or drug metabolism. AKR1C1 overexpression is correlated with a poor prognosis of non–small-cell lung cancer and is associated with chemotherapeutic drug resistance (25, 29). Data suggest that a potential role of AKR1B10 in retinoic acid signaling (30) may be a factor in the negative effects of retinoic acid (retinoids) and its relative β-carotene in smokers in chemoprevention trials (3–7). Several studies have shown that overexpression of AKR1C1, AKR1C2, or AKR1C3 contributes to the resistance of various tumor types, including lung cancer, to cisplatin-based chemotherapy (25, 31–33).
Various biases can produce inconsistencies between similar data sets. These biases can stem from differences in age, race, sex, smoking history, and sample processing. Regarding sample processing, for example, MDACC-1 and MDACC-2 samples involved two rounds of RNA amplification versus a single round in the BMC set. Two rounds of amplification are known to cause loss of signals for probes that target far away from the 3' end of mRNA sequences. The consistent changes in smoking cessation–related genes in all three independent data sets support the robustness of our present findings.
Gene expression profiling in bronchoscopy specimens offers a direct assessment of the effects of cigarette smoking in the lungs. Gene expression patterns vary greatly between individuals, however, because of genetic variations and different environmental influences. A report by Spira et al. (on a relatively broad array of differentially expressed metabolizing and antioxidant genes in current versus former smokers; ref. 12) provided us with the opportunity to increase the robustness of our gene expression analyses by adding the BMC data set to our MDACC data sets. As we prepared our present results for publication, the Spira group published another report (34) that extended their earlier study, as do the complementary and confirmatory findings we report here. We were able to identify the specific drug-metabolizing genes involved in smoking-drug interactions, including the overexpressed genes in current versus former smokers, because of the cross validation and increased statistical power provided by adding the BMC data set (12) to our MDACC-1 and MDACC-2 data sets. The combined effect of these reports is to increase the robustness of their interrelated findings and thus their appeal for hypothesis generation.
Our results also show that the scope of genetic changes following smoking cessation is much smaller than that associated with chronic smoking (Figs. 2 and 4), possibly explaining the persistent high lung cancer risk in former smokers (35). Surprisingly at the time (
10 years ago), we and others previously found in assessments limited to specific genetic alterations that smoking-related genetic changes persisted after smoking cessation in a population similar to those of MDACC-1, MDACC-2, and BMC (36, 37). Showing similar genetic alterations in current and former smokers, results of the more sophisticated global genomic profiling approach of our present and other studies are consistent with the earlier findings (12, 34).
Our findings underscore the importance of smoking status in clinical trials, showing that smoking effects on metabolizing genes potentially can interfere with drugs in standard or investigational chemoprevention or therapy not only in the lung but in other sites as well. Future research directions should include (a) increased monitoring of smoking status and increased smoking cessation efforts in any trial setting because of adverse smoking effects on drug uptake and metabolism, and (b) the development of new dosing and targeted approaches to counteract adverse smoking-drug interactions in the lung. New targeted approaches should consider the signaling pathways of drug-metabolizing genes that were validated in this study.
| Disclosure of Potential Conflicts of Interest |
|---|
|
|
|---|
| Footnotes |
|---|
Note: Supplementary data for this article are available at Cancer Prevention Research Online (http://cancerprevres.aacrjournals.org/).
Received for publication December 22, 2007.
Revision received February 18, 2008.
Accepted February 21, 2008
| References |
|---|
|
|
|---|
2. Cancer facts and figures, 2006 [article on the Internet]. American Cancer Society 2006. Available from: http://www.cancer.org/downloads/STT/CAFF2006PWSecured.pdf].
3. Mayne ST Lippman SMCigarettes: a smoking gun in cancer chemoprevention. J Natl Cancer Inst 2005;97:1319–21.
4. Gritz ER, Dresler C, Sarna LSmoking, the missing drug interaction in clinical trials: ignoring the obvious. Cancer Epidemiol Biomarkers Prev 2005;14:2287–93.
5. The
-Tocopherol, β Carotene Cancer Prevention Study GroupThe effect of vitamin E and β carotene on the incidence of lung cancer and other cancers in male smokers. N Engl J Med 1994;330:1029–35.
6. Omenn GS, Goodman GE, Thornquist MD et al Effects of a combination of β carotene and vitamin A on lung cancer and cardiovascular disease. N Engl J Med 1996;334:1150–5.
7. Lippman SM, Lee JJ, Karp DD et al Randomized phase III intergroup trial of isotretinoin to prevent second primary tumors in stage I non-small-cell lung cancer. J Natl Cancer Inst 2001;93:605–18.
8. Zhang Z, Xu F, Wang S et al Influence of smoking on histologic type and the efficacy of adjuvant chemotherapy in resected non-small cell lung cancer. Lung CancerEpub ahead of print.
9. Hamilton M, Wolf JL, Rusk J et al Effects of smoking on the pharmacokinetics of erlotinib. Clin Cancer Res 2006;12:2166–71.
10. Port JL, Yamaguchi K, Du B et al Tobacco smoke induces CYP1B1 in the aerodigestive tract. Carcinogenesis 2004;25:2275–81.
11. Lee JS, Lippman SM, Benner SE et al Randomized placebo-controlled trial of isotretinoin in chemoprevention of bronchial squamous metaplasia. J Clin Oncol 1994;12:937–45.
12. Spira A, Beane J, Shah V et al Effects of cigarette smoke on the human airway epithelial cell transcriptome. Proc Natl Acad Sci U S A 2004;101:10143–8.
13. Gold D, Coombes K, Medhane D et al A comparative analysis of data generated using two different target preparation methods for hybridization to high-density oligonucleotide microarrays. BMC Genomics 2004;5:2.[CrossRef][Medline]
14. Bolstad BM, Irizzary RA, Astrand M et al A comparison of normalization methods for high density oligonucleotide array data based on bias and variance. Bioinformatics 2003;19:185–93.
15. Zhang L, Miles MF, Aldape KDA model of molecular interactions on short oligonucleotide microarrays. Nat Biotechnol 2003;21:818–21.[CrossRef][Medline]
16. Wang J, Coombes KR, Highsmith WE et al Differences in gene expression between B-cell chronic lymphocytic leukemia and normal B cells: a meta-analysis of three microarray studies. Bioinformatics 2004;20:3166–78.
17. Pounds S Morris SWEstimating the occurrence of false positives and false negatives in microarray studies b approximating and partitioning the empirical distribution of P values. Bioinformatics 2003;19:1236–42.
18. Mahadevan B, Luch A, Atkin J et al Inhibition of human cytochrome P450 1B1 further clarifies its role in the activation of dibenzo[a,l]pyrene in cells in culture. J Biochem Mol Toxicol 2007;21:101–9.[CrossRef][Medline]
19. Roos PH Bolt HMCytochrome P450 interactions in human cancers: new aspects considering CYP1B1. Expert Opin Drug Metab Toxicol 2005;1:187–202.[CrossRef][Medline]
20. Purnapatre K, Khattar SK, Saini KSCytochrome P450s in the development of target-based anticancer drugs. Cancer Lett 2008;259:1–15.[CrossRef][Medline]
21. Sissung TM, Price DK, Sparreboom A et al Pharmacogenetics and regulation of human cytochrome P450 1B1: implications in hormone-mediated tumor metabolism and a novel target for therapeutic intervention. Mol Cancer Res 2006;4:135–50.
22. Li J, Zhao M, He P et al Differential metabolism of gefitinib and erlotinib by human cytochrome P450 enzymes. Clin Cancer Res 2007;13:3731–7.
23. Rochat B, Morsman JM, Murray GI et al Human CYP1B1 and anticancer agent metabolism: mechanism for tumor-specific drug inactivation? JPET 2001;296:537–41.
24. Fukumoto S, Yamauchi N, Moriguchi H et al Overexpression of the aldo-keto reductase family protein AKR1B10 is highly correlated with smokers' non-small cell lung carcinomas. Clin Cancer Res 2005;11:1776–85.
25. Penning TMAKR1B10: a new diagnostic marker of non-small cell lung carcinoma in smokers. Clin Cancer Res 2005;11:1687–90.
26. Woenckhaus M, Klein-Hitpass L, Grepmeier U et al Smoking and cancer-related gene expression in bronchial epithelium and non-small-cell lung cancers. J Pathol 2006;210:192–204.[CrossRef][Medline]
27. Sladek NE, Kollander R, Sreerama L et al Cellular levels of aldehyde dehydrogenases (ALDH1A1 and ALDH3A1) as predictors of therapeutic responses to cyclophosphamide-based chemotherapy of breast cancer: a retrospective study. Rational individualization of oxazaphosphorine-based cancer chemotherapeutic regimens. Cancer Chemother Pharmacol 2002;49:309–21.[CrossRef][Medline]
28. Penning TM Drury JEHuman aldo-keto reductases: function, gene regulation, and single nucleotide polymorphisms. Arch Biochem Biophys 2007;464:241–50.[CrossRef][Medline]
29. Hsu NY, Ho HC, Chow KC et al Overexpression of dihydrodiol dehydrogenase as a prognostic marker of non-small cell lung cancer. Cancer Res 2001;61:2727–31.
30. Crosas B, Hyndman D, Gallego O, Martras S, Pares X, Flynn TGHuman aldose reductase and human small intestine aldose reductase are efficient retinal reductases: consequences for retinoid metabolism. Biochem J 2003;373:973–9.[CrossRef][Medline]
31. Deng HB, Parekh HK, Chow KC, Simpkins HIncreased expression of dihydrodiol dehydrogenase induces resistance to cisplatin in human ovarian carcinoma cells. J Biol Chem 2002;277:15035–43.
32. Deng HB, Adikari M, Parekh HK, Simpkins HUbiquitous induction of resistance to platinum drugs in human ovarian cervical, germ-cell and lung carcinoma tumor cells overexpressing isoforms 1 and 2 of dihydrodiol dehydrogenase. Cancer Chemother Pharmacol 2004;54:301–7.[Medline]
33. Chen J, Adikari M, Pallai R, Parekh HK, Simpkins HDihydrodiol dehydrogenases regulate the generation of reactive oxygen species and the development of cisplatin resistance in human ovarian carcinoma cells. Cancer Chemother Pharmacol 2007Epub ahead of print 17661040.
34. Beane J, Sebastiani P, Liu G, Brody JS, Lenburg ME, Spira AReversible and permanent effects of tobacco smoke exposure on airway epithelial gene expression. Genome Biol 2007;8:R201.[CrossRef][Medline]
35. Tong L, Spitz MR, Fueger JJ et al Lung carcinoma in former smokers. Cancer 1996;78:1004–10.[CrossRef][Medline]
36. Mao L, Lee JS, Kurie JM et al Clonal genetic alterations in the lung of current and former smokers. J Natl Cancer Inst 1997;89:857–62.
37. Wistuba II, Lam S, Behrens C et al Molecular damage in the bronchial epithelium of current and former smokers. J Natl Cancer Inst 1997;89:1366–73.
Commentary
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |