INTRODUCTION

Chronic obstructive pulmonary disease (COPD) is a complex respiratory condition characterized by persistent and progressive airflow obstruction, resulting from abnormalities in the airways (such as bronchitis and bronchiolitis) or alveoli (emphysema)1. COPD is a major global health concern with an estimated prevalence of 12.16% worldwide2. It is among the top three causes of death, claiming approximately 3.2 million lives annually3. The primary pulmonary symptoms of COPD include dyspnea, breathlessness, cough and sputum, and chest tightness4. Even for those with mild COPD, the symptoms can be significant5. COPD can also lead to various extrapulmonary effects, such as weight loss, malnutrition, and skeletal muscle dysfunction6. These comorbidities significantly reduce both the survival time and quality of life for affected individuals. Although the existing treatment methods for COPD cannot reverse the course of the disease, they can effectively alleviate the clinical symptoms of patients. Drug treatment mainly includes bronchodilators. For glucocorticoids and antibiotics, non-drug treatment includes oxygen therapy, non-invasive ventilation, pulmonary rehabilitation, and surgical treatment. While the present treatment methods for COPD cannot reverse the course of the disease, they can effectively alleviate the clinical symptoms of patients7. However, there is still a pressing need to develop new and effective treatments to halt the decline in lung function in COPD patients. Encouragingly, research suggests that stem cell therapy may hold promise in improving lung function8.

Smoking is a well-established independent risk factor for COPD. Smokers are 2.55 times more likely to develop COPD compared to non-smokers2, as tobacco smoke leads to excessive mucus secretion in the trachea, alveolar wall destruction, and pulmonary fibrosis9. Despite the widely publicized harm of smoking, about 1.7% of pregnant women smoke during pregnancy, resulting in an increased risk to their fetuses’ lung health10,11. McEvoy et al.12 demonstrated a significant decrease in the time to peak tidal expiratory flow to expiratory time ratio (TPTEF/ET) among infants born to smoking mothers. The Hayatbakhsh et al.13 observational study of 1185 males and 1224 females found that male offspring exposed in utero to maternal smoking exhibited significant reductions in forced expiratory volume (FEV1) and forced expiratory flow (FEF) into early adulthood13, while the impact was not observed in female offspring. Beyer et al.14 suggested an association between maternal smoking during pregnancy and adult offspring’s COPD in a cohort of 291 patients, and this study was conducted using a structured questionnaire, which had significant confounding factors. Although these studies imply that maternal smoking during pregnancy can have lasting detrimental effects on offspring lung health and potentially increase the risk of COPD, they are observational and therefore susceptible to confounding bias and heterogeneous results. There is still a lack of robust evidence demonstrating an association between maternal smoking during pregnancy and offspring COPD.

Mendelian randomization (MR) is a causal inference method that utilizes genetic variation. The fundamental principle of MR is to infer the causal association of an exposure on an outcome using the effects of randomly assigned genotypes on phenotypes observed in natural populations15. As genetic variation is present at birth and remains stable throughout the life cycle, the resulting association obtained by MR is not affected by reverse causation or confounders16. Considering the limited evidence for an association between maternal smoking during pregnancy and offspring COPD, our study aimed to investigate this association using MR.

In this study, we adopted a two-sample MR approach to analyze the summary data from the genome-wide association study (GWAS) database. Based on the three assumptions of MR, we assessed the potential association between maternal smoking around birth as the exposure variable and offspring COPD as the outcome variable, following the STROBE-MR guidelines for conducting and reporting MR studies.

METHODS

Study design and data sources

This study conducted a two-sample MR analysis based on summary-level data, adhering to three main assumptions: 1) Relevance: genetic variations must be strongly associated with the exposure; 2) Independence: genetic variations should be independent of potential confounders; and 3) Exclusion restriction: genetic variations influence the outcome solely through the exposure, without alternative pathways (Supplementary file Figure S1). A directed acyclic graph (DAG) was drawn to illustrate the MR study design (Figure 1). The GWAS data used in the study were downloaded from the Integrative Epidemiology Unit (IEU) open GWAS project17. Developed by the MRC IEU at the University of Bristol, this public access, manually curated database complies summary statistics encompassing hundreds of billions of genetic associations from over 50000 complete GWAS datasets. These data represent different human genotypes and disease outcomes across diverse populations, enabling researchers to select appropriate genetic variations as instrumental variables (IVs), assess their association strength with exposure variables, and evaluate the potential risk of pleiotropy.

Figure 1

Directed acyclic graph (DAG) illustrating the univariate and multivariate MR analysis framework. Maternal smoking around birth is the exposure, and COPD in offspring is the outcome. The pink-colored nodes represent confounders. The blue-colored nodes represent mediators

https://www.tobaccoinduceddiseases.org/f/fulltexts/203186/TID-23-51-g001_min.jpg

The exposure variable in this study was maternal smoking around birth, while COPD served as the outcome variable. To assess the influences of potential mediators, we included a covariate for the multivariate MR analysis, which consisted of illnesses of the mother, such as high blood pressure and heart disease. Maternal smoking around birth, maternal high blood pressure, and maternal heart disease were assessed through self-reported questionnaires, while COPD was primarily identified using electronic medical records. Summary data for maternal smoking around birth involved 397732 participants (121634 controls and 276098 cases). For COPD data, there were 468475 participants, consisting of 454945 controls and 13530 cases. The data for maternal high blood pressure included 426391 participants, with 130948 controls and 295443 cases. The data for maternal heart disease involved 426240 participants, with 85620 controls and 340620 cases (Table 1). The COPD GWAS dataset was sourced from the UK Biobank and FinnGen databases, and the remaining datasets were derived from the UKB database. All participants are of European descent.

Table 1

Brief description of the genome-wide association study data used

GWAS IDYearTraitConsortiumSample sizeCaseControlnSNPs
ukb-b-176852018Maternal smoking around birthMRC-IEU3977321216342760989851867
ukb-b-181672018Illnesses of the mother: high blood pressureMRC-IEU4263911309482954439851867
ukb-b-124772018Illnesses of the mother: heart diseaseMRC-IEU426240856203406209851867
ebi-a-GCST900188072021Chronic obstructive pulmonary diseaseNA4684751353045494524180654

[i] nSNPs: number of SNPs. NA: not applicable.

Ethics

This study utilized publicly available GWAS summary data, for which the original studies had obtained ethical committee approval. No additional ethical approval or informed consent was required.

Data collation

To collate the data, we selected independent single nucleotide polymorphisms (SNPs) associated with maternal smoking around birth and maternal diseases as IVs, which had genome-wide significance, from the database. This selection process aimed to identify strongly associated variations. Strong linkage disequilibrium (LD) among the SNPs was excluded from the analysis to avoid the bias of IVs. The screening criteria were as follows: 1) determining the genome-wide significance of maternal diseases and living habits based on the genome-wide information of the 1000 Genomes Project (p<5×10-8); 2) ensuring a physical distance of more than 10000 kb between every two genes; and 3) setting an R2 threshold of LD between genes to be <0.001. The selected IVs were analyzed for associations with other phenotypes using the PhenoScanner18, defined as having a history of smoking, currently smoking, and being exposed to tobacco smoke and air pollution in the offspring, to further reduce the possible impact of horizontal pleiotropy on the study. The following SNPs associated with these phenotypes were excluded: rs10226228, rs2183947, rs576982, rs6011779, rs75596189, and rs7899608. See Table 2 for the details of selected IVs.

Table 2

Details for SNPs

PositionSNPOther alleleEffect alleleEAFβSEpR2F
rs12405972TG-0.008061530.0010.000-0.0220.01220.070729456.247
rs12923476AG-0.006864950.0010.000-0.0360.01420.012220034.260
rs1323341GA-0.006829460.0010.000-0.0080.01410.580200030.203
rs2428019AC0.007022780.0010.0000.0020.01370.888800034.158
rs35566160GA0.006373280.0010.0000.0390.01260.001894029.927
rs36072649AT-0.007173160.0010.000-0.0470.01190.000073046.093
rs4865667TC-0.005810870.0010.000-0.0190.01210.115900030.451
rs62477310CT-0.005778770.0010.000-0.0180.01130.109000031.495
rs7002049CT0.00756230.0010.0000.0210.01590.184500036.620

[i] SNPs that were used as instrumental variables met the conditions: p<5×10-8 and F>10. SE: standard error. SNP: single nucleotide polymorphism. EAF: effect allele frequency.

Statistical analysis

The primary analysis methods used to evaluate the association between maternal smoking around birth and maternal diseases on childhood COPD were inverse variance weighting (IVW), MR-Egger regression, and weighted median (WM). IVW is the primary method. It applies weights based on the inverse variance of each IV, assuming the validity of all IVs, but is sensitive to pleiotropy. IVW does not include an intercept term in the regression, and the final causal estimate represents the weighted average of the effect values from all IVs. The key distinction between the MR-Egger method and IVW lies in the MR-Egger method’s incorporation of an intercept term to adjust for pleiotropy. This method relies on the Instrument Strength Independent of Direct Effect (InSIDE) assumption, which stipulates that the pleiotropic effects of IVs should be independent of their effects on the exposure. Furthermore, ME-Egger employs the inverse of outcome variance as the fitting weight. WM calculates the median of the weighted empirical density function, providing a consistent causal estimate even when IVs are invalid, as long as at least half the weight comes from valid IVs. These three methods are often combined to minimize bias due to pleiotropy or weak instrument strength.

The causal estimate from a two-sample analysis, where data on the risk factor and outcome are obtained from separate, non-overlapping datasets, is less prone to bias, with any residual bias tending toward the null. However, when utilizing genetic consortia that include partially overlapping participant sets, the direction and magnitude of bias become uncertain. Therefore, considering that the dataset used in the study includes overlapping samples, we employed the online tool ‘Bias and Type 1 error rate for Mendelian randomization with sample overlap’ to assess bias and Type 1 error rate19.

Multivariate MR methods were employed to adjust for covariates, specifically the mother’s illnesses of high blood pressure and heart disease. Other potential risk factors, such as active smoking, passive smoking, and air pollution, were not included in the adjustment, as SNPs related to these factors were excluded during the IV selection. In multivariate MR, the SNPs utilized as IVs must satisfy the following conditions: 1) they should be associated with all exposure factors in the model, 2) they should not impact the outcome variables through alternative pathways, and 3) the number of SNPs should exceed the number of exposure factors.

We conducted sensitivity analyses to evaluate the robustness of the results. Leave-one-out analysis test was used to assess the stability of individual SNPs by sequentially excluding each IV. Cochran’s Q test was applied to identify heterogeneity, with p<0.05 representing significant heterogeneity. MR-Egger intercept test was used to assess horizontal pleiotropy, with p<0.05 indicating significant horizontal pleiotropy.

All MR analyses were carried out using the R software. IVW, WM, and MR-Egger methods were implemented with the R package Two Sample MR. Multivariate MR methods were performed using the R package Mendelian Randomization. The evaluation indices used were odds ratio (OR) and 95% confidence interval (CI). A two-sided statistical significance level of p<0.05 was considered to indicate statistical significance. This study adhered to the STROBE-MR guidelines, with a detailed checklist provided in the Supplementary file.

RESULTS

Results of univariate MR

Based on the IVW test results, it was found that maternal smoking around birth (IVW, OR=35.13; 95% CI: 10.18–121.20; p<0.001) and illnesses of the mother with high blood pressure (IVW, OR=2.51; 95% CI: 1.03–6.14; p=0.044) were significantly associated with an elevated risk of offspring COPD (Figure 2). However, there was no statistically significant association between illnesses of the mother with heart disease and COPD in the IVW test results (IVW, OR=0.25; 95% CI: 0.03–2.39, p=0.226). Similar findings of the relationship between maternal smoking around birth and offspring COPD were noted in the WM method (p<0.001) (Table 3).

Table 3

Heterogeneity and horizontal pleiotropy estimation

ExposurenSNPsMethodβSEOR (95% CI)pFDR
p
Cochran’s QMR-EggerEgger intercept
QQ_dfpQQ_dfpInterceptp
Maternal smoking around birth9Inverse variance weighting (fixed effects)3.5590.63235.13 (10.18–121.20)<0.001<0.00110.37480.24010.26770.1740.0130.795
MR Egger1.6866.9685.40 (0.00–4604595.09)0.816
Weighted median2.9460.89519.04 (3.30–109.98)<0.001
Illnesses of mother: High blood pressure29Inverse variance weighting (multiplicative random effects)0.9200.4572.51 (1.03–6.14)0.0440.02950.250280.00645.902270.013-0.0230.121
MR Egger3.9481.94551.82 (1.15–2344.28)0.052
Weighted median0.7030.5082.02 (0.75–5.46)0.166
Illnesses of mother: Heart disease8Inverse variance weighting (multiplicative random effects)-1.4071.1620.25 (0.03–2.39)0.2260.22631.3167<0.00128.1766<0.001-0.0140.445
MR Egger-0.3181.7870.73 (0.02–24.14)0.864
Weighted median-0.7760.67460.46 (0.12–1.73)0.250

[i] nSNPs: number of SNPs. SE: standard error. FDR: false discovery rate.

Figure 2

Univariate MR estimated the association of maternal smoking around birth, maternal high blood pressure, and maternal heart disease on COPD in offspring

https://www.tobaccoinduceddiseases.org/f/fulltexts/203186/TID-23-51-g002_min.jpg

No horizontal pleiotropy was observed in our MREgger intercept results (intercept=0.013, p=0.795). According to Cochrane’s Q test (p=0.24), there was no evidence of heterogeneity between maternal smoking around birth and COPD. These results also reduce concerns about the InSIDE assumption. However, heterogeneity was found in Cochrane’s Q test and MR-Egger for the associations between high blood pressure and heart disease and COPD.

The leave-one-out sensitivity analysis results indicated that upon removing each SNP one-by-one, the IVW analysis results of the remaining SNPs closely matched the analysis results obtained from including all SNPs (Figure 3, and Supplementary file Figures S2 and S3). This suggested that none of the SNPs had a strong effect on the estimated relationship.

Figure 3

MR leave-one-out sensitivity analysis of maternal smoking around birth on COPD in offspring

https://www.tobaccoinduceddiseases.org/f/fulltexts/203186/TID-23-51-g003_min.jpg

We assessed bias and Type 1 error rate using the online tool. The results showed that even with complete sample overlap, the bias and Type 1 error rate remained at 0.005 and 0.05, respectively.

Results of multivariate MR

After conducting multivariate MR adjustment for covariates, including maternal high blood pressure and maternal heart disease, the association between maternal smoking around birth and offspring COPD remained statistically significant (AOR=62.11; 95% CI: 16.60–232.46; p<0.001) (Figure 4 and Table 4). This suggested that even after accounting for these covariates, maternal smoking around birth is independently associated with an increased risk of offspring COPD. Notably, this multivariate MR analysis did not adjust for active smoking, passive smoking, and air pollution because SNPs related to these factors were excluded during IV selection.

Table 4

Multivariable Mendelian randomization results of maternal smoking around birth on COPD

ExposurenSNPsβSEAOR (95% CI)p
Maternal smoking around birth93.9780.68053.42 (14.08–202.60)<0.001
Illnesses of mother: High blood pressure271.1620.4173.20 (1.41–7.24)0.005
Maternal smoking around birth103.7830.54143.94 (15.22–126.83)<0.001
Illnesses of mother: Heart disease7-1.1150.5010.33 (0.12–0.88)0.026
Maternal smoking around birth94.1290.67362.11 (16.60–232.46)<0.001
Illnesses of mother: High blood pressure271.2910.4463.64 (1.52–8.72)0.004
Illnesses of mother: Heart disease6-0.8790.6520.42 (0.12–1.49)0.177

[i] nSNPs: number of SNPs. SE: standard error. COPD: chronic obstructive pulmonary disease. AOR: adjusted odds ratio. The multivariate MR analysis adjusted for maternal high blood pressure and maternal heart disease. SNPs associated with active smoking, passive smoking, and air pollution were excluded from instrumental variable selection, and thus these factors were not adjusted for.

Figure 4

After adjustment for maternal high blood pressure and heart disease, both individually and together, multivariate MR estimated the association of maternal smoking around birth on COPD in offspring

https://www.tobaccoinduceddiseases.org/f/fulltexts/203186/TID-23-51-g004_min.jpg

DISCUSSION

In this MR analysis, both univariate and multivariate IVW results consistently suggested a relationship between maternal smoking around birth and an increased risk of COPD in offspring. These findings provide complementary evidence to previous observational studies. For example, Beyer et al.14 reported the influence of maternal smoking during pregnancy or after delivery on the predisposition of offspring to COPD in adulthood in a German population (n=291). The Li et al.20 case-control study suggested a significant direct effect of maternal smoking during pregnancy on offspring COPD in a Chinese population (n=5943). The Upton et al.21 study found that offspring of mothers who smoked ≥25 cigarettes per day had a six-fold increased risk of developing COPD compared to those unexposed to maternal smoking during pregnancy in a British population (n=3202). However, due to ethical considerations, these studies are cross-sectional or retrospective, making it difficult to establish the chronological order of exposure and outcome. Thus, they may be subject to reverse causality, confounding bias (e.g. maternal health, environmental factors), selection bias (related to specific population), and recall bias (inaccurate self-reported maternal smoking history). The relatively small sample sizes (usually in the hundreds or thousands) can also limit statistical power and precision. In contrast, our MR analysis employed SNPs as IVs to mitigate these biases and yielded more accurate estimates of potential causality, leveraging a much larger GWAS sample of European ancestry (COPD, n=468475). Furthermore, MR studies leverage the random allocation of genetic variation, akin to the design of a randomized controlled trial, thereby strengthening causal inference beyond mere association. A previous MR study22 suggested an association between maternal smoking around birth and offspring COPD (6915 cases and 186723 controls), using SNPs associated with exposure at p<5.0×10-7. Our MR analysis provides more robust evidence for this association. We utilized a larger outcome dataset (13530 cases and 454945 controls) and employed a more stringent SNP selection threshold of p<5.0×10-8. Moreover, we adjusted for offspring active smoking and passive tobacco exposure, providing stronger support for an association between maternal smoking around birth and an increased risk of offspring COPD.

Maternal smoking during pregnancy is widely recognized to impair respiratory function in offspring during infancy or childhood23,24. The Gilliland et al.25 observational study evaluated 5762 school-aged children across 12 American communities. The study concluded that maternal smoking during pregnancy heightened the risk of physician-diagnosed asthma and wheezing in childhood, while exposure to environmental tobacco smoke was considered a trigger, rather than a factor of wheezing25. Additionally, another observational study involving 5951 children aged 8–12 years from 9 Russian cities established a strong association between prenatal exposure to maternal smoking and conditions such as asthma, chronic bronchitis, and respiratory symptoms, though early and current exposures showed a weaker link to these diseases26.

Lung development occurs in three stages, namely embryonic, fetal, and alveolar. Any disruption during these stages can significantly impact respiratory system development27. The potential mechanisms by which maternal smoking during pregnancy leads to health problems in offspring from the many toxicants in tobacco smoke include in utero hypoxia, nicotine-induced reductions in uteroplacental blood flow, placental toxicity, and toxic growth restriction28. The harmful effects of tobacco smoke are primarily mediated by the release of carbon monoxide and nicotine. Carbon monoxide can diffuse through the placental barrier and enter the fetal circulation, binding with fetal hemoglobin, and leading to an increase in carboxyhemoglobin. This causes a leftward shift in the oxygen dissociation curve, thereby reducing the oxygen released to fetal tissues, resulting in fetal hypoxia which is detrimental to the development of the lungs and brain29,30. Additionally, prenatal exposure to nicotine has been validated in numerous animal models to impair fetal lung development. Experiments conducted on rhesus monkeys have demonstrated that prenatal exposure to nicotine increases the size of fetal pulmonary vascular walls and enhances the expression of collagen and elastin in airway vessels31. These changes alter the mechanical properties of the pulmonary vasculature and result in impaired lung development. Wongtakool et al.32 found that prenatal exposure to nicotine in fetal mice significantly reduces forced expiratory flow after birth and increases the deposition of collagen proteins in airways and blood vessels. These findings suggest that nicotine exposure leads to an increase in airway length and a decrease in diameter, resulting in reduced lung function32. Similarly, Sandberg et al.33 observed similar findings in lambs, where fetal exposure to nicotine resulted in increased smooth muscle volume but decreased bronchial diameter, thereby limiting airflow during inhalation. Furthermore, other studies have suggested that prenatal exposure to nicotine significantly increases lung and parenchyma volumes, as well as septal surface area after birth34. Collectively, these studies suggest that fetal exposure to tobacco during pregnancy impairs fetal lung and tracheal development, leading to respiratory abnormalities. This may potentially explain the increased risk of COPD among offspring due to maternal smoking around birth. However, due to limited evidence of randomized control trials (RCTs), further research is needed to establish the precise link and underlying mechanisms.

Strengths and limitations

This study provides MR evidence of the relationship between maternal smoking around birth and the risk of COPD in offspring. The results maintain their robustness due to the large scale of the sample size drawn from the databases, absence of pleiotropy, and the elimination of confounders associated with the outcome, such as personal smoking and environmental factors. We further enhanced the transparency of our research process by using online tools to calculate the bias and Type 1 error rate introduced by sample overlap, thereby increasing the reproducibility and credibility of our findings.

However, there are several limitations to consider. Firstly, there was population overlap in our analysis dataset, which could potentially bias the estimated association between maternal smoking around birth and offspring COPD. Secondly, the GWAS data used in this study were derived exclusively from individuals of European ancestry, which may compromise the generalizability of the findings to non-European populations. Genetic associations may vary across populations due to potential differences in allele frequencies, LD patterns, and gene-environment interactions35,36, which we were unable to assess in this study. Future studies incorporating diverse ancestral backgrounds are needed to enhance the applicability of the results. Thirdly, although MR analysis minimizes confounding by observed factors, we cannot entirely rule out residual confounding due to unmeasured or unknown confounding factors. Fourthly, although we performed sensitivity analyses to assess for potential pleiotropy, these methods have limited power to detect and correct for all forms of pleiotropy, especially when the pleiotropic effects are balanced or acting in opposing directions. This may violate the MR assumptions and influence the causal estimates. Fifthly, it is challenging to determine the exact number of cigarettes smoked by pregnant women, which may lead to an overestimation of the results among pregnant women who smoke only a limited amount. Nonetheless, given the health risks associated with smoking during pregnancy, any form of smoking should be avoided. Finally, while this study suggests an association between maternal smoking and an increased risk of COPD with genetic variations, it should be noted that MR analysis is primarily a statistical tool and requires additional support from future clinical studies.

CONCLUSIONS

Our study presents MR evidence supporting an association between maternal smoking around birth and the increased risk of offspring COPD in European populations. This finding indicates maternal smoking as an independent risk factor for offspring COPD.