Technologic advances and changing social paradigms have led to the increased use of assisted reproductive technologies (ARTs) for the purposes of procreation.1 The main techniques to treat infertility include: ovarian stimulation and intrauterine sperm insemination as well as techniques whereby oocytes and sperm are handled in vitro, like in vitro fertilization (IVF), intracytoplasmic sperm injection, and in vitro maturation.2–4 We refer to ART as any of the aforementioned infertility treatments leading to conception outside natural coitus.
In Canada, reports indicate that the use of fertility treatments increased by 50% over the past decade.1,5 Although the short-term perinatal outcomes after ART are well established, long-term neurodevelopmental outcomes, including cognitive, motor, and language development, are still a source of controversy.3,6–9
A review from the National Institutes of Health recognized that “lingering data gaps [exist] in the equivocal literature for many neurodevelopmental disabilities relative to ART” and that “…cohorts with longitudinal assessment…of neurodevelopment…are paramount for the development of empirically-based guidance….”10 Similarly, the largest systematic review of more than 80 studies addressing long-term neurodevelopment after ART concluded that additional data were required to determine the true effect of fertility treatments on these outcomes.3
In our study, we tested the hypothesis that neurodevelopment at 2 years is related to mode of conception. As such, using standardized and validated tools, the objective of this study was to compare children's cognitive, motor, and language development at 2 years of age after ART relative to natural conception.
MATERIALS AND METHODS
We analyzed data from the 3D-Study (Découvrir, Développer, Devenir), a prospective, longitudinal cohort carried out from 2010 to 2012 by the Integrated Research Network in Perinatology of Quebec and Eastern Ontario in Canada.11,12 The 3D-Study recruited 2,366 women in their first trimester of pregnancy and their respective births across nine sites in the province of Quebec and gathered extensive data on the mother–father–child triad from conception until 2 years postpartum. At 2 years postpartum, children underwent cognitive, motor, and language testing using the Bayley Scales of Infant and Toddler Development, 3rd edition, and the MacArthur-Bates Communicative Development Our primary objective was to compare the neurodevelopment in children born with the help of fertility treatments (exposed) relative to those born off pregnancies conceived naturally (controls). Our secondary objective was to describe baseline medical and sociodemographic differences between an ART and a non-ART cohort in Quebec.
The 3D-Study enrolled: 1) pregnant women between 8 0/7 and 13 6/7 completed weeks of gestation and 2) planning delivery in a 3D-Study–associated hospital. Exclusion criteria included: 1) women younger than 18 years of age, 2) illegal intravenous drug users, 3) non-English or French speakers, 4) severe illnesses or life-threatening conditions, and 5) multiple pregnancies, which includes twins or higher order multiples and mothers whose previous pregnancies had been enrolled in the study.
The Bayley Scales of Infant and Toddler Development, 3rd edition, is a validated and standardized developmental assessment for children aged 1–42 months that includes five independent scales (cognitive, motor [fine and gross], language, adaptive function, and socioemotional).13 In our study, we used the cognitive scale, which assesses cognitive processes like memory, exploration, manipulation, and sensorimotor development as well as the motor scale, which is divided into the fine motor and gross motor subtests and evaluates quality of movement, sensory integration, perceptual–motor integration, prehension, and other milestones. Each scale consists of a series of developmental play tasks. Scale-specific raw scores of completed items are then converted to scaled scores and to composite scores as a function of age. For the fine and gross motor subtests, only scaled scores are available. The scaled and composite scores are then compared with normalized scores taken from typically developing children of similar age. Mean is set at 10 and 100 with a standard deviation of 3 and 15 for the scaled scores (fine and gross motor) and the composite score (cognitive, motor), respectively. The Bayley scales (3rd edition) have established test–retest reliability, internal consistency as well as convergent and divergent validity.13 In our study, trained individuals who were blinded to the exposure administered the tool.
To evaluate language development, we used the toddler short form of MacArthur-Bates Communicative Development Inventories,14 a norm-referenced parent questionnaire that captures important information about a child’s developing abilities. Specifically, we used a 100-word vocabulary production checklist and a question about early word combinations, which can be reported on a 100-point scale.14,15 The English MacArthur-Bates toddler short form has established reliability as well as content and concurrent validity.14,15 A French version of the short form has been adapted for French-speaking children in Québec using the approach described by Fenson et al.14
Based on the proportion of children having undergone the assessments (ART, n=175; natural conception, n=1,345), a power calculation was conducted to determine whether a minimal clinically significant difference in the Bayley scales (3rd edition) scores could be detected. Using previously reported mean and variance composite cognitive scores at 24 months of age, we used a two-sided type I error (α) of 5% and obtained 98.57% power to detect a 5-point difference between groups.16
We carried out our analysis in four steps. First, we described each subgroup according to their baseline demographic and gestational characteristics (Table 1). We included descriptors of infertility diagnoses for patients undergoing ART and those defined as subfertile (Table 2). Subsequently, we described obstetric outcomes in the ART compared with natural conception group (Table 3).
We then evaluated the Bayley scales (3rd edition) (cognitive and motor) and MacArthur-Bates (language) scores for each mode of conception using χ2 and analysis of variance statistical testing to determine within-group variability. Finally, we applied linear regression models to evaluate both the crude and adjusted effects of ART on scale scores using the natural conception group as a reference. Estimates for individual ART techniques were calculated as were estimates for grouped modes of conception: in vivo (ovarian stimulation and intrauterine insemination) and in vitro (IVF, intracytoplasmic sperm injection, and in vitro maturation). Analyses were adjusted for parental age (years), family income (Canadian dollars), maternal ethnicity (Caucasian compared with not), maternal education (level), marital status (married compared with not), maternal history of depression (yes or no), maternal smoking intake, alcohol consumption during pregnancy (yes or no), antidepressant use (yes or no), and folic acid intake during pregnancy (yes or no). Sensitivity analyses were carried out to evaluate the robustness of the model adjusting for thyroid disease, breastfeeding status as well as removing single women and same-sex couples from our model. In accordance with a provincial policy of elective single embryo transfer during the study period, the vast majority of patients undergoing embryo transfer (IVF, intracytoplasmic sperm injection, in vitro maturation) in our study received a single embryo per cycle. An exemption was made if the patient was older than 35 years of age and had prior cycle failures, in which case the transfer of two embryos was considered. We sought and received approval from the institutional ethics review board at the CHU Sainte-Justine Center (acting as the central ethics review board) in Montreal, Quebec. All analyses were conducted using SAS 9.3.
Our final cohort consisted of 2,366 women carrying singleton pregnancies. We compared 278 pregnancies after ART with 2,088 pregnancies after natural conception. The ART cohort was comprised of the following techniques: stimulation (n=53), intrauterine insemination (n=79), IVF (n=32), intracytoplasmic sperm injection (n=105), and in vitro maturation (n=9). The spontaneous conception cohort was comprised of subfertile patients (n=490) and patients achieving natural conception at less than 6 months (n=1,598). Patients undergoing ART were more likely to be older, more educated, of lower parity, and with higher rates of thyroid disease. The later finding may be the result of more intense screening in the ART group as well as underlying thyroid dysfunction leading to infertility. On the other hand, mothers in the natural conception group were more likely to be Caucasian, multiparous, and with higher rates of caffeine, smoking, and alcohol consumption before and during pregnancy (Table 1).
In Table 2, infertility characteristics were compared between patients undergoing ART and those identified as being subfertile, who conceived after 6 months of trying. Patients having undergone ART had a longer time to conception and higher rates of underlying infertility diagnoses in both females and males (P<.001).
Table 3 presents obstetric and neonatal outcomes between both groups. Neonates born after ART were more likely to be of lower birth weight (3,279 g [interquartile range 697] compared with 3,356 g [interquartile range] 1,034), more likely to be born by cesarean delivery (36.5% compared with 25.1%), and to be admitted to the neonatal intensive care unit (7.7% compared with 3.9%). Although statistical differences were noted in the gestational age at birth, these are unlikely to be of clinical significance (38.4 weeks of gestation [interquartile range 2.0] compared with 38.8 weeks of gestation [interquartile range 2.0], P=.006).
A total of 175 of 278 children in the ART group (62.9%) and 1,345 of 2,088 in the natural conception group (64.4%) underwent neurodevelopmental assessments at 24 months. No significant differences were observed in cognitive (composite mean score±standard deviation: 98.5±11.2 compared with 100.1±11.4, P=.08), fine motor (scaled mean score 11.4±2.3 compared with 11.6±2.7, P=.41), gross motor (scaled mean score 8.8±2.0 compared with 8.9±2.3, P=.37), or language scores (53.9±23.6 compared with 55.6±24.4, P=.50) (Table 4). Finally, Table 5 showcases the linear regression models. After adjusting for relevant confounders, children born after ART showed no difference in Bayley scales (3rd edition) cognitive composite scores (B1 [standard error]=−1.60 [0.9], β′=−0.045, P=.08), composite motor scores (B1 [standard error]=−1.33 [1.0], β′=−0.036, P=.18), or MacArthur-Bates language scores (B1 [standard error]=−0.28 [2.1], β′=−0.003, P=.89) relative to natural conception. No significant differences were observed when comparing in vivo and in vitro techniques separately (P>.05) nor when comparing independent techniques individually. However, our study was not powered to compare the latter (Appendix 2, available online at http://links.lww.com/AOG/A911). Sensitivity analyses showed no differences in the model estimates when adjusting for thyroid disease, breastfeeding rates nor when removing single women or same-sex couples from the model.
Relative to participants lost to follow-up in the ART cohort, mothers of children who underwent testing were more likely to be Caucasian and of higher income. Among the natural conception cohort, mothers of children who underwent testing were more likely to be Caucasian, older, of higher education and income, and of lower parity (Appendix 3, available online at http://links.lww.com/AOG/A911).
Creating families through ART raises a number of concerns about potentially adverse consequences for child development.2,3,17–19 However, these concerns stem from largely retrospective studies with small sample sizes and heterogeneous methodologies.20 By specifying the infertility treatments used, accounting for predictors of development, and using standardized testing, our prospective study overcomes some of these limitations and provides reassuring results in that children born after ART appear to have similar cognitive, motor, and language skills than children born after natural conception at 2 years of age.
The recent Upstate KIDS Study sought to assess the same question in this report, notably, the association between the mode of conception and children's development.9 According to its results, children's development at age 3 years appears independent on mode of conception.9 Although the prospective nature of the KIDS study is a major strength, a number of its limitations are addressed by our study. Whereas the KIDS study recruited newborns, the 3D-Study recruited mothers during the first trimester, allowing us to prospectively gather data on prenatal factors that may have affected neurodevelopment such as antidepressant, folic acid, alcohol, and smoking exposure. Second, their study used the Age and Stage Questionnaires to assess neurodevelopment. Unlike the Bayley scales (3rd edition), which are administered by a third party blinded to the exposure, the Age and Stage Questionnaires require parental administration, which may introduce confirmatory bias.21 Third, although the 3D-Study required a prospective, two-step verification of exposure including ovarian stimulation and intrauterine insemination, the KIDS study could not verify the validity of the exposure because there is no registry in the United States.9 Nevertheless, the replication of similar findings in both studies despite the use of different methodologies is encouraging and may serve to reassure patients undergoing ART.
Each facet of neurodevelopment after ART has been studied previously. To date, two large systematic reviews of more than 80 studies addressed cognitive development after ART, concluding that, “there is sufficient data to support…no difference in development…between IVF and spontaneously conceived children”3,20 and that “most studies showed no associations with cognitive…development.”3 Because we cannot preclude that differences in cognition may appear later in life, a follow-up of children from prospective studies such as this one may be necessary.
Similarly, prospective evidence of motor skills at 24 months of age evaluated with standardized testing is lacking in the literature. Although some studies do point to delays in motor development between 16 and 18 months,22 our findings concur with the majority of the literature that motor development is not affected by the mode of conception.
Most of the controversy seems to be found in language development after ART.7,22,23 As evidenced by the lack of consensus, there is a call for prospective evaluation of children's language skills after ART as we have done in our study, in which we find no significant difference in MacArthur-Bates scores at 24 months of age.
The strengths of the present study include: the use of a prospective cohort of pregnant women with up to 3 years of follow-up, the use of standardized tools administered by professionals blinded to exposure, and the analysis of a number of ART techniques. In addition, we adjusted for a vast array of pertinent confounders, including maternal depression, which is notably lacking in the literature.24 Likewise, our study uses North American data, which may enhance external validity amongst Canadian and U.S. centers. Finally, we conducted sensitivity analyses, which confirmed the robustness of our model.
On the other hand, a number of limitations are worth mentioning. Although this study was powered to estimate the effect of ART as an overall category, it was not powered to detect a difference among individual techniques. Likewise, we considered the main ART technique as exposure and could not account for the type of cycle (natural compared with stimulated) used. Furthermore, although loss to follow-up rates were moderate in each group, a post hoc power calculation reveals adequate power to answer the study question. Moreover, given the study design, we were not able to untangle the effects of the underlying infertility from the ART technique used, because this is an example of confounding by indication. Finally, the children in our study population were young, and in certain cases, developmental characteristics may have a limited predictive value for long-term development.
All in all, the findings hereby presented may be useful in the clinical counseling of patients undergoing ART. Future prospective studies with long-term follow-up, powered to study individual ART techniques as well as evaluation of behavioral outcomes (such as attention deficit or hyperactivity and autism-like behaviors), are necessary.
1. Vélez MP, Connolly MP, Kadoch IJ, Phillips S, Bissonnette F. Universal coverage of IVF pays off. Hum Reprod 2014;29:1313–9.
2. Pandey S, Shetty A, Hamilton M, Bhattacharya S, Maheshwari A. Obstetric and perinatal outcomes in singleton pregnancies resulting from IVF/ICSI: a systematic review and meta-analysis. Hum Reprod Update 2012;18:485–503.
3. Bay B, Mortensen EL, Kesmodel US. Assisted reproduction and child neurodevelopmental outcomes: a systematic review. Fertil Steril 2013;100:844–53.
4. Society of Obstetricians and Gynaecologists of Canada, Okun N, Sierra. Pregnancy outcomes after assisted human reproduction. J Obstet Gynaecol Can 2014;36:64–83.
5. Zelkowitz P, King L, Whitley R, Tulandi T, Ells C, Feeley N, et al. A comparison of immigrant and Canadian-born patients seeking fertility treatment. J Immigr Minor Health 2015;17:1033–40.
6. Bowen JR, Gibson FL, Leslie GI, Saunders DM. Medical and developmental outcome at 1 year for children conceived by intracytoplasmic sperm injection. Lancet 1998;351:1529–34.
7. Gibson FL, Ungerer JA, Leslie GI, Saunders DM, Tennant CC. Development, behaviour and temperament: a prospective study of infants conceived through in-vitro fertilization. Hum Reprod 1998;13:1727–32.
8. Bonduelle M, Ponjaret I, Van Steirteghem A, Derde MP, Devroey P, Liebaers I. Developmental outcome at 2 years of age for children born after ICSI compared with children born after IVF. Hum Reprod 2003;18:342–50.
9. Yeung EH, Sundaram R, Bell EM, Druschel C, Kus C, Ghassabian A, et al. Examining infertility treatment and early childhood development in the Upstate KIDS Study. JAMA Pediatr 2016;170:251–8.
10. Hediger ML, Bell EM, Druschel CM, Louis GMB. Assisted reproductive technologies and children's neurodevelopmental outcomes. Fertil Steril 2013;99:311–7.
11. Reboul Q, Delabaere A, Luo ZC, Nuyt AM, Wu Y, Chauleur C, et al. Prediction of small for gestational age neonates by third trimester fetal biometry and impact of ultrasound-delivery interval. Ultrasound Obstet Gynecol 2016 May 6 [Epub ahead of print].
12. Pamidi S, Marc I, Simoneau G, Lavigne L, Olha A, Benedetti A, et al. Maternal sleep-disordered breathing and the risk of delivering small for gestational age infants: a prospective cohort study. Thorax 2016;71:719–25.
13. Weiss LG, Oakland T, Aylward GP. Bayley-III clinical use and interpretation. London (UK): Academic Press; 2010.
14. Fenson L, Pethick S, Renda C, Cox JL, Dale PS, Reznick JS. Short-form versions of the MacArthur communicative development inventories. Applied Psycholinguistics 2000;21:95–116.
15. Fenson L, Marchman V, Thal D, Dale P, Reznick J. MacArthur-Bates communicative development inventories: user's guide and technical manual Brookes. Baltimore (MD); Brookes Publishing Co.; 2007.
16. Vanderveen JA, Bassler D, Robertson CM, Kirpalani H. Early interventions involving parents to improve neurodevelopmental outcomes of premature infants: a meta-analysis. J Perinatol 2009;29:343–51.
17. Pochiraju M, Nirmalan PK. Type of conception and outcomes in women with singleton pregnancy. J Clin Diagn Res 2014;8:103–5.
18. Wan HL, Hui PW, Li HW, Ng EH. Obstetric outcomes in women with polycystic ovary syndrome and isolated polycystic ovaries undergoing in vitro fertilization: a retrospective cohort analysis. J Matern Fetal Neonatal Med 2015;28:475–8.
19. Leunens L, Celestin-Westreich S, Bonduelle M, Liebaers I, Ponjaert-Kristoffersen I. Follow-up of cognitive and motor development of 10-year-old singleton children born after ICSI compared with spontaneously conceived children. Hum Reprod 2008;23:105–11.
20. Ludwig AK, Sutcliffe AG, Diedrich K, Ludwig M. Post-neonatal health and development of children born after assisted reproduction: a systematic review of controlled studies. Eur J Obstet Gynecol Reprod Biol 2006;127:3–25.
21. Salomonsson B, Sleed M. The Ages & Stages Questionnaire: Social-Emotional: a validation study of a mother-report questionnaire on a clinical mother-infant sample. Infant Ment Health J 2010;31:412–31.
22. Zhu JL, Basso O, Obel C, Hvidtjorn D, Olsen J. Infertility, infertility treatment and psychomotor development: the Danish National Birth Cohort. Paediatr Perinat Epidemiol 2009;23:98–106.
23. Middelburg KJ, Heineman MJ, Bos AF, Hadders-Algra M. Neuromotor, cognitive, language and behavioural outcome in children born following IVF or ICSI—a systematic review. Hum Reprod Update 2008;14:219–31.
24. Hart R, Norman RJ. The longer-term health outcomes for children born as a result of IVF treatment. Part II—Mental health and development outcomes. Hum Reprod Update 2013;19:244–50.