Article
Open access
Published: 07 February 2022

Prioritization of putatively detrimental variants in euploid miscarriages

Scientific Reports volume 12, Article number: 1997 (2022) Cite this article

2184 Accesses
12 Altmetric
Metrics details

Subjects

Abstract

Miscarriage is the spontaneous termination of a pregnancy before 24 weeks of gestation. We studied the genome of euploid miscarried embryos from mothers in the range of healthy adult individuals to understand genetic susceptibility to miscarriage not caused by chromosomal aneuploidies. We developed gp , a pipeline that we used to prioritize 439 unique variants in 399 genes, including genes known to be associated with miscarriages. Among the prioritized genes we found STAG2 coding for the cohesin complex subunit, for which inactivation in mouse is lethal, and TLE4 a target of Notch and Wnt, physically interacting with a region on chromosome 9 associated to miscarriages.

The genetic architecture of sporadic and multiple consecutive miscarriage

Article Open access 25 November 2020

Expanded clinical validation of Haploseek for comprehensive preimplantation genetic testing

Article 26 March 2021

Whole-genome risk prediction of common diseases in human preimplantation embryos

Article Open access 21 March 2022

Introduction

Miscarriage, the spontaneous termination of a pregnancy before 24 weeks of gestation, occurs in 10–15% of all pregnancies^1,2,3 and has both environmental and genetic causes¹. Miscarriages are often the result of chromosomal aneuploidies of the gametes but they can also have non random genetic causes like small mutations (SNPs and indels), both de-novo or inherited from parents. Miscarriages are mostly studied using parental genetic information^4,5 and at a resolution that leaves the vast majority of the genome unexplored. Comparative genomic hybridization detects variants of several thousand base pairs^6,7,8, while targeted resequencing resolves point mutations. Both are currently the most accurate methods for the genetic analysis of parental DNA of miscarriages but are not sensitive to small variants, or target only a few coding regions. Using a different approach, the only study so far that tests for genome-wide genetic association in a large cohort of miscarriages is also based on maternal information⁹. Depending on the mode of inheritance, the study of parental genome might be ineffective as there is uncertainty about which parts of the parental genomes are actually inherited by the embryo, and it provides no way to identify de novo mutations. Therefore, extending the analysis to fetal genomes is the necessary next step to fully understand the genetics of miscarriages.

DNA sequence information of miscarried fetuses has been already used to determine the genetic component of miscarriages^10,11. Most studies adopt a family-based approach integrating pedigree and parental genomic data, often with focus on a reduced range of fetal phenotype^12,13,14,15. Very often the focus is on candidates genes. Examples are the identification of a mutation in the X-linked gene FOXP3 in siblings male miscarriages¹⁶, and the identification of truncating TCTN3 mutations in unrelated embryos¹⁷. A number of studies analyze instead exome sequences with different approaches^{18,19,20,21,22}. Among them, one study selects only variants transmitted to both sibling miscarriages¹⁹, others limit to autozygous variants^17,18, while some concentrate on delivering accurate diagnosis²¹. All these studies consider number of cases in the order of the tens and in most cases are motivated by phenotypic information mostly deriving from ultrasound scans. Two other studies adopt a cohort-based approach analyzing up to thousands of embryonic genomes with a range of phenotypes^23,24. One of them focuses on searching causative variants, demonstrating that exome sequencing effectively informs genetic diagnosis in about one third of the 102 cases considered²⁴. The other one focuses on conserved genes in copy number variable (CNV) regions in 1810 cases to identify 275 genes, often in clusters, located in the CNVs and potentially implicated in essential embryonic developmental processes²³. Because the number of embryos they analyze is too small for genetic association analysis to be effective, all studies mentioned so far perform sequencing followed by variant annotation and prioritization. All investigate apparently euploid embryos and focus on rare variation, but they use different criteria to select the variants and never release code to fully reproduce the variant prioritization.

In this study, we analyzed whole-genome sequencing on euploid embryos from idiopathic spontaneous pregnancy losses (both first and recurrent) and developed gp , a pipeline to prioritize putatively causative variants in coding regions. gp performs filtering of high-quality genomic variants based on prediction of the functional effect of the variants and using a set of parameters that can be specified by the user. This first selection is completed by filters for technical artifacts (e.g. mapping errors, read depth) and for false positives through resampling in a control cohort. Our pipeline can incorporate prior information on candidate genes, but is also robust to the discovery of novel genes. We prioritize on average 49 variants per embryo with high and moderate impact in genes relevant for embryonic development and mitochondrial metabolism, some of which were previously identified for having a role in miscarriages. We demonstrate that variant prioritization can be effective also when dealing with a limited number of samples and develop an approach that can be applied to a larger-scale project. Results from this study can be used to inform molecular diagnosis of pregnancy loss.

Results

To understand genetic susceptibility to miscarriage we studied the genome of forty-six spontaneously miscarried embryos. The embryos’ gestational age at pregnancy termination, calculated as the interval between the pregnancy termination date and the last menstruation date, ranges from 7.14 to 19.43 weeks (median is 10.3 weeks). Twenty-one embryos are classified as the product of recurrent miscarriages²⁵. The mothers of the embryos are mostly of European origin (87%) and their median age at the date of collection was 36.7 ± 5.9 years, with slightly significant higher age in recurrent cases compared to first ones (Fig. 1, Mann–Whitney p-value = 0.02). Medical records of the mothers of the embryos report no major comorbidities. Folic acid was taken by 71% of the mothers with no difference between first and recurrent cases (Fig. 1, Chi-square p-value = 0.96). Median body mass index and menarche age are comparable between first and recurrent cases, as well as comparable to a group of control women (Fig. 1). Altogether, from the available medical records, we suppose that the recruited mothers of the embryos were in the range of healthy adult individuals.

It is known from literature that about half of the miscarriages in the first trimester are due to large chromosomal aneuploidies, such as trisomies or deletions of large chromosomal chunks²⁶. In this study we want to focus on cases in which the genome is euploid, therefore the forty-six embryos were screened for chromosomal aneuploidies prior to whole-genome sequencing. We find that 15 (32.6%) samples were euploid while 56.6% of the embryos presented aneuploidies (Figure 2). The most common aneuploidy in our data set is the trisomy of chromosome 22 (26.9%), followed by trisomy of chromosome 16 (15.4%). In particular, a first round of detection of aneuploidies on chromosomes 13, 15, 16, 18, 21, 22, X, and Y through Short Tandem Repeats analysis discarded 45.7% of samples, and a subsequent analysis through comparative genomic hybridization and copy number variation detection from low-coverage sequencing discarded another 10.9% of the samples. Finally, a number of embryos (10.9%) dropped off the analysis due to low-quality DNA or maternal contamination.

Due to resources constrains, the whole genome of ten (6 first and 4 recurrent miscarriages) out of 15 euploid embryos was sequenced using Illumina short-reads at 30 X coverage. In the set of embryos genomes, we identified 11 M single-nucleotide polymorphisms (SNPs) and 2 M small insertions or deletions (indels).

Prioritization of genetic variants in coding genomic regions

Genomic variations were analyzed by prioritizing variants at single embryolevel, in the hypothesis of genetic heterogeneity among causative variants and phenotypes and considering the limited sample size that would limit conclusions form aggregate analyses. We developed the gp pipeline to prioritize putatively damaging genetic variants from sequencing data. gp takes as input genomic variants information from cases and controls (including the per-individual allelic counts) in form of a vcf file and outputs a table of variants prioritized according to user-defined parameters. gp uses functional annotations of genomic variants, information from publicly available sequence data of presumably healthy individuals, and, if available, knowledge of genes involved in the trait under study. gp currently analyze coding regions and performs four filtering steps (Fig. 3). The first filter (Filter I) retains variants based on: (i) an overall impact on the gene product classified as moderate or high²⁷; (ii) a user-defined threshold of allele frequency in control populations; (iii) the combined property of being putatively damaging (quantified by the CADD score²⁸) and located in genes intolerant to loss of function (determined by the pLI score²⁹). In addition it is possible to incorporate one or more user-defined lists of genes relevant to the trait under study. Variants retained by Filter I (hits) are further filtered to control for false positives with Filters II and III. In particular Filter II removes variants in genes with too many hits, while Filter III determines the chance for genes to be selected in a control population based on criteria specified in Filter I. In practice, a number of control individuals are sampled a number of times and their genetic data filtered using Filter I to obtain a list of genes selected by chance (Fig. 3C). Finally, Filter IV excludes private variants with read depth outside the range found in non-private ones.

We applied the gp pipeline to data from the high-coverage whole-genome sequences of genomic DNA of the embryos. For Filter I we set allele frequency < 5% in the 1000 Genomes³⁰ and gnomAD²⁹ reference populations. While there is no rule to define a precise threshold for rare variants, more commonly 1% or less is used. Here we use a higher threshold, in the hypothesis that some of the variants that we are interested in might have a mild effect and act under recessive or compound heterozygosis mode of inheritance. The functional effect of the variant within the gene context was taken into account in two ways: either selecting for putatively deleterious variants (CADD score > 90th percentile) in genes highly intolerant to loss of function (pLI score > 0.9), or selecting for variants in genes known to be involved in early embryonic development. In particular for this last option we included five lists of genes, namely genes involved in embryo development (Gene Ontology GO:0009790), genes lethal during embryonic stages³¹, essential for embryo development³¹, genes discovered through the Deciphering Developmental Disorders project³², and a manually curated list of candidate genes known to be involved in miscarriages. We requested the variant to satisfy one or both these criteria: (i) be in a gene present in at least two of the five lists or (ii) have CADD score above the 90th percentile and be in a gene with pLI > 0.9. Overall, filter I retained 1038 variants (hits) in embryos.

Filter II removed variants in genes with > 5 hits, under the assumption that variants found in these genes are likely to be sequencing and alignment artifacts. With few exceptions, we observed that the number of hits per gene at the 99th percentile was five, even if there is no significant correlation between number of hits and gene length (Spearman r² = 0.05 p-value = 0.124), and that hits in genes with >5 hits are enriched for private variants which are more prone to be sequencing errors (Fig. 4A).

For Filter III we used as control population 929 individuals from the Human Genome Diversity Project³³ from which we resampled 100 times ten individuals after assessing that there is no stratification between our samples and the HGDP data (Fig. S2). On each resampled set we performed Filter I analysis and recorded the genes that were retained. Overall 5,488 unique genes were retained in controls with different frequencies in samples across replicates (Fig. 4B). When considering the 95th percentile, 1531 genes are found >5% of times across replicates. Therefore when prioritizing genes in the embryos, hits within these genes (1223 genes) were removed by Filter III.

Filter II and III, retained 447 hits of which 21% are private with respect to 1000 Genomes and gnomAD data sets. Despite comparable read depth between private and non-privates variants (Fig. 4C, KS test p-value = 0.99, F-test p-value = 0.06), to control for possible artifacts due to scanty coverage, we applied a further filter that removes hits which are private variants with read depth outside the range found in non-private ones. While this last feature is not strictly necessary for this study it might be useful for other studies.

Properties and biological significance of the prioritized variants and genes

After all filters, gp prioritize 439 unique variants in 399 genes that code for 980 transcripts (Supplementary Table 2). Of the 439 prioritized variants 182 are not found in the HGDP data set, while for the remaining 257 (58.5%) the minor allele frequency is at most 1% in the full HGDP cohort. Almost all the prioritized genes (n=378) have an OMIM accession number and 18.8% of them were not in the lists of candidate genes used by gp as input during the prioritization, demonstrating that gp is robust to detection of genes never investigated before in relationship to the phenotype under study.

Nine genes are involved in the pathway of mitochondrial translation (Reactome identifier R-HSA-5368287) and this number represent a significant 4.9 fold enrichment over random expectations (Supplementary Table 3, p-value = 1.45E–04, FDR = 0.03). Similarly, we observe over overrepresentation of genes involved in cell cycle checkpoints (R-HSA-69620) and signaling by Rho GTPases (R-HSA-194315). With reference to the cellular compartments where the gene product are expressed, we observe a 7.7 fold significant enrichment (p-value 7.82E–04, FDR = 0.04) of protein expressed in the mitotic spindle pole or in associated complexes (Supplementary Table 4), among which the product of STAG2 for which we observe an high-impact mutation in one embryo from this study. Finally, seven genes (BHLHE40,DBN1, FOXA1, HSPD1, PLXNA3, SLC35A2, SRF) were previously identified as essential genes in copy-number variable regions from the analysis of hundreds of miscarried fetuses²³.

In the embryos 4.1% of the prioritized variants are stop gains/loss, frameshift indels, and variants that disrupt splicing sites, all classified as having high impact on the gene products, while missense mutations prevail among the variant with moderate effect (Fig. 5A). Averages per embryos are 48.9 ± 8.0 genomic variants in 47.8 ± 7.7 genes coding for 113.5 ± 24.6 transcripts (Fig. 5B). In almost all prioritized genes, gp retains only one variant per embryo, with few exceptions (five cases with two e and one with three variants per gene), as shown in Fig. 5B, where the allele dosage and impact are also shown.

Mutations in STAG2, FLAD1, TLE4, FRMPD3, and FMNL2 in the embryos

Among the selected mutations through variant prioritization we would like to highlight a few cases for their relevance. The male embryo FE130 carries two high-impact mutations in single copy. The first is a one extremely rare T>G transversion (rs913664484, G frequency is 4.7e–05 in 42.7 k individuals from gnomAD) at the 5′ end of the first intron of the Stromal antigen 2 (STAG2) gene. The mutation disrupts a splicing site, possibly having a high impact on the gene product. STAG2 is located on the X chromosome and its inactivation is the cause of severe congenital and developmental defects in embryos and infants^32,34,35,36 as well as chromosomal aneuploidies in several types of human cancers³⁷. Interestingly, only mildly-deleterious mutations have been found in alive human males, while females can carry highly deleterious mutations in heterozygosis³⁵. STAG2 codes for the cohesin subunit SA-2³⁸. Cohesins are ring-shaped protein complexes that bring into close proximity two different DNA molecules or two distant parts of the same DNA molecule and are responsible for the cohesion of sister chromatids³⁹. In mouse, inactivation of Stag2 causes early embryo lethality⁴⁰.

The second high-impact mutation of FE130 is a stop gain in the Flavin Adenine Dinucleotide Synthetase 1 (FLAD1) gene that is expressed in the mitochondrial DNA where it catalyzes the adenylation of flavin mononucleotide (FMN) to form flavin adenine dinucleotide (FAD) coenzyme⁴¹. The FAD synthase is an essential protein as the products of its activity, the flavocoenzymes play a vital role in many metabolic processes and in fact FAD synthase deficiencies (OMIM 255100) associated with homozygous severe mutations cause death in the first months of life⁴². In FE130 the stop mutation p.Q159* affects one of the five isoforms (Uniprot identifier Q8NFF5-5) at the second last residue, therefore we can speculate that it might not seriously compromise the function of the protein.

The embryo FE136 carries an heterozygous missense mutation (rs41307447) in the Transducin-like enhancer protein 4 gene (TLE4, synonym GRG-4) that causes a substitution of a polar amino acid with another polar amino acid (Ser > Tre) in the seventh exon of the gene, corresponding to a low complexity domain of the protein. The rs41307447 polymorphism is tolerated (SIFT score 0.18) and supposed to be benign (PolyPhen score 0.003), nevertheless the TLE4 gene is classified as highly intolerant to loss of function (pLI score 0.999) and the CADD score associated to rs41307447 is in the 99.8th percentile. TLE4 is a trascriptional repressor of the Groucho-family expressed in the embryonic stem cells where it represses naive pluripotency gene⁴³ and it is a direct transcriptional target of Notch⁴⁴. TLE4 is also expressed in the extravillous trophoblasts⁴⁵ where it is part of the Wnt signaling pathway that promotes implantation, trophoblast invasion, and endometrial function⁴⁶. Finally, a study in a cohort of 750 women finds significant association between the A allele of rs7859844 on chromosome 9 and recurrent miscarriages, further showing that rs7859844 physically interact with TLE4⁹. In our study among all embryos only FE106 carries the intergenic variant rs7859844.

Among prioritized variants shared by more than one embryo, the male FE165 and female FE106 embryos share a stop gain mutation (p.Q1758*) in the X-linked FERM and PDZ domain containing 3 (FRMPD3) gene, which is highly intolerant to loss of function (pLI = 0.91). The mutation falls at the protein C-terminal in a polyQ stretch (27 residues). While little is known in humans about this gene, a study in lion head goose finds significant association between high expression of FRMPD3 and low production of eggs⁴⁷.

Five embryos, among which the carrier of the missense mutation in TLE4, share one copy of an haplotype composed of two T alleles 4bp apart causing stop-gain (rs750755379) and missense (rs866373641) substitutions in the Formin-like protein 2 gene (FMNL2, Fig. 5C). The two alleles are found at <4% frequency in the gnomAD (v2.1.1) data base (Fig. 6A) and are in perfect linkage disequilibrium (r² = 1) in the embryos. In addition to the two mutations described above, the embryo FE165 has a deleterious and probably damaging missense mutation in phase with the two others (rs189416564, SIFT = 0, PolyPhen = 0.969). FMNL2 codes for a formin-related protein expressed in multiple human tissues and in particular in gastrointestinal and mammary epithelia, lymphatic tissues, placenta, and in the reproductive tract⁴⁸. In the fetus FMNL2 is expressed in the cytoplasm of brain, spinal cord, and rectum⁴⁹. FMNL2 is an elongation factor of actin filaments that drives cell migration by increasing the efficiency of lamellipodia protrusion^50,51, and its overexpression is associated with cancer⁵². The stop-gain mutation we find in the five embryos is located in the first domain of the protein, a Rho GTPase-binding/formin homology 3 (GBD/FH3) domain involved in subcellular localization and regulation of activation (Fig. 6B). The stop codon produces a truncated protein that lacks the Formin Homology-2 (FH2) domain, which directly binds to the actin filament catalyzing its nucleation and elongation.

Discussion

Miscarriages are frequent events with a complex aetiology whose genetic components have not been completely understood. We developed a scalable pipeline that investigates small genetic variation which has rarely been considered in the context of miscarriages. We use our pipeline to analyze coding regions of the genome of ten miscarried euploid embryos to prioritize putatively detrimental variants in genes that are relevant for embryonic development.

Our pipeline prioritized 439 putatively causative single nucleotide polymorphisms among 11M variants discovered in the ten embryos. Through systematic investigation of all coding regions gp selected about 47 genes per embryo and by manual curation of the selected genes we highlight a few cases. Among them, we find three examples relevant to embryonic development. An hemizygous splice site mutation in one male embryo on STAG2, known in literature for its role in congenital and developmental disorders as well as in cancer^{32,34,35,36,37}. A missense mutation in TLE4, a gene that interacts with the genomic region on chromosome 9 genetically associated with miscarriages in a genome-wide study on mothers⁹. TLE4 appears to be a key gene in embryonic development, as it is expressed in both embryonic and extraembryonic tissues where it participates in the Wnt and Notch signalling pathways^43,44,45. Finally, a 4-bp haplotype in five embryos, containing a stop gain and a missense mutations in FMNL2, a gene involved in cell motility with a major role in driving cell migration^50,51. We speculate that the stop gain mutation truncates the protein well before the main functional domain of FMNL2, i.e. the domain that binds the actin filaments, therefore causing a complete loss of function of the protein product. In this study we focus on single nucleotide variants. gp combines functional information on variants and genes with population genomics and literature information to sift millions of variants in search for the relevant ones. This approach closes a gap as genetic analyses of miscarriages mostly focused on detecting chromosomal aneuploidies and large chromosomal aberrations (which explain less than half of the cases) leaving unexplored small size genetic variants, the most abundant type of genomic variation. To some extent small genetic variants have been considered in a number of cases that performed target resequencing of candidate genes^16,17, a valid but still not systematic approach because it does not fully exploit genomic information. gp filter both variants in genes known from literature to be associated with miscarriages, and variants never described before in this context but potentially highly damaging and in genes intolerant to loss of function. As a result, our approach is robust to both discovery of novel association and investigation of genes with known association to miscarriages, overcoming the major limitation of candidate genes studies.

Variant prioritization is done at an individual level. While we expect that the same gene might be the cause of multiple miscarriages, given our limited sample size we do not expect that the same exact mutation to cause the gene’s loss of function. Therefore, by filtering at the individual level gp accounts for inter-individual variation, i.e. the larger fraction of genomic variability, as well as for the occurrence of de novo mutations. Nevertheless, in five embryos gp selected the exact same combination of two linked alelles in FMNL2, showing that while it is individual-based gp is still capable of finding variants shared by more than one case. In our analysis we retain both heterozygous and hemizygous variants for two reasons. First, more than one variant might contribute to the trait in compound heterozygosis or in epistasis. Secondly, the mutation could be de-novo and produce a dominant phenotype, although parents are not available to verify this hypothesis at this stage.

Our pipeline is reproducible and easy to scale to larger studies and different phenotypes, like recurrent oocyte and early embryonic arrest in in vitro fertilization cycles⁵³. To improve robustness, it includes a control population to filter out genes that can be prioritized by chance. gp is suitable for cases where it is not possible to rely on an adequate number of samples to perform association analysis. The future integration of genomic information on parents (not available in this collection) will allow us to infer inheritance mechanisms and distinguish between de novo and recessive mutations, with implications for clinical applications in the case of causative recessive mutations in the parents. Collecting genomic information from larger families, with several miscarriages/live births from the same couple will also further increase the strength of mendelian segregation analysis and the true discovery rate.

In conclusion, this exploratory study demonstrates that filtering and prioritizing is effective in identifying genomic variants putatively responsible for miscarriages and provides indications and tools for developing a larger study. Possible strategies to follow up and validate the results reported here include replication in a different and larger cohort and experimental assays on molecular and cellular properties of the genes. It would be ideal to experimentally validate results in human derived organoids, however in vitro they do not progress until the post-implantation stage, that is the stage at which miscarriages happen. Compared to previous similar studies our work focuses on a systematic exploration of the genome that combines previous knowledge with hypothesis-free prioritization, making it robust not only to the discovery of mutations in genes known to be associated with miscarriage, but also in the identification of novel genes. While only providing a proof of concept study, we give indications about genes that can be used to test genetic predisposition to miscarriages in parents that are planning to conceive or particularly for recurrent miscarriage patients. In a wider context, the results of this study might be relevant for genetic counseling and risk management in miscarriages. Future development will include the extension of the analysis to non-coding regions and to structural variants, as well as the enrollment of trios to fully exploit parental information⁵⁴.

Methods

Embryo data and samples collection

The study protocol was examined by the Comitato Etico di Area Vasta Emilia Centro (CE-AVEC) of the Azienda Ospedaliero—Universitaria di Bologna Policlinico S. Orsola-Malpighi. The committee gave the ethical approval of the study (reference CE/FE 170475). All participants provided written informed consents before entering the study. Cases were recruited at the Unit of Obstetrics and Gynecology of the Sant’Anna University Hospital in Ferrara, Italy, from 2017 to 2018. The inclusion criteria were: age between 18 and 42 years and gestational age up to 24 weeks. Exclusion criterion was any clinical condition that could prevent full-term pregnancies. Known causes of pregnancy losses were excluded by standard diagnostic protocol including hysteroscopy, laparoscopy, ultrasound, karyotype analysis, detection of immunological risk factors (anti-cardiolipin, lupus anticoagulant, antinuclear antibodies) and hormonal status (gonadotrophins, FSH, LH, prolactin, thyroid hormones, thyroperoxidase) before inclusion in the study. Gestational weeks were calculated from the last menstrual period. Demographic, anthropometric and clinical data of cases, including obstetric history, family history of malformations, and periconceptional supplementation with folic acid, were anonymized and linked to biological samples by coding. We use as control set for demographic, anthropometric and clinical data a cohort of 148 women (17 Africans, 117 Europeans, 14 Asians) of age 18–46 years (mean age 29.77 years, median 28.8) undergoing voluntary termination of pregnancy within 12 weeks of gestational age. Samples form IVF were excluded and paternal data was not available.

DNA preparation and sequencing

Retained product of conception was removed from uterus using a suction curette, and chorionic villi (CV) were carefully dissected from decidual tissue. We used dry homogenization after exploring a range of possibilities (Fig.S1A). Genomic DNA was extracted from CV samples using QIAamp DNA Mini Kit (Ref: 51304, Qiagen) according to manufacturer’s protocol. This kit was chosen after considering the yield of two types of resin and one membrane (Fig. S1B). DNA was titrated using Qubit 2.0 Fluorometer (Life Technologies). Whole-genome sequencing of the genomic DNA extracted from chorionic villi was done through a service provider (Macrogen). In particular, libraries for sequencing were prepared using the Illumina TruSeq DNA PCR-free Library (insert size 350 bp) and samples were sequenced at 30 X mapped (110Gb) 150bp PE on HiSeqX.

Detection of chromosomal aneuploidies in embryos

A rapid screening of sex and aneuploidies for chromosomes 13, 15, 16, 18, 21, 22 and X was carried out on genomic DNA extracted from the chorionic villi performing five multiplex Quantitative Fluorescent PCR (QF-PCR) assays. QF-PCR assays were performed in a total volume of 25 μl containing 40–100 ng of genomic DNA, 10mM dNTP (Roche), 6-30 pmol final concentration of each primer, 1 × Fast taq polymerase buffer (15 mmol/l MgCl₂) (Roche), and 2.5 U of Fasta taq polymerase (Roche). QF-PCR conditions were as follows: denaturation at 95 °C for 10 min followed by 10 cycles consisting of melting at 95 °C for 1 min, annealing at 65 °C (− 1 °C / cycle) for 1 mins, and then extension at 72 °C for 40 s, then for 23 cycles at 95 °C for 1 min, 55 °C for 1 min, and 72 °C for 1 min. Final extension was for 10 min at 72 °C and at 60 °C for 60 min. Fluorescence-labelled QF-PCR products were electrophoresed in an CEQ 8000 Backman by combining 40 μl of Hi-Di Formamide and 0.5 μl of DNA size standard 400 (Backman); QF-PCR products were visualized and quantified as peak areas of each respective repeat lengths. In normal heterozygous subjects, the QF-PCR product of each STR should show two peaks with similar fluorescent activities and thus a ratio of peak areas close to 1:1 (ranging from 0.8 to 1.4:1). A trisomy is suspected when the ratio is above or below this range (peak area ratios less than 0.6 and greater then 1.8, trisomic diallelic pattern), otherwise there are three alleles of equal peak area with a ratio of 1:1:1 (trisomic triallelic). The presence of trisomic, triallelic or diallelic patterns for at least two different STRs on the same chromosome is considered as evidence of trisomy. Trisomic patterns observed for all chromosome-specific STRs are indicative of triploidy. Therefore accurate X chromosome dosage, to perform diagnosis of X monosomy, can be assessed by TAF9L marker. This gene has a high degree of sequence identity between chromosome 3 and chromosome X; primers on this gene amplify a 3 b.p. deletion generating a chromosome X specific product of 141 b.p. and a chromosome 3 specific product of 144 b.p. Maternal contamination was also checked by QF-PCR comparing the alleles found in miscarriages with those found in maternal blood.

Comparative Genomic Hybridization was carried out using the Agilent SurePrint G3 Human CGH Microarray. Samples underwent DNA quantification and quality analysis prior to be labeled and hybridized on the microarray. Following hybridization samples were washed and the chip was scanned at 3 microns using the Agilent SureScan Microarray Scanner. The LogRatio from the arrays were segmented into regions of estimated equal copy number using both the method implemented in the Agilent CytoGenomics V3.0.4 software, and the Penalized least square implemented in the R package Copynumber (PLS,⁵⁵). Classification as copy number of gains or losses (copy number variants) was done using as criteria at least five probes and Zscore < 0.0016 (SD*4)⁵⁶.

Statistical and sequence analyses

Data cleaning, refining, and analysis (summary statistics, hypothesis testing) were performed using R⁵⁷. Reads in the FASTQ file sequence data were aligned against the reference genome GRChg38.p12 using bwa⁵⁸ and samtools⁵⁹. Variant calling of SNP and INDEL was done using freebayes⁶⁰. Freebayes version v1.3.2 was used with the following default parameters: minimal count of alternate allele reads of 2, minimal fraction of alternate reads of 20%. The resulting VCF files were refined in further steps: vcffilter⁶¹ was used to filter variants for total read depth at the locus > 10 and quality score > 20, leaving only variants with estimated 99% probability of a polymorphic genotype call; vt⁶² was used to normalize variants and deconstruct multiallelic variants. Refined VCF files were compressed and indexed using samtools⁵⁹. Variants were annotated for functional effects and allele frequency in other populations using Variant Effect Predictor²⁷. Phasing was done using Beagle 5.1⁶³ under standard parameters.

Principal component analysis was done with PLINK⁶⁴ using 1,2 M autosomal SNPs.

The gp pipeline for variant prioritization is written in Python and R and the code is publicly available (https://github.com/SilviaBuonaiuto/gpPipeline). The manually curated list of genes associated with miscarriages (recurrent and spontaneous) was obtained through a comprehensive search of the published literature. We considered seven studies highlighting the association of genes with miscarriages^{4,5,9,19,20,65,66}. This compendium was further supplemented by genes from curated repositories such as Human Phenotype Ontology (HPO) [URL: https://hpo.jax.org/app/browse/term/HP:0200067 last accessed: 1/12/2020 11:01:00 PM⁶⁷] and DisGeNET [http://www.disgenet.org/search last accessed: 1/12/2020 11:12:00 PM⁶⁸]. The search terms used were “recurrent miscarriages”, “abortion”, “spontaneous abortion”, and “recurrent spontaneous abortion”. After filtering by removing the duplicates, combining the gene sets obtained from the literature and databases yielded a total of 608 unique genes (Supplementary Table 1). Additional information of genes such as HGNC symbol, HGNC ID, Gene Stable ID, Chromosomal coordinates (GRChg38), karyotype band, transcript count, protein stable ID were extracted from Ensembl Biomart⁶⁹.

Overrepresentation tests and protein classification were performed using the R package ReactomePA⁷⁰.

References

Larsen, E. C., Christiansen, O. B., Kolte, A. M. & Macklon, N. New insights into mechanisms behind miscarriage. BMC Med. 11, 154 (2013).
PubMed PubMed Central Google Scholar
Ammon Avalos, L., Galindo, C. & Li, D.-K. A systematic review to calculate background miscarriage rates using life table analysis. Birth Defects Res. A 94, 417–423 (2012).
CAS Google Scholar
Andersen, A.-M.N., Wohlfahrt, J., Christens, P., Olsen, J. & Melbye, M. Maternal age and fetal loss: Population based register linkage study. BMJ 320, 1708–1712 (2000).
PubMed Central Google Scholar
Pereza, N., Ostojić, S., Kapović, M. & Peterlin, B. Systematic review and meta-analysis of genetic association studies in idiopathic recurrent spontaneous abortion. Fertil. Steril. 107, 150–159 (2017).
PubMed Google Scholar
Quintero-Ronderos, P. et al. Novel genes and mutations in patients affected by recurrent pregnancy loss. PLoS ONE 12, e0186149 (2017).
PubMed PubMed Central Google Scholar
Robberecht, C., Schuddinck, V., Fryns, J.-P. & Vermeesch, J. R. Diagnosis of miscarriages by molecular karyotyping: Benefits and pitfalls. Genet. Med. 11, 646 (2009).
PubMed Google Scholar
Kudesia, R., Li, M., Smith, J., Patel, A. & Williams, Z. Rescue karyotyping: A case series of array-based comparative genomic hybridization evaluation of archival conceptual tissue. Reprod. Biol. Endocrinol. 12, 19 (2014).
PubMed PubMed Central Google Scholar
Mathur, N., Triplett, L. & Stephenson, M. D. Miscarriage chromosome testing: Utility of comparative genomic hybridization with reflex microsatellite analysis in preserved miscarriage tissue. Fertil. Steril. 101, 1349–1352 (2014).
CAS PubMed Google Scholar
Laisk, T. et al. The genetic architecture of sporadic and multiple consecutive miscarriage. Nat. Commun. 11, 1–12 (2020).
ADS Google Scholar
Rajcan-Separovic, E. Next generation sequencing in recurrent pregnancy loss-approaches and outcomes. Eur. J. Med. Genet. 63, 103644 (2020).
PubMed Google Scholar
Filges, I. & Friedman, J. M. Exome sequencing for gene discovery in lethal fetal disorders-harnessing the value of extreme phenotypes. Prenat. Diagn. 35, 1005–1009 (2015).
PubMed Google Scholar
Bondeson, M.-L. et al. A nonsense mutation in cep55 defines a new locus for a Meckel-like syndrome, an autosomal recessive lethal fetal ciliopathy. Clin. Genet. 92, 510–516 (2017).
CAS PubMed Google Scholar
Dohrn, N. et al. Ecel1 mutation causes fetal arthrogryposis multiplex congenita. Am. J. Med. Genet. A 167, 731–743 (2015).
CAS Google Scholar
Wilbe, M. et al. Musk: A new target for lethal fetal akinesia deformation sequence (fads). J. Med. Genet. 52, 195–202 (2015).
CAS PubMed Google Scholar
Cristofoli, F., De Keersmaecker, B., De Catte, L., Vermeesch, J. R. & Van Esch, H. Novel stil compound heterozygous mutations cause severe fetal microcephaly and centriolar lengthening. Mol. Syndromol. 8, 282–293 (2017).
CAS PubMed PubMed Central Google Scholar
Rae, W. et al. A novel foxp3 mutation causing fetal akinesia and recurrent male miscarriages. Clin. Immunol. 161, 284–285 (2015).
CAS PubMed Google Scholar
Thomas, S. et al. Tctn3 mutations cause Mohr–Majewski syndrome. Am. J. Hum. Genet. 91, 372–378 (2012).
CAS PubMed PubMed Central Google Scholar
Shamseldin, H. E. et al. Identification of embryonic lethal genes in humans by autozygosity mapping and exome sequencing in consanguineous families. Genome Biol. 16, 116 (2015).
PubMed PubMed Central Google Scholar
Qiao, Y. et al. Whole exome sequencing in recurrent early pregnancy loss. MHR Basic Sci. Reprod. Med. 22, 364–372 (2016).
CAS Google Scholar
Fu, M. et al. Whole-exome sequencing analysis of products of conception identifies novel mutations associated with missed abortion. Mol. Med. Rep. 18, 2027–2032 (2018).
CAS PubMed PubMed Central Google Scholar
Meier, N. et al. Exome sequencing of fetal anomaly syndromes: Novel phenotype-genotype discoveries. Eur. J. Hum. Genet. 27, 730–737 (2019).
CAS PubMed PubMed Central Google Scholar
Yates, C. L. et al. Whole-exome sequencing on deceased fetuses with ultrasound anomalies: Expanding our knowledge of genetic disease during fetal development. Genet. Med. 19, 1171–1178 (2017).
PubMed Google Scholar
Chen, Y. et al. Characterization of chromosomal abnormalities in pregnancy losses reveals critical genes and loci for human early development. Hum. Mutat. 38, 669–677 (2017).
CAS PubMed PubMed Central Google Scholar
Zhao, C. et al. Exome sequencing analysis on products of conception: A cohort study to evaluate clinical utility and genetic etiology for pregnancy loss. Genet. Med. 1, 1–8 (2020).
Google Scholar
Christiansen, O. B. et al. Eshre guideline: Recurrent pregnancy loss. Hum. Reprod. Open 2018, hyo004 (2018).
Google Scholar
van den Berg, M. M., van Maarle, M. C., van Wely, M. & Goddijn, M. Genetics of early miscarriage. Biochim. Biophys. Acta 1822, 1951–1959 (2012).
PubMed Google Scholar
McLaren, W. et al. The ensembl variant effect predictor. Genome Biol. 17, 122 (2016).
PubMed PubMed Central Google Scholar
Rentzsch, P., Witten, D., Cooper, G. M., Shendure, J. & Kircher, M. Cadd: Predicting the deleteriousness of variants throughout the human genome. Nucleic Acids Res. 47, D886–D894 (2019).
CAS PubMed Google Scholar
Karczewski, K. J. et al. The mutational constraint spectrum quantified from variation in 141,456 humans. Nature 581, 434–443 (2020).
CAS PubMed PubMed Central ADS Google Scholar
Consoddrtium, G. P. et al. A global reference for human genetic variation. Nature 526, 68–74 (2015).
ADS Google Scholar
Dawes, R., Lek, M. & Cooper, S. T. Gene discovery informatics toolkit defines candidate genes for unexplained infertility and prenatal or infantile mortality. NPJ Genom. Med. 4, 1–11 (2019).
CAS Google Scholar
Studdy, T. D. D. D. et al. Large-scale discovery of novel genetic causes of developmental disorders. Nature 519, 223–228 (2015).
ADS Google Scholar
Bergström, A. et al. Insights into human genetic variation and population history from 929 diverse genomes. Science 367, 10 (2020).
Google Scholar
Mullegama, S. V. et al. De novo loss-of-function variants in stag2 are associated with developmental delay, microcephaly, and congenital anomalies. Am. J. Med. Genet. A 173, 1319–1327 (2017).
CAS PubMed PubMed Central Google Scholar
Mullegama, S. V. et al. Mutations in stag2 cause an x-linked cohesinopathy associated with undergrowth, developmental delay, and dysmorphia: Expanding the phenotype in males. Mol. Genet. Genom. Med. 7, e00501 (2019).
Google Scholar
Aoi, H. et al. Nonsense variants of stag2 result in distinct congenital anomalies. Hum. Genome Variat. 7, 1–7 (2020).
Google Scholar
Solomon, D. A. et al. Mutational inactivation of stag2 causes aneuploidy in human cancer. Science 333, 1039–1043 (2011).
CAS PubMed PubMed Central ADS Google Scholar
Cuadrado, A. & Losada, A. Specialized functions of cohesins stag1 and stag2 in 3d genome architecture. Curr. Opin. Genet. Dev. 61, 9–16 (2020).
CAS PubMed Google Scholar
McNicoll, F., Stevense, M. & Jessberger, R. Cohesin in gametogenesis. In Current topics in developmental biology, vol. 102, 1–34 (Elsevier, 2013).
De Koninck, M. et al. Essential roles of cohesin stag2 in mouse embryonic development and adult tissue homeostasis. Cell Rep. 32, 108014 (2020).
PubMed Google Scholar
Brizio, C. et al. Over-expression in Escherichia coli and characterization of two recombinant isoforms of human fad synthetase. Biochem. Biophys. Res. Commun. 344, 1008–1016 (2006).
CAS PubMed Google Scholar
Balasubramaniam, S., Christodoulou, J. & Rahman, S. Disorders of riboflavin metabolism. J. Inherit. Metab. Dis. 42, 608–619 (2019).
CAS PubMed Google Scholar
Laing, A. F., Lowell, S. & Brickman, J. M. Gro/tle enables embryonic stem cell differentiation by repressing pluripotent gene expression. Dev. Biol. 397, 56–66 (2015).
CAS PubMed Google Scholar
Menchero, S. et al. Transitions in cell potency during early mouse development are driven by notch. Elife 8, e42930 (2019).
PubMed PubMed Central Google Scholar
Meinhardt, G. et al. Wnt-dependent t-cell factor-4 controls human etravillous trophoblast motility. Endocrinology 155, 1908–1920 (2014).
PubMed Google Scholar
Sonderegger, S., Pollheimer, J. & Knöfler, M. Wnt signalling in implantation, decidualisation and placental differentiation-review. Placenta 31, 839–847 (2010).
CAS PubMed PubMed Central Google Scholar
Zhao, Q. et al. Genome-wide association analysis reveals key genes responsible for egg production of lion head goose. Front. Genet. 10, 1391 (2020).
PubMed PubMed Central Google Scholar
Gardberg, M. et al. Characterization of diaphanous-related formin fmnl2 in human tissues. BMC Cell Biol. 11, 55 (2010).
PubMed PubMed Central Google Scholar
Lizio, M. et al. Gateways to the fantom5 promoter level mammalian expression atlas. Genome Biol. 16, 22 (2015).
CAS PubMed PubMed Central Google Scholar
Block, J. et al. Fmnl2 drives actin-based protrusion and migration downstream of cdc42. Curr. Biol. 22, 1005–1012 (2012).
CAS PubMed PubMed Central Google Scholar
Kühn, S. et al. The structure of fmnl2-cdc42 yields insights into the mechanism of lamellipodia and filopodia formation. Nat. Commun. 6, 1–14 (2015).
ADS Google Scholar
Zhu, X.-L. et al. Fmnl2 is a positive regulator of cell motility and metastasis in colorectal carcinoma. J. Pathol. 224, 377–388 (2011).
CAS PubMed Google Scholar
Capalbo, A. et al.. A standardized approach for case selection and genomic data analysis of maternal exomes for the diagnosis of oocyte maturation and early embryonic developmental arrest in IVF. medRxiv (2021).
Buonaiuto, S. et al. Prioritization of putatively detrimental variants in euploid miscarriages. medRxiv (2021).
Nilsen, G. et al. Copynumber: Efficient algorithms for single-and multi-track copy number segmentation. BMC Genom. 13, 591 (2012).
CAS Google Scholar
Vermeesch, J. R. et al. Molecular karyotyping: Array cgh quality criteria for constitutional genetic diagnosis. J. Histochem. Cytochem. 53, 413–422 (2005).
CAS PubMed Google Scholar
R Core Team. R: A Language and Environment for Statistical Computing. (R Foundation for Statistical Computing, 2019).
Li, H. Aligning sequence reads, clone sequences and assembly contigs with bwa-mem. arXiv:1303.3997 (2013).
Li, H. A statistical framework for snp calling, mutation discovery, association mapping and population genetical parameter estimation from sequencing data. Bioinformatics 27, 2987–2993 (2011).
CAS PubMed PubMed Central Google Scholar
Garrison, E. & Marth, G. Haplotype-based variant detection from short-read sequencing. arXiv:1207.3907 (2012).
Garrison, E. vcflib. https://github.com/vcflib/vcflib (2020).
Tan, A., Abecasis, G. R. & Kang, H. M. Unified representation of genetic variants. Bioinformatics 31, 2202–2204 (2015).
CAS PubMed PubMed Central Google Scholar
Browning, B. L., Zhou, Y. & Browning, S. R. A one-penny imputed genome from next-generation reference panels. Am. J. Hum. Genet. 103, 338–348 (2018).
CAS PubMed PubMed Central Google Scholar
Chang, C. C. et al. Second-generation plink: Rising to the challenge of larger and richer datasets. Gigascience 4, s13742-015 (2015).
Google Scholar
Colley, E. et al. Potential genetic causes of miscarriage in euploid pregnancies: A systematic review. Hum. Reprod. Update 25, 452–472 (2019).
CAS PubMed Google Scholar
Rull, K., Nagirnaja, L. & Laan, M. Genetics of recurrent miscarriage: Challenges, current knowledge, future directions. Front. Genet. 3, 34 (2012).
CAS PubMed PubMed Central Google Scholar
Peter N Robinson et al. The Human Phenotype Ontology: a tool for anno-tating and analyzing human hereditary disease. In: Am J Hum Genet. 83(5), 610–615 (2008).
Article CAS PubMed PubMed Central Google Scholar
Janet Piñero et al. DisGeNET: a discovery platform for the dynamical exploration of human diseases and their genes. In: Database 2015, bav028 (2015).
Article CAS PubMed PubMed Central Google Scholar
Kinsella, R. J. et al. Ensembl biomarts: A hub for data retrieval across taxonomic space. Database 2011 (2011).
Yu, G. & He, Q.-Y. Reactomepa: An r/bioconductor package for reactome pathway analysis and visualization. Mol. BioSyst. 12, 477–479 (2016).
CAS PubMed Google Scholar

Download references

Acknowledgements

We are thankful to all volunteers that were enrolled in the study, as well as all the medical personnel that contributed to the collection of samples. We are also thankful to Prof. Nicole Soranzo and Prof. Erik Garrison for help in the early stage of the project.

Funding

P.O.R. Campania FSE 2014-2020 and EMBO STF 7919 to V.C. The computational work has been executed on the IT resources of the ReCaS-Bari data center, which have been made available by two projects financed by the MIUR (Italian Ministry for Education, University and Re-search) in the PON Ricerca e Competitività 2007-2013” Program: ReCaS (Azione I - Interventi di rafforzamento strutturale, PONa3_00052, Avviso 254/Ric) and PRISMA (Asse II - Sostegno all’innovazione, PON04a2_A).

Author information

These authors contributed equally: Silvia Buonaiuto and Immacolata Di Biase.

Authors and Affiliations

University of Campania Luigi Vanvitelli, Naples, 81100, Italy
Silvia Buonaiuto
MeriGen Research, Naples, 80131, Italy
Immacolata Di Biase, Palmira D’Ambrosio, Gabriella Esposito & Sebastiano Di Biase
Department of Neurosciences and Rehabilitation, University of Ferrara, Ferrara, 44121, Italy
Valentina Aleotti, Amin Ravaei & Michele Rubini
Igenomix Italy, Marostica, 36063, Italy
Adriano De Marino & Antonio Capalbo
University of Naples Federico II, Naples, Italy
Gianluca Damaggio
Fondazione Bruno Kessler, DSH Lab, Trento, 38123, Italy
Marco Chierici
Monash University Malaysia Genomics Facility, Tropical Medicine and Biology Multidisciplinary Platform, 47500, Bandar Sunway, Malaysia
Madhuri Pulijala & Qasim Ayub
School of Science, Monash University Malaysia, 47500, Bandar Sunway, Malaysia
Madhuri Pulijala & Qasim Ayub
HK3 Lab, Rovereto, 38068, Italy
Cesare Furlanello
Department of Medical Sciences, University of Ferrara, Ferrara, 44121, Italy
Pantaleo Greco
Institute of Genetics and Biophysics, National Research Council, Naples, 80111, Italy
Vincenza Colonna

Authors

Silvia Buonaiuto
View author publications
You can also search for this author in PubMed Google Scholar
Immacolata Di Biase
View author publications
You can also search for this author in PubMed Google Scholar
Valentina Aleotti
View author publications
You can also search for this author in PubMed Google Scholar
Amin Ravaei
View author publications
You can also search for this author in PubMed Google Scholar
Adriano De Marino
View author publications
You can also search for this author in PubMed Google Scholar
Gianluca Damaggio
View author publications
You can also search for this author in PubMed Google Scholar
Marco Chierici
View author publications
You can also search for this author in PubMed Google Scholar
Madhuri Pulijala
View author publications
You can also search for this author in PubMed Google Scholar
Palmira D’Ambrosio
View author publications
You can also search for this author in PubMed Google Scholar
Gabriella Esposito
View author publications
You can also search for this author in PubMed Google Scholar
Qasim Ayub
View author publications
You can also search for this author in PubMed Google Scholar
Cesare Furlanello
View author publications
You can also search for this author in PubMed Google Scholar
Pantaleo Greco
View author publications
You can also search for this author in PubMed Google Scholar
Antonio Capalbo
View author publications
You can also search for this author in PubMed Google Scholar
Michele Rubini
View author publications
You can also search for this author in PubMed Google Scholar
Sebastiano Di Biase
View author publications
You can also search for this author in PubMed Google Scholar
Vincenza Colonna
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

Q.A., A.C., S.D.B., and V.C. conceived and designed the study. S.B., A.D.M., G.D.M., M.P., and V.C. wrote the code and performed the bioinformatics analyses. M.C. and C.F. provided support to bioinformatics analyses. V.A., A.R., I.D.B., P.D.A., and G.E. performed the experiments. P.G., and M.R. contributed to the clinical samples and collected clinical data. S.B. and V.C. wrote the manuscript. All authors critically reviewed the manuscript and approved the final version.

Corresponding author

Correspondence to Vincenza Colonna.

Ethics declarations

Competing interests

A.C. is a full time employee of Igenomix. A.D.M. was employee of Igenomix while working on this project. I.D.B., P.D.A., G.E., S.D.B. are full time employees of the MeriGen Research. All other authors declare that they have no conflicts of interest.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Supplementary Figures.

Supplementary Tables.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Buonaiuto, S., Biase, I.D., Aleotti, V. et al. Prioritization of putatively detrimental variants in euploid miscarriages. Sci Rep 12, 1997 (2022). https://doi.org/10.1038/s41598-022-05737-3

Download citation

Received: 22 April 2021
Accepted: 11 January 2022
Published: 07 February 2022
DOI: https://doi.org/10.1038/s41598-022-05737-3

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Prioritization of putatively detrimental variants in euploid miscarriages

Subjects

Abstract

Similar content being viewed by others

The genetic architecture of sporadic and multiple consecutive miscarriage

Expanded clinical validation of Haploseek for comprehensive preimplantation genetic testing

Whole-genome risk prediction of common diseases in human preimplantation embryos

Introduction

Results

Prioritization of genetic variants in coding genomic regions

Properties and biological significance of the prioritized variants and genes

Mutations in STAG2, FLAD1, TLE4, FRMPD3, and FMNL2 in the embryos

Discussion

Methods

Embryo data and samples collection

DNA preparation and sequencing

Detection of chromosomal aneuploidies in embryos

Statistical and sequence analyses

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Additional information

Publisher's note

Supplementary Information

Supplementary Figures.

Supplementary Tables.

Rights and permissions

About this article

Cite this article

Comments

Search

Quick links

Subjects

Abstract

Similar content being viewed by others

The genetic architecture of sporadic and multiple consecutive miscarriage

Expanded clinical validation of Haploseek for comprehensive preimplantation genetic testing

Whole-genome risk prediction of common diseases in human preimplantation embryos

Introduction

Results

Prioritization of genetic variants in coding genomic regions

Properties and biological significance of the prioritized variants and genes

Mutations in STAG2, FLAD1, TLE4, FRMPD3, and FMNL2 in the embryos

Discussion

Methods

Embryo data and samples collection

DNA preparation and sequencing

Detection of chromosomal aneuploidies in embryos

Statistical and sequence analyses

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Additional information

Publisher's note

Supplementary Information

Supplementary Figures.

Supplementary Tables.

Rights and permissions

About this article

Cite this article

Share this article

Comments

Search

Quick links