Centre National de Recherche en Génomique Humaine

Facility · Évry-Courcouronnes, Île-de-France, France

Research output, citation impact, and the most-cited recent papers from Centre National de Recherche en Génomique Humaine (France). Aggregated across the NobleBlocks index of 300M+ scholarly works.

Total works

1.5K

Citations

410.2K

h-index

250

i10-index

2.1K

Also known as

Centre National de Recherche en Génomique Humaine; National Center of Human Genomics Research

Top-cited papers from Centre National de Recherche en Génomique Humaine

A map of human genome variation from population-scale sequencing

Richard M. Durbin, John Burton, David M. Carter, Carol Churcher +4 more

2010 · Nature · 8.1K citations · doi:10.1038/nature09534

The 1000 Genomes Project aims to provide a deep characterization of human genome sequence variation as a foundation for investigating the relationship between genotype and phenotype. Here we present results of the pilot phase of the project, designed to develop and compare different strategies for genome-wide sequencing with high-throughput platforms. We undertook three projects: low-coverage whole-genome sequencing of 179 individuals from four populations; high-coverage sequencing of two mother–father–child trios; and exon-targeted sequencing of 697 individuals from seven populations. We describe the location, allele frequency and local haplotype structure of approximately 15 million single nucleotide polymorphisms, 1 million short insertions and deletions, and 20,000 structural variants, most of which were previously undescribed. We show that, because we have catalogued the vast majority of common variation, over 95% of the currently accessible variants found in any individual are present in this data set. On average, each person is found to carry approximately 250 to 300 loss-of-function variants in annotated genes and 50 to 100 variants previously implicated in inherited disorders. We demonstrate how these results can be used to inform association and functional studies. From the two trios, we directly estimate the rate of de novo germline base substitution mutations to be approximately 10⁻⁸ per base pair per generation. We explore the data with regard to signatures of natural selection, and identify a marked reduction of genetic variation in the neighbourhood of genes, due to selection at linked sites. These methods and public data will support the next phase of human genetic research.

This issue of Nature contains the first publication from The 1000 Genomes Project, an international collaboration that will produce an extensive public catalogue of human genetic variation. The plan, in fact, is to sequence about 2,000 unidentified individuals from 20 populations around the world. This first paper presents the results from the project's pilot phase, testing three different strategies for genome-wide sequencing with high-throughput platforms: low-coverage whole-genome sequencing of 179 individuals in three population groups, high-coverage sequencing of two mother–father–child trios, and exon-targeted sequencing of 697 individuals from seven populations.

The goal of the 1000 Genomes Project is to provide in-depth information on variation in human genome sequences. In the pilot phase reported here, different strategies for genome-wide sequencing, using high-throughput sequencing platforms, were developed and compared. The resulting data set includes more than 95% of the currently accessible variants found in any individual, and can be used to inform association and functional studies.

Early allopolyploid evolution in the post-Neolithic Brassica napus oilseed genome

Boulos Chalhoub, France Denœud, Shengyi Liu, Isobel A. P. Parkin +4 more

2014 · Science · 2.6K citations · doi:10.1126/science.1253435

Oilseed rape (Brassica napus L.) was formed ~7500 years ago by hybridization between B. rapa and B. oleracea, followed by chromosome doubling, a process known as allopolyploidy. Together with more ancient polyploidizations, this conferred an aggregate 72× genome multiplication since the origin of angiosperms and high gene content. We examined the B. napus genome and the consequences of its recent duplication. The constituent Aₙ and Cₙ subgenomes are engaged in subtle structural, functional, and epigenetic cross-talk, with abundant homeologous exchanges. Incipient gene loss and expression divergence have begun. Selection in B. napus oilseed types has accelerated the loss of glucosinolate genes, while preserving expansion of oil biosynthesis genes. These processes provide insights into allopolyploid evolution and its relationship with crop domestication and improvement.

New insights into the genetic etiology of Alzheimer’s disease and related dementias

Céline Bellenguez, Fahri Küçükali, Iris E. Jansen, Luca Kleineidam +4 more

2022 · Nature Genetics · 2.5K citations · doi:10.1038/s41588-022-01024-z

Characterization of the genetic landscape of Alzheimer's disease (AD) and related dementias (ADD) provides a unique opportunity for a better understanding of the associated pathophysiological processes. We performed a two-stage genome-wide association study totaling 111,326 clinically diagnosed/'proxy' AD cases and 677,663 controls. We found 75 risk loci, of which 42 were new at the time of analysis. Pathway enrichment analyses confirmed the involvement of amyloid/tau pathways and highlighted microglia implication. Gene prioritization in the new loci identified 31 genes that were suggestive of new genetically associated processes, including the tumor necrosis factor alpha pathway through the linear ubiquitin chain assembly complex. We also built a new genetic risk score associated with the risk of future AD/dementia or progression from mild cognitive impairment to AD/dementia. The improvement in prediction led to a 1.6- to 1.9-fold increase in AD risk from the lowest to the highest decile, in addition to effects of age and the APOE ε4 allele.

International network of cancer genome projects

Jennifer L. Jennings, Thomas J. Hudson, Arek Kasprzyk, John D. McPherson +4 more

2010 · Nature · 2.4K citations · doi:10.1038/nature08987

Hundreds of individual human cancer genome sequences are expected to be published in 2010, and thousands per year after that. The International Cancer Genome Consortium (ICGC) was launched with the aim of keeping track of the data relating to large-scale cancer genome studies of all major cancers in adults and children, a total of 50 different cancer types and/or subtypes. In this issue the ICGC team (http://www.icgc.org) spells out the policies and planning for the project.

The International Cancer Genome Consortium (ICGC) was launched to coordinate large-scale cancer genome studies in tumours from 50 different cancer types and/or subtypes that are of clinical and societal importance across the globe. Systematic studies of more than 25,000 cancer genomes at the genomic, epigenomic and transcriptomic levels will reveal the repertoire of oncogenic mutations, uncover traces of the mutagenic influences, define clinically relevant subtypes for prognosis and therapeutic management, and enable the development of new cancer therapies.

Analysis of shared heritability in common disorders of the brain

Verneri Anttila, Brendan Bulik‐Sullivan, Hilary K. Finucane, Raymond K. Walters +4 more

2018 · Science · 2.0K citations · doi:10.1126/science.aap8757

Disorders of the brain can exhibit considerable epidemiological comorbidity and often share symptoms, provoking debate about their etiologic overlap. We quantified the genetic sharing of 25 brain disorders from genome-wide association studies of 265,218 patients and 784,643 control participants and assessed their relationship to 17 phenotypes from 1,191,588 individuals. Psychiatric disorders share common variant risk, whereas neurological disorders appear more distinct from one another and from the psychiatric disorders. We also identified significant sharing between disorders and a number of brain phenotypes, including cognitive measures. Further, we conducted simulations to explore how statistical power, diagnostic misclassification, and phenotypic heterogeneity affect genetic correlations. These results highlight the importance of common genetic variation as a risk factor for brain disorders and the value of heritability-based methods in understanding their etiology.

The banana (Musa acuminata) genome and the evolution of monocotyledonous plants

Angélique D’Hont, France Denœud, Jean‐Marc Aury, Franc‐Christophe Baurens +4 more

2012 · Nature · 1.2K citations · doi:10.1038/nature11241

The sequencing and analysis of the banana genome is reported; these results inform plant phylogenetic relationships and genome evolution, and provide a resource for future genetic improvement of this important crop species.

Bananas (Musa spp.) are a staple food and a major source of income in many tropical and subtropical countries. This paper reports the sequencing and analysis of the banana genome. This is the first non-grass monocotyledon to have its genome sequenced, providing an important bridge for comparative genome analysis in plants. Global banana production is under threat from increasingly well-adapted pests and diseases, so the availability of the genome sequence is an important resource for future crop development and improvement.

Bananas (Musa spp.), including dessert and cooking types, are giant perennial monocotyledonous herbs of the order Zingiberales, a sister group to the well-studied Poales, which include cereals. Bananas are vital for food security in many tropical and subtropical countries and the most popular fruit in industrialized countries. The Musa domestication process started some 7,000 years ago in Southeast Asia. It involved hybridizations between diverse species and subspecies, fostered by human migrations, and selection of diploid and triploid seedless, parthenocarpic hybrids thereafter widely dispersed by vegetative propagation. Half of the current production relies on somaclones derived from a single triploid genotype (Cavendish). Pests and diseases have gradually become adapted, representing an imminent danger for global banana production. Here we describe the draft sequence of the 523-megabase genome of a Musa acuminata doubled-haploid genotype, providing a crucial stepping-stone for genetic improvement of banana. We detected three rounds of whole-genome duplications in the Musa lineage, independently of those previously described in the Poales lineage and the one we detected in the Arecales lineage. This first monocotyledon high-continuity whole-genome sequence reported outside Poales represents an essential bridge for comparative genome analysis in plants. As such, it clarifies commelinid-monocotyledon phylogenetic relationships, reveals Poaceae-specific features and has led to the discovery of conserved non-coding sequences predating monocotyledon–eudicotyledon divergence.

A Nitrospira metagenome illuminates the physiology and evolution of globally important nitrite-oxidizing bacteria

Sebastian Lücker, Michael Wagner, Frank Maixner, Éric Pelletier +4 more

2010 · Proceedings of the National Academy of Sciences · 811 citations · doi:10.1073/pnas.1003860107

Nitrospira are barely studied and mostly uncultured nitrite-oxidizing bacteria, which are, according to molecular data, among the most diverse and widespread nitrifiers in natural ecosystems and biological wastewater treatment. Here, environmental genomics was used to reconstruct the complete genome of "Candidatus Nitrospira defluvii" from an activated sludge enrichment culture. On the basis of this first-deciphered Nitrospira genome and of experimental data, we show that Ca. N. defluvii differs dramatically from other known nitrite oxidizers in the key enzyme nitrite oxidoreductase (NXR), in the composition of the respiratory chain, and in the pathway used for autotrophic carbon fixation, suggesting multiple independent evolution of chemolithoautotrophic nitrite oxidation. Adaptations of Ca. N. defluvii to substrate-limited conditions include an unusual periplasmic NXR, which is constitutively expressed, and pathways for the transport, oxidation, and assimilation of simple organic compounds that allow a mixotrophic lifestyle. The reverse tricarboxylic acid cycle as the pathway for CO2 fixation and the lack of most classical defense mechanisms against oxidative stress suggest that Nitrospira evolved from microaerophilic or even anaerobic ancestors. Unexpectedly, comparative genomic analyses indicate functionally significant lateral gene-transfer events between the genus Nitrospira and anaerobic ammonium-oxidizing planctomycetes, which share highly similar forms of NXR and other proteins reflecting that two key processes of the nitrogen cycle are evolutionarily connected.

The genome of Theobroma cacao

Xavier Argout, Jérôme Salse, Jean‐Marc Aury, Mark J. Guiltinan +4 more

2010 · Nature Genetics · 779 citations · doi:10.1038/ng.736

Xavier Argout and colleagues report the draft genome of Theobroma cacao, the tropical crop that is the source of chocolate. The sequence assembly covers approximately 80% of the genome. We sequenced and assembled the draft genome of Theobroma cacao, an economically important tropical-fruit tree crop that is the source of chocolate. This assembly corresponds to 76% of the estimated genome size and contains almost all previously described genes, with 82% of these genes anchored on the 10 T. cacao chromosomes. Analysis of this sequence information highlighted specific expansion of some gene families during evolution, for example, flavonoid-related genes. It also provides a major source of candidate genes for T. cacao improvement. Based on the inferred paleohistory of the T. cacao genome, we propose an evolutionary scenario whereby the ten T. cacao chromosomes were shaped from an ancestor through eleven chromosome fusions.

Meta-analysis of SHANK Mutations in Autism Spectrum Disorders: A Gradient of Severity in Cognitive Impairments

Claire S. Leblond, Caroline Nava, Anne Polge, Julie Gauthier +4 more

2014 · PLoS Genetics · 667 citations · doi:10.1371/journal.pgen.1004580

SHANK genes code for scaffold proteins located at the post-synaptic density of glutamatergic synapses. In neurons, SHANK2 and SHANK3 have a positive effect on the induction and maturation of dendritic spines, whereas SHANK1 induces the enlargement of spine heads. Mutations in SHANK genes have been associated with autism spectrum disorders (ASD), but their prevalence and clinical relevance remain to be determined. Here, we performed a new screen and a meta-analysis of SHANK copy-number and coding-sequence variants in ASD. Copy-number variants were analyzed in 5,657 patients and 19,163 controls, coding-sequence variants were ascertained in 760 to 2,147 patients and 492 to 1,090 controls (depending on the gene), and individuals carrying de novo or truncating SHANK mutations underwent an extensive clinical investigation. Copy-number variants and truncating mutations in SHANK genes were present in ∼1% of patients with ASD: mutations in SHANK1 were rare (0.04%) and present in males with normal IQ and autism; mutations in SHANK2 were present in 0.17% of patients with ASD and mild intellectual disability; mutations in SHANK3 were present in 0.69% of patients with ASD and up to 2.12% of the cases with moderate to profound intellectual disability. In summary, mutations of the SHANK genes were detected in the whole spectrum of autism with a gradient of severity in cognitive impairment. Given the rare frequency of SHANK1 and SHANK2 deleterious mutations, the clinical relevance of these genes remains to be ascertained. In contrast, the frequency and the penetrance of SHANK3 mutations in individuals with ASD and intellectual disability (more than 1 in 50) warrant its consideration for mutation screening in clinical practice.

Stroke genetics informs drug discovery and risk prediction across ancestries

Aniket Mishra, Rainer Malik, Tsuyoshi Hachiya, Tuuli Jürgenson +4 more

2022 · Nature · 611 citations · doi:10.1038/s41586-022-05165-3

Previous genome-wide association studies (GWASs) of stroke (the second leading cause of death worldwide) were conducted predominantly in populations of European ancestry. Here, in cross-ancestry GWAS meta-analyses of 110,182 patients who have had a stroke (five ancestries, 33% non-European) and 1,503,898 control individuals, we identify association signals for stroke and its subtypes at 89 (61 new) independent loci: 60 in primary inverse-variance-weighted analyses and 29 in secondary meta-regression and multitrait analyses. On the basis of internal cross-ancestry validation and an independent follow-up in 89,084 additional cases of stroke (30% non-European) and 1,013,843 control individuals, 87% of the primary stroke risk loci and 60% of the secondary stroke risk loci were replicated (P < 0.05). Effect sizes were highly correlated across ancestries. Cross-ancestry fine-mapping, in silico mutagenesis analysis, and transcriptome-wide and proteome-wide association analyses revealed putative causal genes (such as SH3PXD2A and FURIN) and variants (such as at GRK5 and NOS3). Using a three-pronged approach, we provide genetic evidence for putative drug effects, highlighting F11, KLKB1, PROC, GP1BA, LAMC2 and VCAM1 as possible targets, with drugs already under investigation for stroke for F11 and PROC. A polygenic score integrating cross-ancestry and ancestry-specific stroke GWASs with vascular-risk factor GWASs (integrative polygenic scores) strongly predicted ischaemic stroke in populations of European, East Asian and African ancestry. Stroke genetic risk scores were predictive of ischaemic stroke independent of clinical risk factors in 52,600 clinical-trial participants with cardiometabolic disease. Our results provide insights to inform biology, reveal potential drug targets and derive genetic risk prediction tools across ancestries.

Novel Crohn Disease Locus Identified by Genome-Wide Association Maps to a Gene Desert on 5p13.1 and Modulates Expression of PTGER4

Cécile Libioulle, Édouard Louis, Sarah Hansoul, Cynthia Sandor +4 more

2007 · PLoS Genetics · 563 citations · doi:10.1371/journal.pgen.0030058

To identify novel susceptibility loci for Crohn disease (CD), we undertook a genome-wide association study with more than 300,000 SNPs characterized in 547 patients and 928 controls. We found three chromosome regions that provided evidence of disease association with p-values between 10⁻⁶ and 10⁻⁹. Two of these (IL23R on Chromosome 1 and CARD15 on Chromosome 16) correspond to genes previously reported to be associated with CD. In addition, a 250-kb region of Chromosome 5p13.1 was found to contain multiple markers with strongly suggestive evidence of disease association (including four markers with p < 10⁻⁷). We replicated the results for 5p13.1 by studying 1,266 additional CD patients, 559 additional controls, and 428 trios. Significant evidence of association (p < 4 × 10⁻⁴) was found in case/control comparisons with the replication data, while associated alleles were over-transmitted to affected offspring (p < 0.05), thus confirming that the 5p13.1 locus contributes to CD susceptibility. The CD-associated 250-kb region was saturated with 111 SNP markers. Haplotype analysis supports a complex locus architecture with multiple variants contributing to disease susceptibility. The novel 5p13.1 CD locus is contained within a 1.25-Mb gene desert. We present evidence that disease-associated alleles correlate with quantitative expression levels of the prostaglandin receptor EP4, PTGER4, the gene that resides closest to the associated region. Our results identify a major new susceptibility locus for CD, and suggest that genetic variants associated with disease risk at this locus could modulate cis-acting regulatory elements of PTGER4.

Transcriptome and methylome profiling reveals relics of genome dominance in the mesopolyploid Brassica oleracea

Isobel A. P. Parkin, ChuShin Koh, Haibao Tang, Stephen J. Robinson +4 more

2014 · Genome Biology · 523 citations · doi:10.1186/gb-2014-15-6-r77

BACKGROUND: Brassica oleracea is a valuable vegetable species that has contributed to human health and nutrition for hundreds of years and comprises multiple distinct cultivar groups with diverse morphological and phytochemical attributes. In addition to this phenotypic wealth, B. oleracea offers unique insights into polyploid evolution, as it results from multiple ancestral polyploidy events and a final Brassiceae-specific triplication event. Further, B. oleracea represents one of the diploid genomes that formed the economically important allopolyploid oilseed, Brassica napus. A deeper understanding of B. oleracea genome architecture provides a foundation for crop improvement strategies throughout the Brassica genus.

RESULTS: We generate an assembly representing 75% of the predicted B. oleracea genome using a hybrid Illumina/Roche 454 approach. Two dense genetic maps are generated to anchor almost 92% of the assembled scaffolds to nine pseudo-chromosomes. Over 50,000 genes are annotated and 40% of the genome predicted to be repetitive, thus contributing to the increased genome size of B. oleracea compared to its close relative B. rapa. A snapshot of both the leaf transcriptome and methylome allows comparisons to be made across the triplicated sub-genomes, which resulted from the most recent Brassiceae-specific polyploidy event.

CONCLUSIONS: Differential expression of the triplicated syntelogs and cytosine methylation levels across the sub-genomes suggest residual marks of the genome dominance that led to the current genome architecture. Although cytosine methylation does not correlate with individual gene dominance, the independent methylation patterns of triplicated copies suggest epigenetic mechanisms play a role in the functional diversification of duplicate genes.

Genome sequence of the cyanobacterium Prochlorococcus marinus SS120, a nearly minimal oxyphototrophic genome

Alexis Dufresne, Marcel Salanoubat, Frédéric Partensky, François Artiguenave +4 more

2003 · Proceedings of the National Academy of Sciences · 471 citations · doi:10.1073/pnas.1733211100

Prochlorococcus marinus, the dominant photosynthetic organism in the ocean, is found in two main ecological forms: high-light-adapted genotypes in the upper part of the water column and low-light-adapted genotypes at the bottom of the illuminated layer. P. marinus SS120, the complete genome sequence reported here, is an extremely low-light-adapted form. The genome of P. marinus SS120 is composed of a single circular chromosome of 1,751,080 bp with an average G+C content of 36.4%. It contains 1,884 predicted protein-coding genes with an average size of 825 bp, a single rRNA operon, and 40 tRNA genes. Together with the 1.66-Mbp genome of P. marinus MED4, the genome of P. marinus SS120 is one of the two smallest genomes of a photosynthetic organism known to date. It lacks many genes that are involved in photosynthesis, DNA repair, solute uptake, intermediary metabolism, motility, phototaxis, and other functions that are conserved among other cyanobacteria. Systems of signal transduction and environmental stress response show a particularly drastic reduction in the number of components, even taking into account the small size of the SS120 genome. In contrast, housekeeping genes, which encode enzymes of amino acid, nucleotide, cofactor, and cell wall biosynthesis, are all present. Because of its remarkable compactness, the genome of P. marinus SS120 might approximate the minimal gene complement of a photosynthetic organism.

Single nucleotide polymorphisms in TNFSF15 confer susceptibility to Crohn's disease

Keiko Yamazaki, Dermot McGovern, Jiannis Ragoussis, Marta Paolucci +4 more

2005 · Human Molecular Genetics · 467 citations · doi:10.1093/hmg/ddi379

The inflammatory bowel diseases (IBDs), Crohn's disease (CD) and ulcerative colitis, are chronic inflammatory disorders of the digestive tract. The pathogenesis of IBD is complicated, and it is widely accepted that immunologic, environmental and genetic components contribute to its etiology. To identify genetic susceptibility factors in CD, we performed a genome-wide association study in Japanese patients and controls using nearly 80,000 gene-based single nucleotide polymorphism (SNP) markers and investigated the haplotype structure of the candidate locus in Japanese and European patients. We identified highly significant associations (P = 1.71 × 10⁻¹⁴ with odds ratio of 2.17) of SNPs and haplotypes within the TNFSF15 gene (encoding tumor necrosis factor superfamily, member 15) in Japanese CD patients. The association was confirmed in the study of two European IBD cohorts. Interestingly, a core TNFSF15 haplotype showing association with increased risk of the disease was common in the two ethnic groups. Our results suggest that the genetic variations in the TNFSF15 gene contribute to the susceptibility to IBD in the Japanese and European populations.

Genetic and Functional Analyses of SHANK2 Mutations Suggest a Multiple Hit Model of Autism Spectrum Disorders

Claire S. Leblond, Jutta Heinrich, Richard Delorme, Christian Proepper +4 more

2012 · PLoS Genetics · 444 citations · doi:10.1371/journal.pgen.1002521

Autism spectrum disorders (ASD) are a heterogeneous group of neurodevelopmental disorders with a complex inheritance pattern. While many rare variants in synaptic proteins have been identified in patients with ASD, little is known about their effects at the synapse and their interactions with other genetic variations. Here, following the discovery of two de novo SHANK2 deletions by the Autism Genome Project, we identified a novel 421 kb de novo SHANK2 deletion in a patient with autism. We then sequenced SHANK2 in 455 patients with ASD and 431 controls and integrated these results with those reported by Berkel et al. 2010 (n = 396 patients and n = 659 controls). We observed a significant enrichment of variants affecting conserved amino acids in 29 of 851 (3.4%) patients and in 16 of 1,090 (1.5%) controls (P = 0.004, OR = 2.37, 95% CI = 1.23-4.70). In neuronal cell cultures, the variants identified in patients were associated with a reduced synaptic density at dendrites compared to the variants only detected in controls (P = 0.0013). Interestingly, the three patients with de novo SHANK2 deletions also carried inherited CNVs at 15q11-q13 previously associated with neuropsychiatric disorders. In two cases, the nicotinic receptor CHRNA7 was duplicated and in one case the synaptic translation repressor CYFIP1 was deleted. These results strengthen the role of synaptic gene dysfunction in ASD but also highlight the presence of putative modifier genes, which is in keeping with the "multiple hit model" for ASD. A better knowledge of these genetic interactions will be necessary to understand the complex inheritance pattern of ASD.

The complete genome sequence of Lactobacillus bulgaricus reveals extensive and ongoing reductive evolution

Maarten van de Guchte, Stéphanie Penaud, Christine Grimaldi, Valérie Barbe +4 more

2006 · Proceedings of the National Academy of Sciences · 416 citations · doi:10.1073/pnas.0603024103

Lactobacillus delbrueckii ssp. bulgaricus (L. bulgaricus) is a representative of the group of lactic acid-producing bacteria, mainly known for its worldwide application in yogurt production. The genome sequence of this bacterium has been determined and shows the signs of ongoing specialization, with a substantial number of pseudogenes and incomplete metabolic pathways and relatively few regulatory functions. Several unique features of the L. bulgaricus genome support the hypothesis that the genome is in a phase of rapid evolution. (i) Exceptionally high numbers of rRNA and tRNA genes with regard to genome size may indicate that the L. bulgaricus genome has known a recent phase of important size reduction, in agreement with the observed high frequency of gene inactivation and elimination; (ii) a much higher GC content at codon position 3 than expected on the basis of the overall GC content suggests that the composition of the genome is evolving toward a higher GC content; and (iii) the presence of a 47.5-kbp inverted repeat in the replication termination region, an extremely rare feature in bacterial genomes, may be interpreted as a transient stage in genome evolution. The results indicate the adaptation of L. bulgaricus from a plant-associated habitat to the stable protein and lactose-rich milk environment through the loss of superfluous functions and protocooperation with Streptococcus thermophilus.

Grinding up Wheat: A Massive Loss of Nucleotide Diversity Since Domestication

Annabelle Haudry, Alberto Cenci, Catherine Ravel, Thomas Bataillon +4 more

2007 · Molecular Biology and Evolution · 412 citations · doi:10.1093/molbev/msm077

Several demographic and selective events occurred during the domestication of wheat from the allotetraploid wild emmer (Triticum turgidum ssp. dicoccoides). Cultivated wheat has since been affected by other historical events. We analyzed nucleotide diversity at 21 loci in a sample of 101 individuals representing 4 taxa corresponding to representative steps in the recent evolution of wheat (wild, domesticated, cultivated durum, and bread wheats) to unravel the evolutionary history of cultivated wheats and to quantify its impact on genetic diversity. Sequence relationships are consistent with a single domestication event and identify 2 genetically different groups of bread wheat. The wild group is not highly polymorphic, with only 212 polymorphic sites among the 21,720 bp sequenced, and, during domestication, diversity was further reduced in cultivated forms (by 69% in bread wheat and 84% in durum wheat), with considerable differences between loci, some retaining no polymorphism at all. Coalescent simulations were performed and compared with our data to estimate the intensity of the bottlenecks associated with domestication and subsequent selection. Based on our 21-locus analysis, the average intensity of domestication bottleneck was estimated at about 3, giving a population size for the domesticated form about one third that of wild dicoccoides. The most severe bottleneck, with an intensity of about 6, occurred in the evolution of durum wheat. We investigated whether some of the genes departed from the empirical distribution of most loci, suggesting that they might have been selected during domestication or breeding. We detected a departure from the null model of demographic bottleneck for the hypothetical gene HgA. However, the atypical pattern of polymorphism at this locus might reveal selection on the linked locus Gsp1A, which may affect grain softness, an important trait for end-use quality in wheat.

Genomic evidence for ameiotic evolution in the bdelloid rotifer Adineta vaga

Jean‐François Flot, Boris Hespeels, Xiang Li, Benjamin Noël +4 more

2013 · Nature · 411 citations · doi:10.1038/nature12326

The genome of the asexual rotifer Adineta vaga lacks homologous chromosomes; instead, its allelic regions are rearranged and sometimes found on the same chromosome in a palindromic fashion, a structure reminiscent of the primate Y chromosome and of other mitotic lineages such as cancer cells.

Bdelloid rotifers are thought to have persisted and diversified asexually for millions of years, which is odd because loss of sexual reproduction is widely considered to be an evolutionary dead end for metazoans. The suspicion remained that they might engage in sex on rare occasions. But here Olivier Jaillon and colleagues sequence the genome of a bdelloid rotifer, Adineta vaga, and show that its structure is incompatible with conventional meiosis, the type of cell division associated with sexual reproduction. The genome has undergone abundant gene conversion, which may limit the accumulation of deleterious mutations in the absence of meiosis. Up to 8% of the genes are of probable non-metazoan origin, probably acquired through horizontal gene transfer. These findings demonstrate positive evidence for asexual evolution, supporting the hypothesis of ancient asexuality among bdelloid rotifers.

Loss of sexual reproduction is considered an evolutionary dead end for metazoans, but bdelloid rotifers challenge this view as they appear to have persisted asexually for millions of years. Neither male sex organs nor meiosis have ever been observed in these microscopic animals: oocytes are formed through mitotic divisions, with no reduction of chromosome number and no indication of chromosome pairing. However, current evidence does not exclude that they may engage in sex on rare, cryptic occasions. Here we report the genome of a bdelloid rotifer, Adineta vaga (Davis, 1873), and show that its structure is incompatible with conventional meiosis. At gene scale, the genome of A. vaga is tetraploid and comprises both anciently duplicated segments and less divergent allelic regions. However, in contrast to sexual species, the allelic regions are rearranged and sometimes even found on the same chromosome. Such structure does not allow meiotic pairing; instead, we find abundant evidence of gene conversion, which may limit the accumulation of deleterious mutations in the absence of meiosis. Gene families involved in resistance to oxidation, carbohydrate metabolism and defence against transposons are significantly expanded, which may explain why transposable elements cover only 3% of the assembled sequence. Furthermore, 8% of the genes are likely to be of non-metazoan origin and were probably acquired horizontally. This apparent convergence between bdelloids and prokaryotes sheds new light on the evolutionary significance of sex.

A meta-analysis of genome-wide association studies identifies multiple longevity genes

Joris Deelen, Daniel S. Evans, Dan E. Arking, Niccolò Tesi +4 more

2019 · Nature Communications · 404 citations · doi:10.1038/s41467-019-11558-2

Human longevity is heritable, but genome-wide association (GWA) studies have had limited success. Here, we perform two meta-analyses of GWA studies of a rigorous longevity phenotype definition including 11,262/3484 cases surviving at or beyond the age corresponding to the 90th/99th survival percentile, respectively, and 25,483 controls whose age at death or at last contact was at or below the age corresponding to the 60th survival percentile. Consistent with previous reports, rs429358 (apolipoprotein E (ApoE) ε4) is associated with lower odds of surviving to the 90th and 99th percentile age, while rs7412 (ApoE ε2) shows the opposite. Moreover, rs7676745, located near GPR78, associates with lower odds of surviving to the 90th percentile age. Gene-level association analysis reveals a role for tissue-specific expression of multiple genes in longevity. Finally, genetic correlation of the longevity GWA results with that of several disease-related phenotypes points to a shared genetic architecture between health and longevity.

MaGe: a microbial genome annotation system supported by synteny results

David Vallenet

2006 · Nucleic Acids Research · 391 citations · doi:10.1093/nar/gkj406

Magnifying Genomes (MaGe) is a microbial genome annotation system based on a relational database containing information on bacterial genomes, as well as a web interface to achieve genome annotation projects. Our system allows one to initiate the annotation of a genome at the early stage of the finishing phase. MaGe's main features are (i) integration of annotation data from bacterial genomes enhanced by a gene coding re-annotation process using accurate gene models, (ii) integration of results obtained with a wide range of bioinformatics methods, among which exploration of gene context by searching for conserved synteny and reconstruction of metabolic pathways, (iii) an advanced web interface allowing multiple users to refine the automatic assignment of gene product functions. MaGe is also linked to numerous well-known biological databases and systems. Our system has been thoroughly tested during the annotation of complete bacterial genomes (Acinetobacter baylyi ADP1, Pseudoalteromonas haloplanktis, Frankia alni) and is currently used in the context of several new microbial genome annotation projects. In addition, MaGe allows for annotation curation and exploration of already published genomes from various genera (e.g. Yersinia, Bacillus and Neisseria). MaGe can be accessed at http://www.genoscope.cns.fr/agc/mage.
