Laboratoire Génome et Développement des Plantes
facilityPerpignan, Occitanie, France
Research output, citation impact, and the most-cited recent papers from Laboratoire Génome et Développement des Plantes (France). Aggregated across the NobleBlocks index of 300M+ scholarly works.
Top-cited papers from Laboratoire Génome et Développement des Plantes
MCScan is an algorithm able to scan multiple genomes or subgenomes in order to identify putative homologous chromosomal regions, and align these regions using genes as anchors. The MCScanX toolkit implements an adjusted MCScan algorithm for detection of synteny and collinearity that extends the original software by incorporating 14 utility programs for visualization of results and additional downstream analyses. Applications of MCScanX to several sequenced plant genomes and gene families are shown as examples. MCScanX can be used to effectively analyze chromosome structural changes, and reveal the history of gene family expansions that might contribute to the adaptation of lineages and taxa. An integrated view of various modes of gene duplication can supplement the traditional gene tree analysis in specific families. The source code and documentation of MCScanX are freely available at http://chibba.pgml.uga.edu/mcscan2/.
The sequencing and analysis of the banana genome is reported; these results inform plant phylogenetic relationships and genome evolution, and provide a resource for future genetic improvement of this important crop species. Bananas (Musa spp.) are a staple food and a major source of income in many tropical and subtropical countries. This paper reports the sequencing and analysis of the banana genome. This is the first non-grass monocotyledon to have its genome sequenced, providing an important bridge for comparative genome analysis in plants. Global banana production is under threat from increasingly well-adapted pests and diseases, so the availability of the genome sequence is an important resource for future crop development and improvement. Bananas (Musa spp.), including dessert and cooking types, are giant perennial monocotyledonous herbs of the order Zingiberales, a sister group to the well-studied Poales, which include cereals. Bananas are vital for food security in many tropical and subtropical countries and the most popular fruit in industrialized countries1. The Musa domestication process started some 7,000 years ago in Southeast Asia. It involved hybridizations between diverse species and subspecies, fostered by human migrations2, and selection of diploid and triploid seedless, parthenocarpic hybrids thereafter widely dispersed by vegetative propagation. Half of the current production relies on somaclones derived from a single triploid genotype (Cavendish)1. Pests and diseases have gradually become adapted, representing an imminent danger for global banana production3,4. Here we describe the draft sequence of the 523-megabase genome of a Musa acuminata doubled-haploid genotype, providing a crucial stepping-stone for genetic improvement of banana. We detected three rounds of whole-genome duplications in the Musa lineage, independently of those previously described in the Poales lineage and the one we detected in the Arecales lineage. This first monocotyledon high-continuity whole-genome sequence reported outside Poales represents an essential bridge for comparative genome analysis in plants. As such, it clarifies commelinid-monocotyledon phylogenetic relationships, reveals Poaceae-specific features and has led to the discovery of conserved non-coding sequences predating monocotyledon–eudicotyledon divergence.
Like many other crops, the cultivated peanut (Arachis hypogaea L.) is of hybrid origin and has a polyploid genome that contains essentially complete sets of chromosomes from two ancestral species. Here we report the genome sequence of peanut and show that after its polyploid origin, the genome has evolved through mobile-element activity, deletions and by the flow of genetic information between corresponding ancestral chromosomes (that is, homeologous recombination). Uniformity of patterns of homeologous recombination at the ends of chromosomes favors a single origin for cultivated peanut and its wild counterpart A. monticola. However, through much of the genome, homeologous recombination has created diversity. Using new polyploid hybrids made from the ancestral species, we show how this can generate phenotypic changes such as spontaneous changes in the color of the flowers. We suggest that diversity generated by these genetic mechanisms helped to favor the domestication of the polyploid A. hypogaea over other diploid Arachis species cultivated by humans.
Xavier Argout and colleagues report the draft genome of Theobroma cacao, the tropical crop that is the source of chocolate. The sequence assembly covers approximately 80% of the genome. We sequenced and assembled the draft genome of Theobroma cacao, an economically important tropical-fruit tree crop that is the source of chocolate. This assembly corresponds to 76% of the estimated genome size and contains almost all previously described genes, with 82% of these genes anchored on the 10 T. cacao chromosomes. Analysis of this sequence information highlighted specific expansion of some gene families during evolution, for example, flavonoid-related genes. It also provides a major source of candidate genes for T. cacao improvement. Based on the inferred paleohistory of the T. cacao genome, we propose an evolutionary scenario whereby the ten T. cacao chromosomes were shaped from an ancestor through eleven chromosome fusions.
Retrotransposons are the main components of eukaryotic genomes, representing up to 80% of some large plant genomes. These mobile elements transpose via a "copy and paste" mechanism, thus increasing their copy number while active. Their accumulation is now accepted as the main factor of genome size increase in higher eukaryotes, besides polyploidy. However, the dynamics of this process are poorly understood. In this study, we show that Oryza australiensis, a wild relative of the Asian cultivated rice O. sativa, has undergone recent bursts of three LTR-retrotransposon families. This genome has accumulated more than 90,000 retrotransposon copies during the last three million years, leading to a rapid twofold increase of its size. In addition, phenetic analyses of these retrotransposons clearly confirm that the genomic bursts occurred posterior to the radiation of the species. This provides direct evidence of retrotransposon-mediated variation of genome size within a plant genus.
Systematic analysis of the Arabidopsis genome provides a basis for detailed studies of genome structure and evolution. Members of multigene families were mapped, and random sequence alignment was used to identify regions of extended similarity in the Arabidopsis genome. Detailed analysis showed that the number, order, and orientation of genes were conserved over large regions of the genome, revealing extensive duplication covering the majority of the known genomic sequence. Fine mapping analysis showed much rearrangement, resulting in a patchwork of duplicated regions that indicated deletion, insertion, tandem duplication, inversion, and reciprocal translocation. The implications of these observations for evolution of the Arabidopsis genome as well as their usefulness for analysis and annotation of the genomic sequence and in comparative genomics are discussed.
Abstract Oaks are an important part of our natural and cultural heritage. Not only are they ubiquitous in our most common landscapes 1 but they have also supplied human societies with invaluable services, including food and shelter, since prehistoric times 2 . With 450 species spread throughout Asia, Europe and America 3 , oaks constitute a critical global renewable resource. The longevity of oaks (several hundred years) probably underlies their emblematic cultural and historical importance. Such long-lived sessile organisms must persist in the face of a wide range of abiotic and biotic threats over their lifespans. We investigated the genomic features associated with such a long lifespan by sequencing, assembling and annotating the oak genome. We then used the growing number of whole-genome sequences for plants (including tree and herbaceous species) to investigate the parallel evolution of genomic characteristics potentially underpinning tree longevity. A further consequence of the long lifespan of trees is their accumulation of somatic mutations during mitotic divisions of stem cells present in the shoot apical meristems. Empirical 4 and modelling 5 approaches have shown that intra-organismal genetic heterogeneity can be selected for 6 and provides direct fitness benefits in the arms race with short-lived pests and pathogens through a patchwork of intra-organismal phenotypes 7 . However, there is no clear proof that large-statured trees consist of a genetic mosaic of clonally distinct cell lineages within and between branches. Through this case study of oak, we demonstrate the accumulation and transmission of somatic mutations and the expansion of disease-resistance gene families in trees.
Rice was chosen as a model organism for genome sequencing because of its economic importance, small genome size, and syntenic relationship with other cereal species. We have constructed a bacterial artificial chromosome fingerprint-based physical map of the rice genome to facilitate the whole-genome sequencing of rice. Most of the rice genome ( approximately 90.6%) was anchored genetically by overgo hybridization, DNA gel blot hybridization, and in silico anchoring. Genome sequencing data also were integrated into the rice physical map. Comparison of the genetic and physical maps reveals that recombination is suppressed severely in centromeric regions as well as on the short arms of chromosomes 4 and 10. This integrated high-resolution physical map of the rice genome will greatly facilitate whole-genome sequencing by helping to identify a minimum tiling path of clones to sequence. Furthermore, the physical map will aid map-based cloning of agronomically important genes and will provide an important tool for the comparative analysis of grass genomes.
Long non-protein coding RNAs (npcRNA) represent an emerging class of riboregulators, which either act directly in this long form or are processed to shorter miRNA and siRNA. Genome-wide bioinformatic analysis of full-length cDNA databases identified 76 Arabidopsis npcRNAs. Fourteen npcRNAs were antisense to protein-coding mRNAs, suggesting cis-regulatory roles. Numerous 24-nt siRNA matched to five different npcRNAs, suggesting that these npcRNAs are precursors of this type of siRNA. Expression analyses of the 76 npcRNAs identified a novel npcRNA that accumulates in a dcl1 mutant but does not appear to produce trans-acting siRNA or miRNA. Additionally, another npcRNA was the precursor of miR869 and shown to be up-regulated in dcl4 but not in dcl1 mutants, indicative of a young miRNA gene. Abiotic stress altered the accumulation of 22 npcRNAs among the 76, a fraction significantly higher than that observed for the RNA binding protein-coding fraction of the transcriptome. Overexpression analyses in Arabidopsis identified two npcRNAs as regulators of root growth during salt stress and leaf morphology, respectively. Hence, together with small RNAs, long npcRNAs encompass a sensitive component of the transcriptome that have diverse roles during growth and differentiation.
Anthocyanidin reductase encoded by the BANYULS (BAN) gene is the core enzyme in proanthocyanidin (PA) biosynthesis. Here, we analyzed the developmental mechanisms that regulate the spatiotemporal expression of BAN in the developing Arabidopsis seed coat. PA-accumulating cells were localized histochemically in the inner integument (seed body and micropyle) and pigment strand (chalaza). BAN promoter activity was detected specifically in these cells. Gain-of-function experiments showed that an 86-bp promoter fragment functioned as an enhancer specific for PA-accumulating cells. Mutations in regulatory genes of PA biosynthesis abolished BAN promoter activity (transparent testa2 [tt2], tt8, and transparent testa glabra1 [ttg1]), modified its spatial pattern (tt1 and tt16), or had no influence (ttg2), thus revealing complex regulatory interactions at several developmental levels. Genetic ablation of PA-accumulating cells targeted by the BAN promoter fused to BARNASE led to the formation of normal plants that produced viable yellow seeds. Importantly, these seeds had no obvious defects in endosperm and embryo development.
Mingsheng Chen, Klaus Mayer, Steve Rounsley, Rod Wing and colleagues report the genome sequence of African rice (Oryza glaberrima), a different species than Asian rice. The authors resequenced 20 O. glaberrima accessions and 94 Oryza barthii accessions (the putative progenitor species of O. glaberrima), and their analyses support the hypothesis that O. glaberrima was domesticated in a single region along the upper Niger river. The cultivation of rice in Africa dates back more than 3,000 years. Interestingly, African rice is not of the same origin as Asian rice (Oryza sativa L.) but rather is an entirely different species (i.e., Oryza glaberrima Steud.). Here we present a high-quality assembly and annotation of the O. glaberrima genome and detailed analyses of its evolutionary history of domestication and selection. Population genomics analyses of 20 O. glaberrima and 94 Oryza barthii accessions support the hypothesis that O. glaberrima was domesticated in a single region along the Niger river as opposed to noncentric domestication events across Africa. We detected evidence for artificial selection at a genome-wide scale, as well as with a set of O. glaberrima genes orthologous to O. sativa genes that are known to be associated with domestication, thus indicating convergent yet independent selection of a common set of genes during two geographically and culturally distinct domestication processes.
Thioredoxins (Trx) and glutaredoxins (Grx) constitute families of thiol oxidoreductases. Our knowledge of Trx and Grx in plants has dramatically increased during the last decade. The release of the Arabidopsis genome sequence revealed an unexpectedly high number of Trx and Grx genes. The availability of several genomes of vascular and nonvascular plants allowed the establishment of a clear classification of the genes and the chronology of their appearance during plant evolution. Proteomic approaches have been developed that identified the putative Trx and Grx target proteins which are implicated in all aspects of plant growth, including basal metabolism, iron/sulfur cluster formation, development, adaptation to the environment, and stress responses. Analyses of the biochemical characteristics of specific Trx and Grx point to a strong specificity toward some target enzymes, particularly within plastidial Trx and Grx. In apparent contradiction with this specificity, genetic approaches show an absence of phenotype for most available Trx and Grx mutants, suggesting that redundancies also exist between Trx and Grx members. Despite this, the isolation of mutants inactivated in multiple genes and several genetic screens allowed the demonstration of the involvement of Trx and Grx in pathogen response, phytohormone pathways, and at several control points of plant development. Cytosolic Trxs are reduced by NADPH-thioredoxin reductase (NTR), while the reduction of Grx depends on reduced glutathione (GSH). Interestingly, recent development integrating biochemical analysis, proteomic data, and genetics have revealed an extensive crosstalk between the cytosolic NTR/Trx and GSH/Grx systems. This crosstalk, which occurs at multiple levels, reveals the high plasticity of the redox systems in plants.
In Arabidopsis thaliana, four major regulators (ABSCISIC ACID INSENSITIVE3 [ABI3], FUSCA3 [FUS3], LEAFY COTYLEDON1 [LEC1], and LEC2) control most aspects of seed maturation, such as accumulation of storage compounds, cotyledon identity, acquisition of desiccation tolerance, and dormancy. The molecular basis for complex genetic interactions among these regulators is poorly understood. By analyzing ABI3 and FUS3 expression in various single, double, and triple maturation mutants, we have identified multiple regulatory links among all four genes. We found that one of the major roles of LEC2 was to upregulate FUS3 and ABI3. The lec2 mutation is responsible for a dramatic decrease in ABI3 and FUS3 expression, and most lec2 phenotypes can be rescued by ABI3 or FUS3 constitutive expression. In addition, ABI3 and FUS3 positively regulate themselves and each other, thereby forming feedback loops essential for their sustained and uniform expression in the embryo. Finally, LEC1 also positively regulates ABI3 and FUS3 in the cotyledons. Most of the genetic controls discovered were found to be local and redundant, explaining why they had previously been overlooked. This works establishes a genetic framework for seed maturation, organizing the key regulators of this process into a hierarchical network. In addition, it offers a molecular explanation for the puzzling variable features of lec2 mutant embryos.
A review of type III effectors (T3 effectors) from strains of Xanthomonas reveals a growing list of candidate and known effectors based on functional assays and sequence and structural similarity searches of genomic data. We propose that the effectors and suspected effectors should be distributed into 39 so-called Xop groups reflecting sequence similarity. Some groups have structural motifs for putative enzymatic functions, and recent studies have provided considerable insight into the interaction with host factors in their function as mediators of virulence and elicitors of resistance for a few specific T3 effectors. Many groups are related to T3 effectors of plant and animal pathogenic bacteria, and several groups appear to have been exploited primarily by Xanthomonas species based on available data. At the same time, a relatively large number of candidate effectors remain to be examined in more detail with regard to their function within host cells.
Accumulation of deleterious mutations has important consequences for the evolution of mating systems and the persistence of small populations. It is well established that consanguineous mating can purge a part of the mutation load and that lethal mutations can also be purged in small populations. However, the efficiency of purging in natural populations, due to either consanguineous mating or to reduced population size, has been questioned. Consequences of consanguineous mating systems and small population size are often equated under "inbreeding" because both increase homozygosity, and selection is though to be more efficient against homozygous deleterious alleles. I show that two processes of purging that I call "purging by drift" and "purging by nonrandom mating" have to be distinguished. Conditions under which the two ways of purging are effective are derived. Nonrandom mating can purge deleterious mutations regardless of their dominance level, whereas only highly recessive mutations can be purged by drift. Both types of purging are limited by population size, and sharp thresholds separate domains where purging is either effective or not. The limitations derived here on the efficiency of purging are compatible with some experimental studies. Implications of these results for conservation and evolution of mating systems are discussed.
Red seaweeds are key components of coastal ecosystems and are economically important as food and as a source of gelling agents, but their genes and genomes have received little attention. Here we report the sequencing of the 105-Mbp genome of the florideophyte Chondrus crispus (Irish moss) and the annotation of the 9,606 genes. The genome features an unusual structure characterized by gene-dense regions surrounded by repeat-rich regions dominated by transposable elements. Despite its fairly large size, this genome shows features typical of compact genomes, e.g., on average only 0.3 introns per gene, short introns, low median distance between genes, small gene families, and no indication of large-scale genome duplication. The genome also gives insights into the metabolism of marine red algae and adaptations to the marine environment, including genes related to halogen metabolism, oxylipins, and multicellularity (microRNA processing and transcription factors). Particularly interesting are features related to carbohydrate metabolism, which include a minimalistic gene set for starch biosynthesis, the presence of cellulose synthases acquired before the primary endosymbiosis showing the polyphyly of cellulose synthesis in Archaeplastida, and cellulases absent in terrestrial plants as well as the occurrence of a mannosylglycerate synthase potentially originating from a marine bacterium. To explain the observations on genome structure and gene content, we propose an evolutionary scenario involving an ancestral red alga that was driven by early ecological forces to lose genes, introns, and intergenetic DNA; this loss was followed by an expansion of genome size as a consequence of activity of transposable elements.
Recent comprehensive sequence analysis of the maize genome now permits detailed discovery and description of all transposable elements (TEs) in this complex nuclear environment. Reiteratively optimized structural and homology criteria were used in the computer-assisted search for retroelements, TEs that transpose by reverse transcription of an RNA intermediate, with the final results verified by manual inspection. Retroelements were found to occupy the majority (>75%) of the nuclear genome in maize inbred B73. Unprecedented genetic diversity was discovered in the long terminal repeat (LTR) retrotransposon class of retroelements, with >400 families (>350 newly discovered) contributing >31,000 intact elements. The two other classes of retroelements, SINEs (four families) and LINEs (at least 30 families), were observed to contribute 1,991 and approximately 35,000 copies, respectively, or a combined approximately 1% of the B73 nuclear genome. With regard to fully intact elements, median copy numbers for all retroelement families in maize was 2 because >250 LTR retrotransposon families contained only one or two intact members that could be detected in the B73 draft sequence. The majority, perhaps all, of the investigated retroelement families exhibited non-random dispersal across the maize genome, with LINEs, SINEs, and many low-copy-number LTR retrotransposons exhibiting a bias for accumulation in gene-rich regions. In contrast, most (but not all) medium- and high-copy-number LTR retrotransposons were found to preferentially accumulate in gene-poor regions like pericentromeric heterochromatin, while a few high-copy-number families exhibited the opposite bias. Regions of the genome with the highest LTR retrotransposon density contained the lowest LTR retrotransposon diversity. These results indicate that the maize genome provides a great number of different niches for the survival and procreation of a great variety of retroelements that have evolved to differentially occupy and exploit this genomic diversity.
Eukaryotic ribosomes are made of two components, four ribosomal RNAs, and approximately 80 ribosomal proteins (r-proteins). The exact number of r-proteins and r-protein genes in higher plants is not known. The strong conservation in eukaryotic r-protein primary sequence allowed us to use the well-characterized rat (Rattus norvegicus) r-protein set to identify orthologues on the five haploid chromosomes of Arabidopsis. By use of the numerous expressed sequence tag (EST) accessions and the complete genomic sequence of this species, we identified 249 genes (including some pseudogenes) corresponding to 80 (32 small subunit and 48 large subunit) cytoplasmic r-protein types. None of the r-protein genes are single copy and most are encoded by three or four expressed genes, indicative of the internal duplication of the Arabidopsis genome. The r-proteins are distributed throughout the genome. Inspection of genes in the vicinity of r-protein gene family members confirms extensive duplications of large chromosome fragments and sheds light on the evolutionary history of the Arabidopsis genome. Examination of large duplicated regions indicated that a significant fraction of the r-protein genes have been either lost from one of the duplicated fragments or inserted after the initial duplication event. Only 52 r-protein genes lack a matching EST accession, and 19 of these contain incomplete open reading frames, confirming that most genes are expressed. Assessment of cognate EST numbers suggests that r-protein gene family members are differentially expressed.
The extent of gene dispersal is a fundamental factor of the population and evolutionary dynamics of tropical tree species, but directly monitoring seed and pollen movement is a difficult task. However, indirect estimates of historical gene dispersal can be obtained from the fine-scale spatial genetic structure of populations at drift-dispersal equilibrium. Using an approach that is based on the slope of the regression of pairwise kinship coefficients on spatial distance and estimates of the effective population density, we compare indirect gene dispersal estimates of sympatric populations of 10 tropical tree species. We re-analysed 26 data sets consisting of mapped allozyme, SSR (simple sequence repeat), RAPD (random amplified polymorphic DNA) or AFLP (amplified fragment length polymorphism) genotypes from two rainforest sites in French Guiana. Gene dispersal estimates were obtained for at least one marker in each species, although the estimation procedure failed under insufficient marker polymorphism, limited sample size, or inappropriate sampling area. Estimates generally suffered low precision and were affected by assumptions regarding the effective population density. Averaging estimates over data sets, the extent of gene dispersal ranged from 150 m to 1200 m according to species. Smaller gene dispersal estimates were obtained in species with heavy diaspores, which are presumably not well dispersed, and in populations with high local adult density. We suggest that limited seed dispersal could indirectly limit effective pollen dispersal by creating higher local tree densities, thereby increasing the positive correlation between pollen and seed dispersal distances. We discuss the potential and limitations of our indirect estimation procedure and suggest guidelines for future studies.
Growing evidence shows that epigenetic mechanisms contribute to complex traits, with implications across many fields of biology. In plant ecology, recent studies have attempted to merge ecological experiments with epigenetic analyses to elucidate the contribution of epigenetics to plant phenotypes, stress responses, adaptation to habitat, and range distributions. While there has been some progress in revealing the role of epigenetics in ecological processes, studies with non-model species have so far been limited to describing broad patterns based on anonymous markers of DNA methylation. In contrast, studies with model species have benefited from powerful genomic resources, which contribute to a more mechanistic understanding but have limited ecological realism. Understanding the significance of epigenetics for plant ecology requires increased transfer of knowledge and methods from model species research to genomes of evolutionarily divergent species, and examination of responses to complex natural environments at a more mechanistic level. This requires transforming genomics tools specifically for studying non-model species, which is challenging given the large and often polyploid genomes of plants. Collaboration among molecular geneticists, ecologists and bioinformaticians promises to enhance our understanding of the mutual links between genome function and ecological processes.