Génétique Physiologie et Systèmes d'Elevage
facilityToulouse, Occitanie, France
Research output, citation impact, and the most-cited recent papers from Génétique Physiologie et Systèmes d'Elevage (France). Aggregated across the NobleBlocks index of 300M+ scholarly works.
Top-cited papers from Génétique Physiologie et Systèmes d'Elevage
This paper reports the genome sequence of domesticated tomato, a major crop plant, and a draft sequence for its closest wild relative; comparative genomics reveal very little divergence between the two genomes but some important differences with the potato genome, another important food crop in the genus Solanum. Tomato (Solanum lycopersicum) is a major crop plant and a model system for fruit development. Solanum is one of the largest angiosperm genera1 and includes annual and perennial plants from diverse habitats. Here we present a high-quality genome sequence of domesticated tomato, a draft sequence of its closest wild relative, Solanum pimpinellifolium2, and compare them to each other and to the potato genome (Solanum tuberosum). The two tomato genomes show only 0.6% nucleotide divergence and signs of recent admixture, but show more than 8% divergence from potato, with nine large and several smaller inversions. In contrast to Arabidopsis, but similar to soybean, tomato and potato small RNAs map predominantly to gene-rich chromosomal regions, including gene promoters. The Solanum lineage has experienced two consecutive genome triplications: one that is ancient and shared with rosids, and a more recent one. These triplications set the stage for the neofunctionalization of genes controlling fruit characteristics, such as colour and fleshiness.
An annotated reference sequence representing the hexaploid bread wheat genome in 21 pseudomolecules has been analyzed to identify the distribution and genomic context of coding and noncoding elements across the A, B, and D subgenomes. With an estimated coverage of 94% of the genome and containing 107,891 high-confidence gene models, this assembly enabled the discovery of tissue- and developmental stage-related coexpression networks by providing a transcriptome atlas representing major stages of wheat development. Dynamics of complex gene families involved in environmental adaptation and end-use quality were revealed at subgenome resolution and contextualized to known agronomic single-gene or quantitative trait loci. This community resource establishes the foundation for accelerating wheat research and application through improved understanding of wheat biology and genomics-assisted breeding.
BACKGROUND: Venn diagrams are commonly used to display list comparison. In biology, they are widely used to show the differences between gene lists originating from different differential analyses, for instance. They thus allow the comparison between different experimental conditions or between different methods. However, when the number of input lists exceeds four, the diagram becomes difficult to read. Alternative layouts and dynamic display features can improve its use and its readability. RESULTS: jvenn is a new JavaScript library. It processes lists and produces Venn diagrams. It handles up to six input lists and presents results using classical or Edwards-Venn layouts. User interactions can be controlled and customized. Finally, jvenn can easily be embeded in a web page, allowing to have dynamic Venn diagrams. CONCLUSIONS: jvenn is an open source component for web environments helping scientists to analyze their data. The library package, which comes with full documentation and an example, is freely available at http://bioinfo.genotoul.fr/jvenn.
Dense molecular markers are being used in genetic evaluation for parts of the population. This requires a two-step procedure where pseudo-data (for instance, daughter yield deviations) are computed from full records and pedigree data and later used for genomic evaluation. This results in bias and loss of information. One way to incorporate the genomic information into a full genetic evaluation is by modifying the numerator relationship matrix. A naive proposal is to substitute the relationships of genotyped animals with the genomic relationship matrix. However, this results in incoherencies because the genomic relationship matrix includes information on relationships among ancestors and descendants. In other words, using the pedigree-derived covariance between genotyped and ungenotyped individuals, with the pretense that genomic information does not exist, leads to inconsistencies. It is proposed to condition the genetic value of ungenotyped animals on the genetic value of genotyped animals via the selection index (e.g., pedigree information), and then use the genomic relationship matrix for the latter. This results in a joint distribution of genotyped and ungenotyped genetic values, with a pedigree-genomic relationship matrix H. In this matrix, genomic information is transmitted to the covariances among all ungenotyped individuals. The matrix is (semi)positive definite by construction, which is not the case for the naive approach. Numerical examples and alternative expressions are discussed. Matrix H is suitable for iteration on data algorithms that multiply a vector times a matrix, such as preconditioned conjugated gradients.
Vertebrate evolution has been shaped by several rounds of whole-genome duplications (WGDs) that are often suggested to be associated with adaptive radiations and evolutionary innovations. Due to an additional round of WGD, the rainbow trout genome offers a unique opportunity to investigate the early evolutionary fate of a duplicated vertebrate genome. Here we show that after 100 million years of evolution the two ancestral subgenomes have remained extremely collinear, despite the loss of half of the duplicated protein-coding genes, mostly through pseudogenization. In striking contrast is the fate of miRNA genes that have almost all been retained as duplicated copies. The slow and stepwise rediploidization process characterized here challenges the current hypothesis that WGD is followed by massive and rapid genomic reorganizations and gene deletions. Although whole-genome duplications (WGDs) are rare events, they have an important role in shaping vertebrate evolution. Here, the authors sequence the rainbow trout genome and show that rediploidization after WGD occurs in a slow and stepwise manner.
Motivation: Metagenomics leads to major advances in microbial ecology and biologists need user friendly tools to analyze their data on their own. Results: This Galaxy-supported pipeline, called FROGS, is designed to analyze large sets of amplicon sequences and produce abundance tables of Operational Taxonomic Units (OTUs) and their taxonomic affiliation. The clustering uses Swarm. The chimera removal uses VSEARCH, combined with original cross-sample validation. The taxonomic affiliation returns an innovative multi-affiliation output to highlight databases conflicts and uncertainties. Statistical results and numerous graphical illustrations are produced along the way to monitor the pipeline. FROGS was tested for the detection and quantification of OTUs on real and in silico datasets and proved to be rapid, robust and highly sensitive. It compares favorably with the widespread mothur, UPARSE and QIIME. Availability and implementation: Source code and instructions for installation: https://github.com/geraldinepascal/FROGS.git. A companion website: http://frogs.toulouse.inra.fr. Contact: geraldine.pascal@inra.fr. Supplementary information: Supplementary data are available at Bioinformatics online.
The BioMart Community Portal (www.biomart.org) is a community-driven effort to provide a unified interface to biomedical databases that are distributed worldwide. The portal provides access to numerous database projects supported by 30 scientific organizations. It includes over 800 different biological datasets spanning genomics, proteomics, model organisms, cancer data, ontology information and more. All resources available through the portal are independently administered and funded by their host organizations. The BioMart data federation technology provides a unified interface to all the available data. The latest version of the portal comes with many new databases that have been created by our ever-growing community. It also comes with better support and extensibility for data analysis and visualization tools. A new addition to our toolbox, the enrichment analysis tool is now accessible through graphical and web service interface. The BioMart community portal averages over one million requests per day. Building on this level of service and the wealth of information that has become available, the BioMart Community Portal has introduced a new, more scalable and cheaper alternative to the large data stores maintained by specialized organizations.
A high-quality reference for the sunflower genome (Helianthus annuus L.) and analysis of gene networks involved in flowering time and oil metabolism provide a basis for nutritional exploitation and analyses of adaptation to climate change. Nicolas Langlade and colleagues report the genome sequence of the domesticated sunflower, Helianthus annuus L., a global oil crop that can maintain stable yields across a wide range of environmental conditions. Their comparative analyses provide insights into the evolutionary history of Asterids. They also analysed transcriptomic data from vegetative and floral organs, re-sequenced 80 domesticated lines and performed genome-wide association studies identifying 35 loci associated with flowering time. These resources will be useful in breeding programs as well as ecological and evolutionary studies. The domesticated sunflower, Helianthus annuus L., is a global oil crop that has promise for climate change adaptation, because it can maintain stable yields across a wide variety of environmental conditions, including drought1. Even greater resilience is achievable through the mining of resistance alleles from compatible wild sunflower relatives2,3, including numerous extremophile species4. Here we report a high-quality reference for the sunflower genome (3.6 gigabases), together with extensive transcriptomic data from vegetative and floral organs. The genome mostly consists of highly similar, related sequences5 and required single-molecule real-time sequencing technologies for successful assembly. Genome analyses enabled the reconstruction of the evolutionary history of the Asterids, further establishing the existence of a whole-genome triplication at the base of the Asterids II clade6 and a sunflower-specific whole-genome duplication around 29 million years ago7. An integrative approach combining quantitative genetics, expression and diversity data permitted development of comprehensive gene networks for two major breeding traits, flowering time and oil metabolism, and revealed new candidate genes in these networks. We found that the genomic architecture of flowering time has been shaped by the most recent whole-genome duplication, which suggests that ancient paralogues can remain in the same regulatory networks for dozens of millions of years. This genome represents a cornerstone for future research programs aiming to exploit genetic diversity to improve biotic and abiotic stress resistance and oil production, while also considering agricultural constraints and human nutritional needs8,9.
Staphylococci are the main aetiological agents of small ruminants intramammary infections (IMI), the more frequent isolates being S. aureus in clinical cases and coagulase negative species in subclinical IMI. The clinical IMI, whose annual incidence is usually lower than 5%, mainly occur at the beginning of machine milking and during the first third of lactation. These features constitute small ruminant peculiarities compared to dairy cattle. Small ruminant mastitis is generally a chronic and contagious infection: the primary sources are mammary and cutaneous carriages, and spreading mainly occurs during milking. Somatic cell counts (SCC) represent a valuable tool for prevalence assessment and screening, but predictive values are better in ewes than in goats. Prevention is most often based on milking machine management, sanitation and annual control, and milking technique optimisation. Elimination mainly relies on culling animals exhibiting clinical, chronic and recurrent IMI, and on drying-off intramammary antibiotherapy; this treatment allows a good efficacy and may be used selectively by targeting infected udders only. Heritability values for lactation mean SCC scores are between 0.11 and 0.15. Effective inclusion of ewe's mastitis resistance in the breeding goal has recently been implemented in France following experimental and large scale estimations of genetic parameters for SCC scores.
OBJECTIVE: The gut microbiota, which is considered a causal factor in metabolic diseases as shown best in animals, is under the dual influence of the host genome and nutritional environment. This study investigated whether the gut microbiota per se, aside from changes in genetic background and diet, could sign different metabolic phenotypes in mice. METHODS: The unique animal model of metabolic adaptation was used, whereby C57Bl/6 male mice fed a high-fat carbohydrate-free diet (HFD) became either diabetic (HFD diabetic, HFD-D) or resisted diabetes (HFD diabetes-resistant, HFD-DR). Pyrosequencing of the gut microbiota was carried out to profile the gut microbial community of different metabolic phenotypes. Inflammation, gut permeability, features of white adipose tissue, liver and skeletal muscle were studied. Furthermore, to modify the gut microbiota directly, an additional group of mice was given a gluco-oligosaccharide (GOS)-supplemented HFD (HFD+GOS). RESULTS: Despite the mice having the same genetic background and nutritional status, a gut microbial profile specific to each metabolic phenotype was identified. The HFD-D gut microbial profile was associated with increased gut permeability linked to increased endotoxaemia and to a dramatic increase in cell number in the stroma vascular fraction from visceral white adipose tissue. Most of the physiological characteristics of the HFD-fed mice were modulated when gut microbiota was intentionally modified by GOS dietary fibres. CONCLUSIONS: The gut microbiota is a signature of the metabolic phenotypes independent of differences in host genetic background and diet.
Whole-genome sequencing projects are increasingly populating the tree of life and characterizing biodiversity1–4. Sparse taxon sampling has previously been proposed to confound phylogenetic inference5, and captures only a fraction of the genomic diversity. Here we report a substantial step towards the dense representation of avian phylogenetic and molecular diversity, by analysing 363 genomes from 92.4% of bird families—including 267 newly sequenced genomes produced for phase II of the Bird 10,000 Genomes (B10K) Project. We use this comparative genome dataset in combination with a pipeline that leverages a reference-free whole-genome alignment to identify orthologous regions in greater numbers than has previously been possible and to recognize genomic novelties in particular bird lineages. The densely sampled alignment provides a single-base-pair map of selection, has more than doubled the fraction of bases that are confidently predicted to be under conservation and reveals extensive patterns of weak selection in predominantly non-coding DNA. Our results demonstrate that increasing the diversity of genomes used in comparative studies can reveal more shared and lineage-specific variation, and improve the investigation of genomic characteristics. We anticipate that this genomic resource will offer new perspectives on evolutionary processes in cross-species comparative analyses and assist in efforts to conserve species. A dataset of the genomes of 363 species from the Bird 10,000 Genomes Project shows increased power to detect shared and lineage-specific variation, demonstrating the importance of phylogenetically diverse taxon sampling in whole-genome sequencing.
SUMMARY: Breeding programs face the challenge of integrating information from genomics and from quantitative trait loci (QTL) analysis in order to identify genomic sequences controlling the variation of important traits. Despite the development of integrative databases, building a consensus map of genes, QTL and other loci gathered from multiple maps remains a manual and tedious task. Nevertheless, this is a critical step to reveal co-locations between genes and QTL. Another important matter is to determine whether QTL linked to same traits or related ones is detected in independent experiments and located in the same region, and represents a single locus or not. Statistical tools such as meta-analysis can be used to answer this question. BioMercator has been developed to automate map compilation and QTL meta-analysis, and to visualize co-locations between genes and QTL through a graphical interface. AVAILABILITY: Available upon request (http://moulon/~bioinfo/BioMercator/). Free of charge for academic use.
Abstract Oaks are an important part of our natural and cultural heritage. Not only are they ubiquitous in our most common landscapes 1 but they have also supplied human societies with invaluable services, including food and shelter, since prehistoric times 2 . With 450 species spread throughout Asia, Europe and America 3 , oaks constitute a critical global renewable resource. The longevity of oaks (several hundred years) probably underlies their emblematic cultural and historical importance. Such long-lived sessile organisms must persist in the face of a wide range of abiotic and biotic threats over their lifespans. We investigated the genomic features associated with such a long lifespan by sequencing, assembling and annotating the oak genome. We then used the growing number of whole-genome sequences for plants (including tree and herbaceous species) to investigate the parallel evolution of genomic characteristics potentially underpinning tree longevity. A further consequence of the long lifespan of trees is their accumulation of somatic mutations during mitotic divisions of stem cells present in the shoot apical meristems. Empirical 4 and modelling 5 approaches have shown that intra-organismal genetic heterogeneity can be selected for 6 and provides direct fitness benefits in the arms race with short-lived pests and pathogens through a patchwork of intra-organismal phenotypes 7 . However, there is no clear proof that large-statured trees consist of a genetic mosaic of clonally distinct cell lineages within and between branches. Through this case study of oak, we demonstrate the accumulation and transmission of somatic mutations and the expansion of disease-resistance gene families in trees.
Sugarcane (Saccharum spp.) is a major crop for sugar and bioenergy production. Its highly polyploid, aneuploid, heterozygous, and interspecific genome poses major challenges for producing a reference sequence. We exploited colinearity with sorghum to produce a BAC-based monoploid genome sequence of sugarcane. A minimum tiling path of 4660 sugarcane BAC that best covers the gene-rich part of the sorghum genome was selected based on whole-genome profiling, sequenced, and assembled in a 382-Mb single tiling path of a high-quality sequence. A total of 25,316 protein-coding gene models are predicted, 17% of which display no colinearity with their sorghum orthologs. We show that the two species, S. officinarum and S. spontaneum, involved in modern cultivars differ by their transposable elements and by a few large chromosomal rearrangements, explaining their distinct genome size and distinct basic chromosome numbers while also suggesting that polyploidization arose in both lineages after their divergence.
We have developed a panel of 152 whole-genome radiation hybrids by fusing irradiated diploid pig lymphocytes or fibroblasts with recipient hamster permanent cells. The number and size of the porcine chromosome fragments retained in each hybrid clone were checked by fluorescence in situ hybridization with a SINE probe or by primed in situ labeling (PRINS) with SINE-specific primers. A strategy based on the interspersed repetitive sequence polymerase chain reaction (IRS-PCR) was developed for selected clones to determine if the large fragments painted by the SINE probe corresponded to one pig chromosome or to different fragments of several chromosomes. This strategy was buttressed by a double PRINS approach using primers specific for alpha-satellite sequences of two different groups of swine chromosomes. Genome retention frequency was estimated for each clone by PCR with 32 markers localized on different porcine chromosomes. Of the 152 hybrids produced, 126 were selected on the basis of cytogenetic content and chromosome retention frequency to construct a radiation hybrid map of swine chromosome 8. Our initial results for this chromosome indicate that the resolution of the radiation hybrid map is 18 times higher than that obtained by linkage analysis.
Recombinant strains of hepatitis E virus (HEV) with insertions of human genomic fragments or HEV sequence duplications in the sequence encoding the polyproline region (PPR) were previously described in chronically infected patients. Such genomic rearrangements confer a replicative advantage in vitro but little is known about their frequency, location or origin. As the sequences of only a few virus genomes are available, we analyzed the complete genomes of 114 HEV genotype 3 strains using the single molecular real-time sequencing method to determine the frequency, location and origin of inserted genomic fragments, plus the proportions of variants with genomic rearrangements in each virus quasispecies. We also examined the amino acid compositions and post-translational modifications conferred by these rearrangements by comparing them to sequences without human gene insertions or HEV gene duplications. We found genomic rearrangements in 7/114 (6.1%) complete genome sequences (4 HEV-3f, 1 HEV-3e, 1 HEV-3h and 1 HEV-3chi-new), all from immunocompromised patients, and 3/7 were found at the acute phase of infection. Six of the 7 patients harbored virus-host recombinant variants, including 1 patient with 2 different recombinant variants. We also sequenced 3 recombinant variants with genome duplications of the PPR or PPR+X domains in a single patient. All the genomic rearrangements (7 human fragment insertions of varying origins and 3 HEV genome duplications) occurred in the PPR. The sequences with genomic rearrangements had specific characteristics: increased net load (p<0.001) and more ubiquitination (p<0.001), phosphorylation (p<0.001) and acetylation (p<0.001) sites. The human fragment insertions and HEV genome duplications had slightly different characteristics. We believe this is the first description of HEV strains with genomic rearrangements in patients at the acute phase of infection; perhaps these strains are directly transmitted. Clearly, genomic rearrangements produce a greater net load with duplications and insertions having different features. Further studies are needed to clarify the mechanisms by which such modifications influence HEV replication.
The genetic changes underlying the initial steps of animal domestication are still poorly understood. We generated a high-quality reference genome for the rabbit and compared it to resequencing data from populations of wild and domestic rabbits. We identified more than 100 selective sweeps specific to domestic rabbits but only a relatively small number of fixed (or nearly fixed) single-nucleotide polymorphisms (SNPs) for derived alleles. SNPs with marked allele frequency differences between wild and domestic rabbits were enriched for conserved noncoding sites. Enrichment analyses suggest that genes affecting brain and neuronal development have often been targeted during domestication. We propose that because of a truly complex genetic background, tame behavior in rabbits and other domestic animals evolved by shifts in allele frequencies at many loci, rather than by critical changes at only a few domestication loci.
The improvement and application of pest and disease models to analyse and predict yield losses including those due to climate change is still a challenge for the scientific community. Applied modelling of crop diseases and pests has mostly targeted the development of support capabilities to schedule scouting or pesticide applications. There is a need for research to both broaden the scope and evaluate the capabilities of pest and disease models. Key research questions not only involve the assessment of the potential effects of climate change on known pathosystems, but also on new pathogens which could alter the (still incompletely documented) impacts of pests and diseases on agricultural systems. Yield loss data collected in various current environments may no longer represent a adequate reference to develop tactical, decision-oriented, models for plant diseases and pests and their impacts, because of the ongoing changes in climate patterns. Process-based agricultural simulation modelling, on the other hand, appears to represent a viable methodology to estimate the impacts of these potential effects. A new generation of tools based on state-of-the-art knowledge and technologies is needed to allow systems analysis including key processes and their dynamics over appropriate suitable range of environmental variables. This paper offers a brief overview of the current state of development in coupling pest and disease models to crop models, and discusses technical and scientific challenges. We propose a five-stage roadmap to improve the simulation of the impacts caused by plant diseases and pests; i) improve the quality and availability of data for model inputs; ii) improve the quality and availability of data for model evaluation; iii) improve the integration with crop models; iv) improve the processes for model evaluation; and v) develop a community of plant pest and disease modelers.
We describe the organization of a nascent international effort, the Functional Annotation of Animal Genomes (FAANG) project, whose aim is to produce comprehensive maps of functional elements in the genomes of domesticated animal species.
Abstract Understanding the drivers of yield levels under climate change is required to support adaptation planning and respond to changing production risks. This study uses an ensemble of crop models applied on a spatial grid to quantify the contributions of various climatic drivers to past yield variability in grain maize and winter wheat of European cropping systems (1984–2009) and drivers of climate change impacts to 2050. Results reveal that for the current genotypes and mix of irrigated and rainfed production, climate change would lead to yield losses for grain maize and gains for winter wheat. Across Europe, on average heat stress does not increase for either crop in rainfed systems, while drought stress intensifies for maize only. In low-yielding years, drought stress persists as the main driver of losses for both crops, with elevated CO 2 offering no yield benefit in these years.