Centre de Biologie pour la Gestion des Populations
facilityMontferrier-sur-Lez, Occitanie, France
Research output, citation impact, and the most-cited recent papers from Centre de Biologie pour la Gestion des Populations (France). Aggregated across the NobleBlocks index of 300M+ scholarly works.
Top-cited papers from Centre de Biologie pour la Gestion des Populations
Microsatellite null alleles are commonly encountered in population genetics studies, yet little is known about their impact on the estimation of population differentiation. Computer simulations based on the coalescent were used to investigate the evolutionary dynamics of null alleles, their impact on F(ST) and genetic distances, and the efficiency of estimators of null allele frequency. Further, we explored how the existing method for correcting genotype data for null alleles performed in estimating F(ST) and genetic distances, and we compared this method with a new method proposed here (for F(ST) only). Null alleles were likely to be encountered in populations with a large effective size, with an unusually high mutation rate in the flanking regions, and that have diverged from the population from which the cloned allele state was drawn and the primers designed. When populations were significantly differentiated, F(ST) and genetic distances were overestimated in the presence of null alleles. Frequency of null alleles was estimated precisely with the algorithm presented in Dempster et al. (1977). The conventional method for correcting genotype data for null alleles did not provide an accurate estimate of F(ST) and genetic distances. However, the use of the genetic distance of Cavalli-Sforza and Edwards (1967) corrected by the conventional method gave better estimates than those obtained without correction. F(ST) estimation from corrected genotype frequencies performed well when restricted to visible allele sizes. Both the proposed method and the traditional correction method have been implemented in a program that is available free of charge at http://www.montpellier.inra.fr/URLB/. We used 2 published microsatellite data sets based on original and redesigned pairs of primers to empirically confirm our simulation results.
GENECLASS2 is a software that computes various genetic assignment criteria to assign or exclude reference populations as the origin of diploid or haploid individuals, as well as of groups of individuals, on the basis of multilocus genotype data. In addition to traditional assignment aims, the program allows the specific task of first-generation migrant detection. It includes several Monte Carlo resampling algorithms that compute for each individual its probability of belonging to each reference population or to be a resident (i.e., not a first-generation migrant) in the population where it was sampled. A user-friendly interface facilitates the treatment of large datasets.
Plant diversity strongly influences ecosystem functions and services, such as soil carbon storage. However, the mechanisms underlying the positive plant diversity effects on soil carbon storage are poorly understood. We explored this relationship using long-term data from a grassland biodiversity experiment (The Jena Experiment) and radiocarbon (14C) modelling. Here we show that higher plant diversity increases rhizosphere carbon inputs into the microbial community resulting in both increased microbial activity and carbon storage. Increases in soil carbon were related to the enhanced accumulation of recently fixed carbon in high-diversity plots, while plant diversity had less pronounced effects on the decomposition rate of existing carbon. The present study shows that elevated carbon storage at high plant diversity is a direct function of the soil microbial community, indicating that the increase in carbon storage is mainly limited by the integration of new carbon into soil and less by the decomposition of existing soil carbon. The mechanisms driving soil carbon storage, one of the largest stores of terrestrial carbon, remain poorly understood. Here, the authors present data from the long-term Jena Experiment on grassland biodiversity, showing that elevated carbon storage at high plant diversity is a direct function of increased soil microbial activity.
Genetic assignment methods use genotype likelihoods to draw inference about where individuals were or were not born, potentially allowing direct, real-time estimates of dispersal. We used simulated data sets to test the power and accuracy of Monte Carlo resampling methods in generating statistical thresholds for identifying F0 immigrants in populations with ongoing gene flow, and hence for providing direct, real-time estimates of migration rates. The identification of accurate critical values required that resampling methods preserved the linkage disequilibrium deriving from recent generations of immigrants and reflected the sampling variance present in the data set being analysed. A novel Monte Carlo resampling method taking into account these aspects was proposed and its efficiency was evaluated. Power and error were relatively insensitive to the frequency assumed for missing alleles. Power to identify F0 immigrants was improved by using large sample size (up to about 50 individuals) and by sampling all populations from which migrants may have originated. A combination of plotting genotype likelihoods and calculating mean genotype likelihood ratios (DLR) appeared to be an effective way to predict whether F0 immigrants could be identified for a particular pair of populations using a given set of markers.
The spider mite Tetranychus urticae is a cosmopolitan agricultural pest with an extensive host plant range and an extreme record of pesticide resistance. Here we present the completely sequenced and annotated spider mite genome, representing the first complete chelicerate genome. At 90 megabases T. urticae has the smallest sequenced arthropod genome. Compared with other arthropods, the spider mite genome shows unique changes in the hormonal environment and organization of the Hox complex, and also reveals evolutionary innovation of silk production. We find strong signatures of polyphagy and detoxification in gene families associated with feeding on different hosts and in new gene families acquired by lateral gene transfer. Deep transcriptome analysis of mites feeding on different plants shows how this pest responds to a changing host environment. The T. urticae genome thus offers new insights into arthropod evolution and plant–herbivore interactions, and provides unique opportunities for developing novel plant protection strategies. The genome of the spider mite Tetranychus urticae is sequenced, providing insights into its polyphagous feeding, silk production, hormonal repertoire and reduced Hox cluster. The spider mite (Tetranychus urticae) is a common agricultural pest that feeds on a wide range of hosts — including maize (corn), soya, tomatoes and peppers — and is notoriously resistant to pesticides. Its genome has now been sequenced and analysed, providing insights into its hormonal repertoire and the evolution of silk production. Transcriptome analysis of mites feeding on different plants reveals how this pest defends itself in a changing host environment and gives pointers to possible non-pesticide plant-protection strategies. The genome encodes 17 fibroin genes, and physical tests of spider-mite silk show it to be a natural nanomaterial with fibres that are more than 100 times thinner than those produced by silk spiders.
Abstract Geneland is a computer package that allows to make use of georeferenced individual multilocus genotypes for the inference of the number of populations and of the spatial location of genetic discontinuities between those populations. Main assumptions of the method are: (i) the number of populations is unknown and all values are considered a priori equally likely, (ii) populations are spread over areas given by a union of some polygons of unknown location in the spatial domain, (iii) Hardy–Weinberg equilibrium is assumed within each population and (iv) allele frequencies in each population are unknown and treated as random variable either following the so‐called Dirichlet model or Falush model. Different algorithms implemented in Geneland to perform inferences are first briefly presented. Then major running steps and outputs (i.e. histogram of number of populations and map of posterior probabilities of population membership) are illustrated from the analysis of a simulated data set, which was also produced by Geneland.
MOTIVATION: DIYABC is a software package for a comprehensive analysis of population history using approximate Bayesian computation on DNA polymorphism data. Version 2.0 implements a number of new features and analytical methods. It allows (i) the analysis of single nucleotide polymorphism data at large number of loci, apart from microsatellite and DNA sequence data, (ii) efficient Bayesian model choice using linear discriminant analysis on summary statistics and (iii) the serial launching of multiple post-processing analyses. DIYABC v2.0 also includes a user-friendly graphical interface with various new options. It can be run on three operating systems: GNU/Linux, Microsoft Windows and Apple Os X. AVAILABILITY: Freely available with a detailed notice document and example projects to academic users at http://www1.montpellier.inra.fr/CBGP/diyabc CONTACT: estoup@supagro.inra.fr Supplementary information: Supplementary data are available at Bioinformatics online.
Homoplasy has recently attracted the attention of population geneticists, as a consequence of the popularity of highly variable stepwise mutating markers such as microsatellites. Microsatellite alleles generally refer to DNA fragments of different size (electromorphs). Electromorphs are identical in state (i.e. have identical size), but are not necessarily identical by descent due to convergent mutation(s). Homoplasy occurring at microsatellites is thus referred to as size homoplasy. Using new analytical developments and computer simulations, we first evaluate the effect of the mutation rate, the mutation model, the effective population size and the time of divergence between populations on size homoplasy at the within and between population levels. We then review the few experimental studies that used various molecular techniques to detect size homoplasious events at some microsatellite loci. The relationship between this molecularly accessible size homoplasy size and the actual amount of size homoplasy is not trivial, the former being considerably influenced by the molecular structure of microsatellite core sequences. In a third section, we show that homoplasy at microsatellite electromorphs does not represent a significant problem for many types of population genetics analyses realized by molecular ecologists, the large amount of variability at microsatellite loci often compensating for their homoplasious evolution. The situations where size homoplasy may be more problematic involve high mutation rates and large population sizes together with strong allele size constraints.
Landscape genetics is a new discipline that aims to provide information on how landscape and environmental features influence population genetic structure. The first key step of landscape genetics is the spatial detection and location of genetic discontinuities between populations. However, efficient methods for achieving this task are lacking. In this article, we first clarify what is conceptually involved in the spatial modeling of genetic data. Then we describe a Bayesian model implemented in a Markov chain Monte Carlo scheme that allows inference of the location of such genetic discontinuities from individual geo-referenced multilocus genotypes, without a priori knowledge on populational units and limits. In this method, the global set of sampled individuals is modeled as a spatial mixture of panmictic populations, and the spatial organization of populations is modeled through the colored Voronoi tessellation. In addition to spatially locating genetic discontinuities, the method quantifies the amount of spatial dependence in the data set, estimates the number of populations in the studied area, assigns individuals to their population of origin, and detects individual migrants between populations, while taking into account uncertainty on the location of sampled individuals. The performance of the method is evaluated through the analysis of simulated data sets. Results show good performances for standard data sets (e.g., 100 individuals genotyped at 10 loci with 10 alleles per locus), with high but also low levels of population differentiation (e.g., FST<0.05). The method is then applied to a set of 88 individuals of wolverines (Gulo gulo) sampled in the northwestern United States and genotyped at 10 microsatellites.
UNLABELLED: Genetic data obtained on population samples convey information about their evolutionary history. Inference methods can extract part of this information but they require sophisticated statistical techniques that have been made available to the biologist community (through computer programs) only for simple and standard situations typically involving a small number of samples. We propose here a computer program (DIY ABC) for inference based on approximate Bayesian computation (ABC), in which scenarios can be customized by the user to fit many complex situations involving any number of populations and samples. Such scenarios involve any combination of population divergences, admixtures and population size changes. DIY ABC can be used to compare competing scenarios, estimate parameters for one or more scenarios and compute bias and precision measures for a given scenario and known values of parameters (the current version applies to unlinked microsatellite data). This article describes key methods used in the program and provides its main features. The analysis of one simulated and one real dataset, both with complex evolutionary scenarios, illustrates the main possibilities of DIY ABC. AVAILABILITY: The software DIY ABC is freely available at http://www.montpellier.inra.fr/CBGP/diyabc.
Detailed knowledge about the geographical pathways followed by propagules from their source to the invading populations--referred to here as routes of invasion-provides information about the history of the invasion process and the origin and genetic composition of the invading populations. The reconstruction of invasion routes is required for defining and testing different hypotheses concerning the environmental and evolutionary factors responsible for biological invasions. In practical terms, it facilitates the design of strategies for controlling or preventing invasions. Most of our knowledge about the introduction routes of invasive species is derived from historical and observational data, which are often sparse, incomplete and, sometimes, misleading. In this context, population genetics has proved a useful approach for reconstructing routes of introduction, highlighting the complexity and the often counterintuitive nature of the true story. This approach has proved particularly useful since the recent development of new model-based methods, such as approximate Bayesian computation, making it possible to make quantitative inferences in the complex evolutionary scenarios typically encountered in invasive species. In this review, we summarize some of the fundamental aspects of routes of invasion, explain why the reconstruction of these routes is useful for addressing both practical and theoretical questions, and comment on the various reconstruction methods available. Finally, we consider the main insights obtained to date from studies of invasion routes.
Most eukaryotic organisms are arthropods. Yet, their diversity in rich terrestrial ecosystems is still unknown. Here we produce tangible estimates of the total species richness of arthropods in a tropical rainforest. Using a comprehensive range of structured protocols, we sampled the phylogenetic breadth of arthropod taxa from the soil to the forest canopy in the San Lorenzo forest, Panama. We collected 6144 arthropod species from 0.48 hectare and extrapolated total species richness to larger areas on the basis of competing models. The whole 6000-hectare forest reserve most likely sustains 25,000 arthropod species. Notably, just 1 hectare of rainforest yields >60% of the arthropod biodiversity held in the wider landscape. Models based on plant diversity fitted the accumulated species richness of both herbivore and nonherbivore taxa exceptionally well. This lends credence to global estimates of arthropod biodiversity developed from plant models.
Sequential techniques can enhance the efficiency of the approximate Bayesian computation algorithm, as in Sisson et al.’s (2007) partial rejection control version. While this method is based upon the theoretical works of Del Moral et al. (2006), the application to approximate Bayesian computation results in a bias in the approximation to the posterior. An alternative version based on genuine importance sampling arguments bypasses this difficulty, in connection with the population Monte Carlo method of Cappé et al. (2004), and it includes an automatic scaling of the forward kernel. When applied to a population genetics example, it compares favourably with two other versions of the approximate algorithm.
SUMMARY: QDD is an open access program providing a user-friendly tool for microsatellite detection and primer design from large sets of DNA sequences. The program is designed to deal with all steps of treatment of raw sequences obtained from pyrosequencing of enriched DNA libraries, but it is also applicable to data obtained through other sequencing methods, using FASTA files as input. The following tasks are completed by QDD: tag sorting, adapter/vector removal, elimination of redundant sequences, detection of possible genomic multicopies (duplicated loci or transposable elements), stringent selection of target microsatellites and customizable primer design. It can treat up to one million sequences of a few hundred base pairs in the tag-sorting step, and up to 50,000 sequences in a single input file for the steps involving estimation of sequence similarity. AVAILABILITY: QDD is freely available under the GPL licence for Windows and Linux from the following web site: http://www.univ-provence.fr/gsite/Local/egee/dir/meglecz/QDD.html. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
UNLABELLED: With the development of next-generation sequencing and genotyping approaches, large single nucleotide polymorphism haplotype datasets are becoming available in a growing number of both model and non-model species. Identifying genomic regions with unexpectedly high local haplotype homozygosity relatively to neutral expectation represents a powerful strategy to ascertain candidate genes responding to natural or artificial selection. To facilitate genome-wide scans of selection based on the analysis of long-range haplotypes, we developed the R package rehh. It provides a versatile tool to detect the footprints of recent or ongoing selection with several graphical functions that help visual interpretation of the results. AVAILABILITY AND IMPLEMENTATION: Stable version is available from CRAN: http://cran.r-project.org/. Development version is available from the R-forge repository: http://r-forge.r-project.org/projects/rehh. Both versions can be installed directly from R. Function documentation and example data files are provided within the package and a tutorial is available as Supplementary Material. rehh is distributed under the GNU General Public Licence (GPL ≥ 2).
The efficacy of disease resistance genes in plants decreases over time because of the selection of virulent pathogen genotypes. A key goal of crop protection programs is to increase the durability of the resistance conferred by these genes. The spatial and temporal deployment of plant disease resistance genes is considered to be a major factor determining their durability. In the literature, four principal strategies combining resistance genes over time and space have been considered to delay the evolution of virulent pathogen genotypes. We reviewed this literature with the aim of determining which deployment strategy results in the greatest durability of resistance genes. Although theoretical and empirical studies comparing deployment strategies of more than one resistance gene are very scarce, they suggest that the overall durability of disease resistance genes can be increased by combining their presence in the same plant (pyramiding). Retrospective analyses of field monitoring data also suggest that the pyramiding of disease resistance genes within a plant is the most durable strategy. By extension, we suggest that the combination of disease resistance genes with other practices for pathogen control (pesticides, farming practices) may be a relevant management strategy to slow down the evolution of virulent pathogen genotypes.
In population genomics studies, accounting for the neutral covariance structure across population allele frequencies is critical to improve the robustness of genome-wide scan approaches. Elaborating on the BayEnv model, this study investigates several modeling extensions (i) to improve the estimation accuracy of the population covariance matrix and all the related measures, (ii) to identify significantly overly differentiated SNPs based on a calibration procedure of the XtX statistics, and (iii) to consider alternative covariate models for analyses of association with population-specific covariables. In particular, the auxiliary variable model allows one to deal with multiple testing issues and, providing the relative marker positions are available, to capture some linkage disequilibrium information. A comprehensive simulation study was carried out to evaluate the performances of these different models. Also, when compared in terms of power, robustness, and computational efficiency to five other state-of-the-art genome-scan methods (BayEnv2, BayScEnv, BayScan, flk, and lfmm), the proposed approaches proved highly effective. For illustration purposes, genotyping data on 18 French cattle breeds were analyzed, leading to the identification of 13 strong signatures of selection. Among these, four (surrounding the KITLG, KIT, EDN3, and ALB genes) contained SNPs strongly associated with the piebald coloration pattern while a fifth (surrounding PLAG1) could be associated to morphological differences across the populations. Finally, analysis of Pool-Seq data from 12 populations of Littorina saxatilis living in two different ecotypes illustrates how the proposed framework might help in addressing relevant ecological issues in nonmodel species. Overall, the proposed methods define a robust Bayesian framework to characterize adaptive genetic differentiation across populations. The BayPass program implementing the different models is available at http://www1.montpellier.inra.fr/CBGP/software/baypass/.
Fermented beverages and foods have played a significant role in most societies worldwide for millennia. To better understand how the yeast species Saccharomyces cerevisiae, the main fermenting agent, evolved along this historical and expansion process, we analysed the genetic diversity among 651 strains from 56 different geographical origins, worldwide. Their genotyping at 12 microsatellite loci revealed 575 distinct genotypes organized in subgroups of yeast types, i.e. bread, beer, wine, sake. Some of these groups presented unexpected relatedness: Bread strains displayed a combination of alleles intermediate between beer and wine strains, and strains used for rice wine and sake were most closely related to beer and bread strains. However, up to 28% of genetic diversity between these technological groups was associated with geographical differences which suggests local domestications. Focusing on wine yeasts, a group of Lebanese strains were basal in an F(ST) tree, suggesting a Mesopotamia-based origin of most wine strains. In Europe, migration of wine strains occurred through the Danube Valley, and around the Mediterranean Sea. An approximate Bayesian computation approach suggested a postglacial divergence (most probable period 10,000-12,000 bp). As our results suggest intimate association between man and wine yeast across centuries, we hypothesize that yeast followed man and vine migrations as a commensal member of grapevine flora.
BACKGROUND: Approximate Bayesian computation (ABC) is a recent flexible class of Monte-Carlo algorithms increasingly used to make model-based inference on complex evolutionary scenarios that have acted on natural populations. The software DIYABC offers a user-friendly interface allowing non-expert users to consider population histories involving any combination of population divergences, admixtures and population size changes. We here describe and illustrate new developments of this software that mainly include (i) inference from DNA sequence data in addition or separately to microsatellite data, (ii) the possibility to analyze five categories of loci considering balanced or non balanced sex ratios: autosomal diploid, autosomal haploid, X-linked, Y-linked and mitochondrial, and (iii) the possibility to perform model checking computation to assess the "goodness-of-fit" of a model, a feature of ABC analysis that has been so far neglected. RESULTS: We used controlled simulated data sets generated under evolutionary scenarios involving various divergence and admixture events to evaluate the effect of mixing autosomal microsatellite, mtDNA and/or nuclear autosomal DNA sequence data on inferences. This evaluation included the comparison of competing scenarios and the quantification of their relative support, and the estimation of parameter posterior distributions under a given scenario. We also considered a set of scenarios often compared when making ABC inferences on the routes of introduction of invasive species to illustrate the interest of the new model checking option of DIYABC to assess model misfit. CONCLUSIONS: Our new developments of the integrated software DIYABC should be particularly useful to make inference on complex evolutionary scenarios involving both recent and ancient historical events and using various types of molecular markers in diploid or haploid organisms. They offer a handy way for non-expert users to achieve model checking computation within an ABC framework, hence filling up a gap of ABC analysis. The software DIYABC V1.0 is freely available at http://www1.montpellier.inra.fr/CBGP/diyabc.
Recent studies of the routes of worldwide introductions of alien organisms suggest that many widespread invasions could have stemmed not from the native range, but from a particularly successful invasive population, which serves as the source of colonists for remote new territories. We call here this phenomenon the invasive bridgehead effect. Evaluating the likelihood of such a scenario is heuristically challenging. We solved this problem by using approximate Bayesian computation methods to quantitatively compare complex invasion scenarios based on the analysis of population genetics (microsatellite variation) and historical (first observation dates) data. We applied this approach to the Harlequin ladybird Harmonia axyridis (HA), a coccinellid native to Asia that was repeatedly introduced as a biocontrol agent without becoming established for decades. We show that the recent burst of worldwide invasions of HA followed a bridgehead scenario, in which an invasive population in eastern North America acted as the source of the colonists that invaded the European, South American and African continents, with some admixture with a biocontrol strain in Europe. This demonstration of a mechanism of invasion via a bridgehead has important implications both for invasion theory (i.e., a single evolutionary shift in the bridgehead population versus multiple changes in case of introduced populations becoming invasive independently) and for ongoing efforts to manage invasions by alien organisms (i.e., heightened vigilance against invasive bridgeheads).