University of Canberra
UniversityCanberra, Australia
Research output, citation impact, and the most-cited recent papers from University of Canberra (Australia). Aggregated across the NobleBlocks index of 300M+ scholarly works.
Top-cited papers from University of Canberra
Collinearity refers to the non independence of predictor variables, usually in a regression‐type analysis. It is a common feature of any descriptive ecological data set and can be a problem for parameter estimation because it inflates the variance of regression parameters and hence potentially leads to the wrong identification of relevant predictors in a statistical model. Collinearity is a severe problem when a model is trained on data from one region or time, and predicted to another with a different or unknown structure of collinearity. To demonstrate the reach of the problem of collinearity in ecology, we show how relationships among predictors differ between biomes, change over spatial scales and through time. Across disciplines, different approaches to addressing collinearity problems have been developed, ranging from clustering of predictors, threshold‐based pre‐selection, through latent variable methods, to shrinkage and regularisation. Using simulated data with five predictor‐response relationships of increasing complexity and eight levels of collinearity we compared ways to address collinearity with standard multiple regression and machine‐learning approaches. We assessed the performance of each approach by testing its impact on prediction to new data. In the extreme, we tested whether the methods were able to identify the true underlying relationship in a training dataset with strong collinearity by evaluating its performance on a test dataset without any collinearity. We found that methods specifically designed for collinearity, such as latent variable methods and tree based models, did not outperform the traditional GLM and threshold‐based pre‐selection. Our results highlight the value of GLM in combination with penalised methods (particularly ridge) and threshold‐based pre‐selection when omitted variables are considered in the final interpretation. However, all approaches tested yielded degraded predictions under change in collinearity structure and the ‘folk lore’‐thresholds of correlation coefficients between predictor variables of |r| >0.7 was an appropriate indicator for when collinearity begins to severely distort model estimation and subsequent prediction. The use of ecological understanding of the system in pre‐analysis variable selection and the choice of the least sensitive statistical approaches reduce the problems of collinearity, but cannot ultimately solve them.
BACKGROUND: The Global Burden of Diseases, Injuries, and Risk Factors Study 2015 provides an up-to-date synthesis of the evidence for risk factor exposure and the attributable burden of disease. By providing national and subnational assessments spanning the past 25 years, this study can inform debates on the importance of addressing risks in context. METHODS: We used the comparative risk assessment framework developed for previous iterations of the Global Burden of Disease Study to estimate attributable deaths, disability-adjusted life-years (DALYs), and trends in exposure by age group, sex, year, and geography for 79 behavioural, environmental and occupational, and metabolic risks or clusters of risks from 1990 to 2015. This study included 388 risk-outcome pairs that met World Cancer Research Fund-defined criteria for convincing or probable evidence. We extracted relative risk and exposure estimates from randomised controlled trials, cohorts, pooled cohorts, household surveys, census data, satellite data, and other sources. We used statistical models to pool data, adjust for bias, and incorporate covariates. We developed a metric that allows comparisons of exposure across risk factors-the summary exposure value. Using the counterfactual scenario of theoretical minimum risk level, we estimated the portion of deaths and DALYs that could be attributed to a given risk. We decomposed trends in attributable burden into contributions from population growth, population age structure, risk exposure, and risk-deleted cause-specific DALY rates. We characterised risk exposure in relation to a Socio-demographic Index (SDI). FINDINGS: Between 1990 and 2015, global exposure to unsafe sanitation, household air pollution, childhood underweight, childhood stunting, and smoking each decreased by more than 25%. Global exposure for several occupational risks, high body-mass index (BMI), and drug use increased by more than 25% over the same period. All risks jointly evaluated in 2015 accounted for 57·8% (95% CI 56·6-58·8) of global deaths and 41·2% (39·8-42·8) of DALYs. In 2015, the ten largest contributors to global DALYs among Level 3 risks were high systolic blood pressure (211·8 million [192·7 million to 231·1 million] global DALYs), smoking (148·6 million [134·2 million to 163·1 million]), high fasting plasma glucose (143·1 million [125·1 million to 163·5 million]), high BMI (120·1 million [83·8 million to 158·4 million]), childhood undernutrition (113·3 million [103·9 million to 123·4 million]), ambient particulate matter (103·1 million [90·8 million to 115·1 million]), high total cholesterol (88·7 million [74·6 million to 105·7 million]), household air pollution (85·6 million [66·7 million to 106·1 million]), alcohol use (85·0 million [77·2 million to 93·0 million]), and diets high in sodium (83·0 million [49·3 million to 127·5 million]). From 1990 to 2015, attributable DALYs declined for micronutrient deficiencies, childhood undernutrition, unsafe sanitation and water, and household air pollution; reductions in risk-deleted DALY rates rather than reductions in exposure drove these declines. Rising exposure contributed to notable increases in attributable DALYs from high BMI, high fasting plasma glucose, occupational carcinogens, and drug use. Environmental risks and childhood undernutrition declined steadily with SDI; low physical activity, high BMI, and high fasting plasma glucose increased with SDI. In 119 countries, metabolic risks, such as high BMI and fasting plasma glucose, contributed the most attributable DALYs in 2015. Regionally, smoking still ranked among the leading five risk factors for attributable DALYs in 109 countries; childhood underweight and unsafe sex remained primary drivers of early death and disability in much of sub-Saharan Africa. INTERPRETATION: Declines in some key environmental risks have contributed to declines in critical infectious diseases. Some risks appear to be invariant to SDI. Increasing risks, including high BMI, high fasting plasma glucose, drug use, and some occupational exposures, contribute to rising burden from some conditions, but also provide opportunities for intervention. Some highly preventable risks, such as smoking, remain major causes of attributable DALYs, even as exposure is declining. Public policy makers need to pay attention to the risks that are increasingly major contributors to global burden. FUNDING: Bill & Melinda Gates Foundation.
INTRODUCTION: Appraising the quality of studies included in systematic reviews combining qualitative and quantitative evidence is challenging. To address this challenge, a critical appraisal tool was developed: the Mixed Methods Appraisal Tool (MMAT). The aim of this paper is to present the enhance ments made to the MMAT. DEVELOPMENT: The MMAT was initially developed in 2006 based on a literature review on systematic reviews combining qualitative and quantitative evidence. It was subject to pilot and interrater reliability testing. A revised version of the MMAT was developed in 2018 based on the results from usefulness testing, a literature review on critical appraisal tools and a modified e-Delphi study with methodological experts to identify core criteria. TOOL DESCRIPTION: The MMAT assesses the quality of qualitative, quantitative, and mixed methods studies. It focuses on methodological criteria and includes five core quality criteria for each of the following five categories of study designs: (a) qualitative, (b) randomized controlled, (c) nonrandomized, (d) quantitative descriptive, and (e) mixed methods. CONCLUSION: The MMAT is a unique tool that can be used to appraise the quality of different study designs. Also, by limiting to core criteria, the MMAT can provide a more efficient appraisal.
Due to the advantages of deep learning, in this paper, a regularized deep feature extraction (FE) method is presented for hyperspectral image (HSI) classification using a convolutional neural network (CNN). The proposed approach employs several convolutional and pooling layers to extract deep features from HSIs, which are nonlinear, discriminant, and invariant. These features are useful for image classification and target detection. Furthermore, in order to address the common issue of imbalance between high dimensionality and limited availability of training samples for the classification of HSI, a few strategies such as L2 regularization and dropout are investigated to avoid overfitting in class data modeling. More importantly, we propose a 3-D CNN-based FE model with combined regularization to extract effective spectral-spatial features of hyperspectral imagery. Finally, in order to further improve the performance, a virtual sample enhanced method is proposed. The proposed approaches are carried out on three widely used hyperspectral data sets: Indian Pines, University of Pavia, and Kennedy Space Center. The obtained results reveal that the proposed models with sparse constraints provide competitive results to state-of-the-art methods. In addition, the proposed deep FE opens a new window for further research.
The term “urban stream syndrome” describes the consistently observed ecological degradation of streams draining urban land. This paper reviews recent literature to describe symptoms of the syndrome, explores mechanisms driving the syndrome, and identifies appropriate goals and methods for ecological restoration of urban streams. Symptoms of the urban stream syndrome include a flashier hydrograph, elevated concentrations of nutrients and contaminants, altered channel morphology, and reduced biotic richness, with increased dominance of tolerant species. More research is needed before generalizations can be made about urban effects on stream ecosystem processes, but reduced nutrient uptake has been consistently reported. The mechanisms driving the syndrome are complex and interactive, but most impacts can be ascribed to a few major large-scale sources, primarily urban stormwater runoff delivered to streams by hydraulically efficient drainage systems. Other stressors, such as combined or sanitary sewer overflows, wastewater treatment plant effluents, and legacy pollutants (long-lived pollutants from earlier land uses) can obscure the effects of stormwater runoff. Most research on urban impacts to streams has concentrated on correlations between instream ecological metrics and total catchment imperviousness. Recent research shows that some of the variance in such relationships can be explained by the distance between the stream reach and urban land, or by the hydraulic efficiency of stormwater drainage. The mechanisms behind such patterns require experimentation at the catchment scale to identify the best management approaches to conservation and restoration of streams in urban catchments. Remediation of stormwater impacts is most likely to be achieved through widespread application of innovative approaches to drainage design. Because humans dominate urban ecosystems, research on urban stream ecology will require a broadening of stream ecological research to integrate with social, behavioral, and economic research.
Plant traits-the morphological, anatomical, physiological, biochemical and phenological characteristics of plants-determine how plants respond to environmental factors, affect other trophic levels, and influence ecosystem properties and their benefits and detriments to people. Plant trait data thus represent the basis for a vast area of research spanning from evolutionary biology, community and functional ecology, to biodiversity conservation, ecosystem and landscape management, restoration, biogeography and earth system modelling. Since its foundation in 2007, the TRY database of plant traits has grown continuously. It now provides unprecedented data coverage under an open access data policy and is the main plant trait database used by the research community worldwide. Increasingly, the TRY database also supports new frontiers of trait-based plant research, including the identification of data gaps and the subsequent mobilization or measurement of new data. To support this development, in this article we evaluate the extent of the trait data compiled in TRY and analyse emerging patterns of data coverage and representativeness. Best species coverage is achieved for categorical traits-almost complete coverage for 'plant growth form'. However, most traits relevant for ecology and vegetation modelling are characterized by continuous intraspecific variation and trait-environmental relationships. These traits have to be measured on individual plants in their respective environment. Despite unprecedented data coverage, we observe a humbling lack of completeness and representativeness of these continuous traits in many aspects. We, therefore, conclude that reducing data gaps and biases in the TRY database remains a key challenge and requires a coordinated approach to data mobilization and trait measurements. This can only be achieved in collaboration with other initiatives.
OBJECTIVE: To assess the cancer risk in children and adolescents following exposure to low dose ionising radiation from diagnostic computed tomography (CT) scans. DESIGN: Population based, cohort, data linkage study in Australia. COHORT MEMBERS: 10.9 million people identified from Australian Medicare records, aged 0-19 years on 1 January 1985 or born between 1 January 1985 and 31 December 2005; all exposures to CT scans funded by Medicare during 1985-2005 were identified for this cohort. Cancers diagnosed in cohort members up to 31 December 2007 were obtained through linkage to national cancer records. MAIN OUTCOME: Cancer incidence rates in individuals exposed to a CT scan more than one year before any cancer diagnosis, compared with cancer incidence rates in unexposed individuals. RESULTS: 60,674 cancers were recorded, including 3150 in 680,211 people exposed to a CT scan at least one year before any cancer diagnosis. The mean duration of follow-up after exposure was 9.5 years. Overall cancer incidence was 24% greater for exposed than for unexposed people, after accounting for age, sex, and year of birth (incidence rate ratio (IRR) 1.24 (95% confidence interval 1.20 to 1.29); P<0.001). We saw a dose-response relation, and the IRR increased by 0.16 (0.13 to 0.19) for each additional CT scan. The IRR was greater after exposure at younger ages (P<0.001 for trend). At 1-4, 5-9, 10-14, and 15 or more years since first exposure, IRRs were 1.35 (1.25 to 1.45), 1.25 (1.17 to 1.34), 1.14 (1.06 to 1.22), and 1.24 (1.14 to 1.34), respectively. The IRR increased significantly for many types of solid cancer (digestive organs, melanoma, soft tissue, female genital, urinary tract, brain, and thyroid); leukaemia, myelodysplasia, and some other lymphoid cancers. There was an excess of 608 cancers in people exposed to CT scans (147 brain, 356 other solid, 48 leukaemia or myelodysplasia, and 57 other lymphoid). The absolute excess incidence rate for all cancers combined was 9.38 per 100,000 person years at risk, as of 31 December 2007. The average effective radiation dose per scan was estimated as 4.5 mSv. CONCLUSIONS: The increased incidence of cancer after CT scan exposure in this cohort was mostly due to irradiation. Because the cancer excess was still continuing at the end of follow-up, the eventual lifetime risk from CT scans cannot yet be determined. Radiation doses from contemporary CT scans are likely to be lower than those in 1985-2005, but some increase in cancer risk is still likely from current scans. Future CT scans should be limited to situations where there is a definite clinical indication, with every scan optimised to provide a diagnostic CT image at the lowest possible radiation dose.
Heatwaves have increased in intensity, frequency and duration, with these trends projected to worsen under enhanced global warming. Understanding regional heatwave trends has critical implications for the biophysical and human systems they impact. Until now a comprehensive assessment of regional observed changes was hindered by the range of metrics employed, underpinning datasets, and time periods examined. Here, using the Berkeley Earth temperature dataset and key heatwave metrics, we systematically examine regional and global observed heatwave trends. In almost all regions, heatwave frequency demonstrates the most rapid and significant change. A measure of cumulative heat shows significant increases almost everywhere since the 1950s, mainly driven by heatwave days. Trends in heatwave frequency, duration and cumulative heat have accelerated since the 1950s, and due to the high influence of variability we recommend regional trends are assessed over multiple decades. Our results provide comparable regional observed heatwave trends, on spatial and temporal scales necessary for understanding impacts.
This paper summarises an ambitious research agenda aiming to uncover the factors that affect help-seeking among young people for mental health problems. The research set out to consider why young people, and particularly young males, do not seek help when they are in psychological distress or suicidal; how professional services be made more accessible and attractive to young people; the factors that inhibit and facilitate help-seeking; and how community gatekeepers can support young people to access services to help with personal and emotional problems. A range of studies was undertaken in New South Wales, Queensland and the ACT, using both qualitative and quantitative approaches. Data from a total of 2721 young people aged 14–24 years were gathered, as well as information from some of the community gatekeepers to young people’s mental health care.Help-seeking was measured in all the studies using the General Help Seeking Questionnaire (Wilson, Deane, Ciarrochi & Rickwood, 2005), which measures future help-seeking intentions and, through supplementary questions, can also assess prior help-seeking experience. Many of the studies also measured recent help-seeking behaviour using the Actual Help Seeking Questionnaire. The types of mental health problems examined varied across the studies and included depressive symptoms, personal-emotional problems, and suicidal thoughts.The help-seeking process was conceptualised using a framework developed during the research program. This framework maintains that help-seeking is a process of translating the very personal domain of psychological distress to the interpersonal domain of seeking help. Factors that were expected to facilitate or inhibit this translation process were investigated. These included factors that determine awareness of the personal domain of psychological distress and that affect the ability to articulate or express this personal domain to others, as well as willingness to disclose mental health issues to other people.The results are reported in terms of: patterns of help-seeking across adolescence and young adulthood; the relationship of help-seeking intentions to behaviour; barriers to seeking help—lack of emotional competence, the help-negation effect related to suicidal thoughts, negative attitudes and beliefs about help-seeking and fear of stigma; and facilitators of seeking help—emotional competence, positive past experience, mental health literacy, and supportive social influences. The paper considers the implications of the findings for the development of interventions to encourage young people to seek help for their mental health problems, and concludes by identifying gaps in the help-seeking research and literature and suggesting future directions.
The problem of robustly stabilizing a linear uncertain system is considered with emphasis on the interplay between the time-domain results on the quadratic stabilization of uncertain systems and the frequency-domain results on H/sup infinity / optimization. A complete solution to a certain quadratic stabilization problem in which uncertainty enters both the state and the input matrices of the system is given. Relations between these robust stabilization problems and H/sup infinity / control theory are explored. It is also shown that in a number of cases, if a robust stabilization problem can be solved via Lyapunov methods, then it can be also be solved via H/sup infinity / control theory-based methods.< <ETX xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">></ETX>
Introduction: Several studies have measured health outcomes in the United States, but none have provided a comprehensive assessment of patterns of health by state. Objective: To use the results of the Global Burden of Disease Study (GBD) to report trends in the burden of diseases, injuries, and risk factors at the state level from 1990 to 2016. Design and Setting: A systematic analysis of published studies and available data sources estimates the burden of disease by age, sex, geography, and year. Main Outcomes and Measures: Prevalence, incidence, mortality, life expectancy, healthy life expectancy (HALE), years of life lost (YLLs) due to premature mortality, years lived with disability (YLDs), and disability-adjusted life-years (DALYs) for 333 causes and 84 risk factors with 95% uncertainty intervals (UIs) were computed. Results: Between 1990 and 2016, overall death rates in the United States declined from 745.2 (95% UI, 740.6 to 749.8) per 100 000 persons to 578.0 (95% UI, 569.4 to 587.1) per 100 000 persons. The probability of death among adults aged 20 to 55 years declined in 31 states and Washington, DC from 1990 to 2016. In 2016, Hawaii had the highest life expectancy at birth (81.3 years) and Mississippi had the lowest (74.7 years), a 6.6-year difference. Minnesota had the highest HALE at birth (70.3 years), and West Virginia had the lowest (63.8 years), a 6.5-year difference. The leading causes of DALYs in the United States for 1990 and 2016 were ischemic heart disease and lung cancer, while the third leading cause in 1990 was low back pain, and the third leading cause in 2016 was chronic obstructive pulmonary disease. Opioid use disorders moved from the 11th leading cause of DALYs in 1990 to the 7th leading cause in 2016, representing a 74.5% (95% UI, 42.8% to 93.9%) change. In 2016, each of the following 6 risks individually accounted for more than 5% of risk-attributable DALYs: tobacco consumption, high body mass index (BMI), poor diet, alcohol and drug use, high fasting plasma glucose, and high blood pressure. Across all US states, the top risk factors in terms of attributable DALYs were due to 1 of the 3 following causes: tobacco consumption (32 states), high BMI (10 states), or alcohol and drug use (8 states). Conclusions and Relevance: There are wide differences in the burden of disease at the state level. Specific diseases and risk factors, such as drug use disorders, high BMI, poor diet, high fasting plasma glucose level, and alcohol use disorders are increasing and warrant increased attention. These data can be used to inform national health priorities for research, clinical care, and policy.
We developed a model in which leader-member exchange mediated between perceived transformational leadership behaviors and followers' task performance and organizational citizenship behaviors. Our sample comprised 162 leader-follower dyads within organizations situated throughout the People's Republic of China. We showed that leader-member exchange fully mediated between transformational leadership and task performance as well as organizational citizenship behaviors. Implications for the theory and practice of leadership are discussed, and future research directions offered.
BACKGROUND: Physical exercise is seen as a promising intervention to prevent or delay cognitive decline in individuals aged 50 years and older, yet the evidence from reviews is not conclusive. OBJECTIVES: To determine if physical exercise is effective in improving cognitive function in this population. DESIGN: Systematic review with multilevel meta-analysis. DATA SOURCES: Electronic databases Medline (PubMed), EMBASE (Scopus), PsychINFO and CENTRAL (Cochrane) from inception to November 2016. ELIGIBILITY CRITERIA: Randomised controlled trials of physical exercise interventions in community-dwelling adults older than 50 years, with an outcome measure of cognitive function. RESULTS: The search returned 12 820 records, of which 39 studies were included in the systematic review. Analysis of 333 dependent effect sizes from 36 studies showed that physical exercise improved cognitive function (0.29; 95% CI 0.17 to 0.41; p<0.01). Interventions of aerobic exercise, resistance training, multicomponent training and tai chi, all had significant point estimates. When exercise prescription was examined, a duration of 45-60 min per session and at least moderate intensity, were associated with benefits to cognition. The results of the meta-analysis were consistent and independent of the cognitive domain tested or the cognitive status of the participants. CONCLUSIONS: Physical exercise improved cognitive function in the over 50s, regardless of the cognitive status of participants. To improve cognitive function, this meta-analysis provides clinicians with evidence to recommend that patients obtain both aerobic and resistance exercise of at least moderate intensity on as many days of the week as feasible, in line with current exercise guidelines.
OBJECTIVE: The mixed methods appraisal tool (MMAT) was developed for critically appraising different study designs. This study aimed to improve the content validity of three of the five categories of studies in the MMAT by identifying relevant methodological criteria for appraising the quality of qualitative, survey, and mixed methods studies. STUDY DESIGN AND SETTING: First, we performed a literature review to identify critical appraisal tools and extract methodological criteria. Second, we conducted a two-round modified e-Delphi technique. We asked three method-specific panels of experts to rate the relevance of each criterion on a five-point Likert scale. RESULTS: A total of 383 criteria were extracted from 18 critical appraisal tools and a literature review on the quality of mixed methods studies, and 60 were retained. In the first and second rounds of the e-Delphi, 73 and 56 experts participated, respectively. Consensus was reached for six qualitative criteria, eight survey criteria, and seven mixed methods criteria. These results led to modifications of eight of the 11 MMAT (version 2011) criteria. Specifically, we reformulated two criteria, replaced four, and removed two. Moreover, we added six new criteria. CONCLUSION: Results of this study led to improve the content validity of this tool, revise it, and propose a new version (MMAT version 2018).
The contrast-response function of a class of first order interneurons in the fly's compound eye approximates to the cumulative probability distribution of contrast levels in natural scenes. Elementary information theory shows that this matching enables the neurons to encode contrast fluctuations most efficiently.
An important component of the biological assessment of stream condition is an evaluation of the direct or indirect effects of human activities or disturbances. The concept of a "reference condition" is increasingly used to describe the standard or benchmark against which current condition is compared. Many individual nations, and the European Union as a whole, have codified the concept of reference condition in legislation aimed at protecting and improving the ecological condition of streams. However, the phrase "reference condition" has many meanings in a variety of contexts. One of the primary purposes of this paper is to bring some consistency to the use of the term. We argue the need for a "reference condition" term that is reserved for referring to the "naturalness" of the biota (structure and function) and that naturalness implies the absence of significant human disturbance or alteration. To avoid the confusion that arises when alternative definitions of reference condition are used, we propose that the original concept of reference condition be preserved in this modified form of the term: "reference condition for biological integrity," or RC(BI). We further urge that these specific terms be used to refer to the concepts and methods used in individual bioassessments to characterize the expected condition to which current conditions are compared: "minimally disturbed condition" (MDC); "historical condition" (HC); "least disturbed condition" (LDC); and "best attainable condition" (BAC). We argue that each of these concepts can be narrowly defined, and each implies specific methods for estimating expectations. We also describe current methods by which these expectations are estimated including: the reference-site approach (condition at minimally or least-disturbed sites); best professional judgment; interpretation of historical condition; extrapolation of empirical models; and evaluation of ambient distributions. Because different assumptions about what constitutes reference condition will have important effects on the final classification of streams into condition classes, we urge that bioassessments be consistent in describing the definitions and methods used to set expectations.
BACKGROUND: The number of mental health apps (MHapps) developed and now available to smartphone users has increased in recent years. MHapps and other technology-based solutions have the potential to play an important part in the future of mental health care; however, there is no single guide for the development of evidence-based MHapps. Many currently available MHapps lack features that would greatly improve their functionality, or include features that are not optimized. Furthermore, MHapp developers rarely conduct or publish trial-based experimental validation of their apps. Indeed, a previous systematic review revealed a complete lack of trial-based evidence for many of the hundreds of MHapps available. OBJECTIVE: To guide future MHapp development, a set of clear, practical, evidence-based recommendations is presented for MHapp developers to create better, more rigorous apps. METHODS: A literature review was conducted, scrutinizing research across diverse fields, including mental health interventions, preventative health, mobile health, and mobile app design. RESULTS: Sixteen recommendations were formulated. Evidence for each recommendation is discussed, and guidance on how these recommendations might be integrated into the overall design of an MHapp is offered. Each recommendation is rated on the basis of the strength of associated evidence. It is important to design an MHapp using a behavioral plan and interactive framework that encourages the user to engage with the app; thus, it may not be possible to incorporate all 16 recommendations into a single MHapp. CONCLUSIONS: Randomized controlled trials are required to validate future MHapps and the principles upon which they are designed, and to further investigate the recommendations presented in this review. Effective MHapps are required to help prevent mental health problems and to ease the burden on health systems.
Nucleotide sequence and taxonomy reference databases are critical resources for widespread applications including marker-gene and metagenome sequencing for microbiome analysis, diet metabarcoding, and environmental DNA (eDNA) surveys. Reproducibly generating, managing, using, and evaluating nucleotide sequence and taxonomy reference databases creates a significant bottleneck for researchers aiming to generate custom sequence databases. Furthermore, database composition drastically influences results, and lack of standardization limits cross-study comparisons. To address these challenges, we developed RESCRIPt, a Python 3 software package and QIIME 2 plugin for reproducible generation and management of reference sequence taxonomy databases, including dedicated functions that streamline creating databases from popular sources, and functions for evaluating, comparing, and interactively exploring qualitative and quantitative characteristics across reference databases. To highlight the breadth and capabilities of RESCRIPt, we provide several examples for working with popular databases for microbiome profiling (SILVA, Greengenes, NCBI-RefSeq, GTDB), eDNA and diet metabarcoding surveys (BOLD, GenBank), as well as for genome comparison. We show that bigger is not always better, and reference databases with standardized taxonomies and those that focus on type strains have quantitative advantages, though may not be appropriate for all use cases. Most databases appear to benefit from some curation (quality filtering), though sequence clustering appears detrimental to database quality. Finally, we demonstrate the breadth and extensibility of RESCRIPt for reproducible workflows with a comparison of global hepatitis genomes. RESCRIPt provides tools to democratize the process of reference database acquisition and management, enabling researchers to reproducibly and transparently create reference materials for diverse research applications. RESCRIPt is released under a permissive BSD-3 license at https://github.com/bokulich-lab/RESCRIPt.
Abstract We propose an integrated, heuristic model of lotic biocomplexity across spatiotemporal scales from headwaters to large rivers. This riverine ecosystem synthesis (RES) provides a framework for understanding both broad, often discontinuous patterns along longitudinal and lateral dimensions of river networks and local ecological patterns across various temporal and smaller spatial scales. Rather than posing a completely new model, we arrange a conceptual marriage of eco‐geomorphology (ecological aspects of fluvial geomorphology) with a terrestrial landscape model describing hierarchical patch dynamics. We modify five components of this terrestrial model for lotic ecosystems: (1) nested, discontinuous hierarchies of patch mosaics; (2) ecosystem dynamics as a composite of intra‐ and inter‐patch dynamics; (3) linked patterns and processes; (4) dominance of non‐equilibrial and stochastic processes; and (5) formation of a quasi‐equilibrial, metastable state. Our conceptual model blends our perspectives on biocomplexity with aspects of aquatic models proposed from 1980–2004. Contrasting with a common view of rivers as continuous, longitudinal gradients in physical conditions, the RES portrays rivers as downstream arrays of large hydrogeomorphic patches (e.g. constricted, braided and floodplain channel areas) formed by catchment geomorphology and climate. The longitudinal distribution of these patches, which are identifiable using standard geomorphic techniques, varies amongst rivers and is difficult to forecast above ecoregional scales. Some types of hydrogeomorphic patches may reoccur along this downstream passage. Unique ecological ‘functional process zones’ are formed by individual types of hydrogeomorphic patches because of physiochemical habitat differences which affect ecosystem structure and function. The RES currently includes 14 tenets predicting how patterns of individual species distributions, community regulation, lotic ecosystem processes, and floodplain interactions will vary over spatiotemporal scales, especially as they relate to the functional process zones formed by hydrogeomorphic differences in the river network. Copyright © 2006 John Wiley & Sons, Ltd.
Although vast technological advances have been made and genetic software packages are growing in number, it is not a trivial task to analyse SNP data. We announce a new r package, dartr, enabling the analysis of single nucleotide polymorphism data for population genomic and phylogenomic applications. dartr provides user-friendly functions for data quality control and marker selection, and permits rigorous evaluations of conformation to Hardy-Weinberg equilibrium, gametic-phase disequilibrium and neutrality. The package reports standard descriptive statistics, permits exploration of patterns in the data through principal components analysis and conducts standard F-statistics, as well as basic phylogenetic analyses, population assignment, isolation by distance and exports data to a variety of commonly used downstream applications (e.g., newhybrids, faststructure and phylogeny applications) outside of the r environment. The package serves two main purposes: first, a user-friendly approach to lower the hurdle to analyse such data-therefore, the package comes with a detailed tutorial targeted to the r beginner to allow data analysis without requiring deep knowledge of r. Second, we use a single, well-established format-genlight from the adegenet package-as input for all our functions to avoid data reformatting. By strictly using the genlight format, we hope to facilitate this format as the de facto standard of future software developments and hence reduce the format jungle of genetic data sets. The dartr package is available via the r CRAN network and GitHub.