Atlas of Living Australia
archiveCanberra, Australian Capital Territory, Australia
Research output, citation impact, and the most-cited recent papers from Atlas of Living Australia (Australia). Aggregated across the NobleBlocks index of 300M+ scholarly works.
Top-cited papers from Atlas of Living Australia
Trait-based approaches have improved our understanding of plant evolution, community assembly and ecosystem functioning. A major challenge for the upcoming decades is to understand the functions and evolution of early life-history traits, across levels of organization and ecological strategies. Although a variety of seed traits are critical for dispersal, persistence, germination timing and seedling establishment, only seed mass has been considered systematically. Here we suggest broadening the range of morphological, physiological and biochemical seed traits to add new understanding on plant niches, population dynamics and community assembly. The diversity of seed traits and functions provides an important challenge that will require international collaboration in three areas of research. First, we present a conceptual framework for a seed ecological spectrum that builds upon current understanding of plant niches. We then lay the foundation for a seed-trait functional network, the establishment of which will underpin and facilitate trait-based inferences. Finally, we anticipate novel insights and challenges associated with incorporating diverse seed traits into predictive evolutionary ecology, community ecology and applied ecology. If the community invests in standardized seed-trait collection and the implementation of rigorous databases, major strides can be made at this exciting frontier of functional ecology.
DNA barcoding protocols require the linkage of each sequence record to a voucher specimen that has, whenever possible, been authoritatively identified. Natural history collections would seem an ideal resource for barcode library construction, but they have never seen large-scale analysis because of concerns linked to DNA degradation. The present study examines the strength of this barrier, carrying out a comprehensive analysis of moth and butterfly (Lepidoptera) species in the Australian National Insect Collection. Protocols were developed that enabled tissue samples, specimen data, and images to be assembled rapidly. Using these methods, a five-person team processed 41,650 specimens representing 12,699 species in 14 weeks. Subsequent molecular analysis took about six months, reflecting the need for multiple rounds of PCR as sequence recovery was impacted by age, body size, and collection protocols. Despite these variables and the fact that specimens averaged 30.4 years old, barcode records were obtained from 86% of the species. In fact, one or more barcode compliant sequences (>487 bp) were recovered from virtually all species represented by five or more individuals, even when the youngest was 50 years old. By assembling specimen images, distributional data, and DNA barcode sequences on a web-accessible informatics platform, this study has greatly advanced accessibility to information on thousands of species. Moreover, much of the specimen data became publically accessible within days of its acquisition, while most sequence results saw release within three months. As such, this study reveals the speed with which DNA barcode workflows can mobilize biodiversity data, often providing the first web-accessible information for a species. These results further suggest that existing collections can enable the rapid development of a comprehensive DNA barcode library for the most diverse compartment of terrestrial biodiversity - insects.
BACKGROUND: Microbial inhabitants of soils are important to ecosystem and planetary functions, yet there are large gaps in our knowledge of their diversity and ecology. The 'Biomes of Australian Soil Environments' (BASE) project has generated a database of microbial diversity with associated metadata across extensive environmental gradients at continental scale. As the characterisation of microbes rapidly expands, the BASE database provides an evolving platform for interrogating and integrating microbial diversity and function. FINDINGS: BASE currently provides amplicon sequences and associated contextual data for over 900 sites encompassing all Australian states and territories, a wide variety of bioregions, vegetation and land-use types. Amplicons target bacteria, archaea and general and fungal-specific eukaryotes. The growing database will soon include metagenomics data. Data are provided in both raw sequence (FASTQ) and analysed OTU table formats and are accessed via the project's data portal, which provides a user-friendly search tool to quickly identify samples of interest. Processed data can be visually interrogated and intersected with other Australian diversity and environmental data using tools developed by the 'Atlas of Living Australia'. CONCLUSIONS: Developed within an open data framework, the BASE project is the first Australian soil microbial diversity database. The database will grow and link to other global efforts to explore microbial, plant, animal, and marine biodiversity. Its design and open access nature ensures that BASE will evolve as a valuable tool for documenting an often overlooked component of biodiversity and the many microbe-driven processes that are essential to sustain soil function and ecosystem services.
Essential Biodiversity Variables (EBVs) allow observation and reporting of global biodiversity change, but a detailed framework for the empirical derivation of specific EBVs has yet to be developed. Here, we re-examine and refine the previous candidate set of species traits EBVs and show how traits related to phenology, morphology, reproduction, physiology and movement can contribute to EBV operationalization. The selected EBVs express intra-specific trait variation and allow monitoring of how organisms respond to global change. We evaluate the societal relevance of species traits EBVs for policy targets and demonstrate how open, interoperable and machine-readable trait data enable the building of EBV data products. We outline collection methods, meta(data) standardization, reproducible workflows, semantic tools and licence requirements for producing species traits EBVs. An operationalization is critical for assessing progress towards biodiversity conservation and sustainable development goals and has wide implications for data-intensive science in ecology, biogeography, conservation and Earth observation.
Appropriate selection of environmental variables is critical to the performance of biodiversity models, but has received less attention than the choice of modelling method. Online aggregators of biological and environmental data, such as the Global Biodiversity Information Facility and the Atlas of Living Australia, necessitate a rational approach to variable selection. We outline a set of general principles for systematically identifying, compiling, evaluating and selecting environmental variables for a biodiversity model. Our approach aims to maximise the information obtained from the analysis of biological records linked to a potentially large suite of spatial environmental variables. We demonstrate the utility of this structured framework through case studies with Australian vascular plants: regional modelling of a species distribution, continent-wide modelling of species compositional turnover and environmental classification. The approach is informed by three components of a biodiversity model: (1) an ecological framework or conceptual model, (2) a data model concerning availability, resolution and variable selection and (3) a method for analysing data. We expand the data model in structuring the problem of choosing environmental variables. The case studies demonstrate a structured approach for the: (1) cost-effective compilation of variables in the context of an explicit ecological framework for the study, attribute accuracy and resolution; (2) evaluation of non-linear relationships between variables using knowledge of their derivation, scatter plots and dissimilarity matrices; (3) selection and grouping of variables based on hypotheses of relative ecological importance and perceived predictor effectiveness; (4) systematic testing of variables as predictors through the process of model building and refinement and (5) model critique, inference and synthesis using direct gradient analysis to evaluate the shape of response curves in the context of ecological theory by presenting predictions in both geographic and environmental space.
Building on centuries of research based on herbarium specimens gathered through time and around the globe, a new era of discovery, synthesis, and prediction using digitized collections data has begun. This paper provides an overview of how aggregated, open access botanical and associated biological, environmental, and ecological data sets, from genes to the ecosystem, can be used to document the impacts of global change on communities, organisms, and society; predict future impacts; and help to drive the remediation of change. Advocacy for botanical collections and their expansion is needed, including ongoing digitization and online publishing. The addition of non-traditional digitized data fields, user annotation capability, and born-digital field data collection enables the rapid access of rich, digitally available data sets for research, education, informed decision-making, and other scholarly and creative activities. Researchers are receiving enormous benefits from data aggregators including the Global Biodiversity Information Facility (GBIF), Integrated Digitized Biocollections (iDigBio), the Atlas of Living Australia (ALA), and the Biodiversity Heritage Library (BHL), but effective collaboration around data infrastructures is needed when working with large and disparate data sets. Tools for data discovery, visualization, analysis, and skills training are increasingly important for inspiring novel research that improves the intrinsic value of physical and digital botanical collections.
Collections of biological specimens are fundamental to scientific understanding and characterization of natural diversity-past, present and future. This paper presents a system for liberating useful information from physical collections by bringing specimens into the digital domain so they can be more readily shared, analyzed, annotated and compared. It focuses on insects and is strongly motivated by the desire to accelerate and augment current practices in insect taxonomy which predominantly use text, 2D diagrams and images to describe and characterize species. While these traditional kinds of descriptions are informative and useful, they cannot cover insect specimens "from all angles" and precious specimens are still exchanged between researchers and collections for this reason. Furthermore, insects can be complex in structure and pose many challenges to computer vision systems. We present a new prototype for a practical, cost-effective system of off-the-shelf components to acquire natural-colour 3D models of insects from around 3 mm to 30 mm in length. ("Natural-colour" is used to contrast with "false-colour", i.e., colour generated from, or applied to, gray-scale data post-acquisition.) Colour images are captured from different angles and focal depths using a digital single lens reflex (DSLR) camera rig and two-axis turntable. These 2D images are processed into 3D reconstructions using software based on a visual hull algorithm. The resulting models are compact (around 10 megabytes), afford excellent optical resolution, and can be readily embedded into documents and web pages, as well as viewed on mobile devices. The system is portable, safe, relatively affordable, and complements the sort of volumetric data that can be acquired by computed tomography. This system provides a new way to augment the description and documentation of insect species holotypes, reducing the need to handle or ship specimens. It opens up new opportunities to collect data for research, education, art, entertainment, biodiversity assessment and biosecurity control.
Abstract Environmental DNA (eDNA) metabarcoding, where specific DNA markers (barcodes) are purified from the environment, sequenced, and used to identify organisms, is revolutionizing biodiversity science. Researchers, industry, and governments are increasingly incorporating eDNA surveys into their toolkits for biomonitoring because of their high accuracy, taxonomically holistic lens, and ease of deployment. Yet, the full value of this rich and revolutionary technique is not being realized because eDNA data are rarely maintained in consistent formats or on open platforms that facilitate easy re‐use or integration with other data layers. Here, we discuss opportunities to improve the accessibility and “FAIRness” of eDNA records with the aim to maximize the value of this remarkable source of information on environmental state and change.
The Atlas of Living Australia (ALA) is Australia's national biodiversity database, delivering data and related services to more than 80,000 Australian and international users annually. Established under the Australian Government's National Collaborative Research Infrastructure Strategy to provide trusted biodiversity data to support the research sector, its utility now extends to government, higher education, non-government organisations and community groups. These partners provide data to the ALA and leverage its data and related services. The ALA has also played an important leadership role internationally in the biodiversity informatics and infrastructure space, both through its partnership with the Global Biodiversity Information Facility and through support for the international Living Atlases programmes which has now delivered 24 instances of ALA software to deliver sovereign biodiversity data capability around the world. This paper begins with a historical overview of the genesis of the ALA from the collections, museums and herbaria community in Australia. It details the biodiversity and related data and services delivered to users with a primary focus on species occurrence records which represent the ALA's primary data type. Finally, the paper explores the ALA's future directions by referencing results from a recently completed national consultation process.
The early twenty-first century has witnessed massive expansions in availability and accessibility of digital data in virtually all domains of the biodiversity sciences. Led by an array of asynchronous digitization activities spanning ecological, environmental, climatological, and biological collections data, these initiatives have resulted in a plethora of mostly disconnected and siloed data, leaving to researchers the tedious and time-consuming manual task of finding and connecting them in usable ways, integrating them into coherent data sets, and making them interoperable. The focus to date has been on elevating analog and physical records to digital replicas in local databases prior to elevating them to ever-growing aggregations of essentially disconnected discipline-specific information. In the present article, we propose a new interconnected network of digital objects on the Internet-the Digital Extended Specimen (DES) network-that transcends existing aggregator technology, augments the DES with third-party data through machine algorithms, and provides a platform for more efficient research and robust interdisciplinary discovery.
Abstract Bio‐logging data obtained by tagging animals are key to addressing global conservation challenges. However, the many thousands of existing bio‐logging datasets are not easily discoverable, universally comparable, nor readily accessible through existing repositories and across platforms, slowing down ecological research and effective management. A set of universal standards is needed to ensure discoverability, interoperability and effective translation of bio‐logging data into research and management recommendations. We propose a standardisation framework adhering to existing data principles (FAIR: Findable, Accessible, Interoperable and Reusable; and TRUST: Transparency, Responsibility, User focus, Sustainability and Technology) and involving the use of simple templates to create a data flow from manufacturers and researchers to compliant repositories, where automated procedures should be in place to prepare data availability into four standardised levels: (a) decoded raw data, (b) curated data, (c) interpolated data and (d) gridded data. Our framework allows for integration of simple tabular arrays (e.g. csv files) and creation of sharable and interoperable network Common Data Form (netCDF) files containing all the needed information for accuracy‐of‐use, rightful attribution (ensuring data providers keep ownership through the entire process) and data preservation security. We show the standardisation benefits for all stakeholders involved, and illustrate the application of our framework by focusing on marine animals and by providing examples of the workflow across all data levels, including filled templates and code to process data between levels, as well as templates to prepare netCDF files ready for sharing. Adoption of our framework will facilitate collection of Essential Ocean Variables (EOVs) in support of the Global Ocean Observing System (GOOS) and inter‐governmental assessments (e.g. the World Ocean Assessment), and will provide a starting point for broader efforts to establish interoperable bio‐logging data formats across all fields in animal ecology.
The Creative Commons (CC) licenses are a suite of copyright-based licenses defining terms for the distribution and re-use of creative works. CC provides licenses for different use cases and includes open content licenses such as the Attribution license (CC BY, used by many Open Access scientific publishers) and the Attribution Share Alike license (CC BY-SA, used by Wikipedia, for example). However, the license suite also contains non-free and non-open licenses like those containing a "non-commercial" (NC) condition. Although many people identify "non-commercial" with "non-profit", detailed analysis reveals that significant differences exist and that the license may impose some unexpected re-use limitations on works thus licensed. After providing background information on the concepts of Creative Commons licenses in general, this contribution focuses on the NC condition, its advantages, disadvantages and appropriate scope. Specifically, it contributes material towards a risk analysis for potential re-users of NC-licensed works.
Essential Biodiversity Variables (EBV) are fundamental variables that can be used for assessing biodiversity change over time, for determining adherence to biodiversity policy, for monitoring progress towards sustainable development goals, and for tracking biodiversity responses to disturbances and management interventions. Data from observations or models that provide measured or estimated EBV values, which we refer to as EBV data products, can help to capture the above processes and trends and can serve as a coherent framework for documenting trends in biodiversity. Using primary biodiversity records and other raw data as sources to produce EBV data products depends on cooperation and interoperability among multiple stakeholders, including those collecting and mobilising data for EBVs and those producing, publishing and preserving EBV data products. Here, we encapsulate ten principles for the current best practice in EBV-focused biodiversity informatics as ‘The Bari Manifesto’, serving as implementation guidelines for data and research infrastructure providers to support the emerging EBV operational framework based on trans-national and cross-infrastructure scientific workflows. The principles provide guidance on how to contribute towards the production of EBV data products that are globally oriented, while remaining appropriate to the producer's own mission, vision and goals. These ten principles cover: data management planning; data structure; metadata; services; data quality; workflows; provenance; ontologies/vocabularies; data preservation; and accessibility. For each principle, desired outcomes and goals have been formulated. Some specific actions related to fulfilling the Bari Manifesto principles are highlighted in the context of each of four groups of organizations contributing to enabling data interoperability - data standards bodies, research data infrastructures, the pertinent research communities, and funders. The Bari Manifesto provides a roadmap enabling support for routine generation of EBV data products, and increases the likelihood of success for a global EBV framework.
<p class="Normal1">This article offers an assessment of current data practices in the citizen science, community science, and crowdsourcing communities. We begin by reviewing current trends in scientific data relevant to citizen science before presenting the results of our qualitative research. Following a purposive sampling scheme designed to capture data management practices from a wide range of initiatives through a landscape sampling methodology (Bos et al. 2007), we sampled 36 projects from English-speaking countries. The authors used a semi-structured protocol to interview project proponents (either scientific leads or data managers) to better understand how projects are addressing key aspects of the data lifecycle, reporting results through descriptive statistics and other analyses. Findings suggest that citizen science projects are doing well in terms of data quality assessment and governance, but are sometimes lacking in providing open access to data outputs, documenting data, ensuring interoperability through data standards, or building robust and sustainable infrastructure. Based on this assessment, the paper presents a number of recommendations for the citizen science community related to data quality, data infrastructure, data governance, data documentation, and data access.
This paper discusses the following key messages. Taxonomy is (and taxonomists are) more important than ever in times of global change. Taxonomic endeavour is not occurring fast enough: in 250 years since the creation of the Linnean Systema Naturae, only about 20% of Earth’s species have been named. We need fundamental changes to the taxonomic process and paradigm to increase taxonomic productivity by orders of magnitude. Currently, taxonomic productivity is limited principally by the rate at which we capture and manage morphological information to enable species discovery. Many recent (and welcomed) initiatives in managing and delivering biodiversity information and accelerating the taxonomic process do not address this bottleneck. Development of computational image analysis and feature extraction methods is a crucial missing capacity needed to enable taxonomists to overcome the taxonomic impediment in a meaningful time frame.
The first goal of this chapter is to propose a slight re-framing of citizen science, which will contextualize the information presented in the rest of the book. The authors propose a perspective on and a definition for citizen science (which is alternative to the numerous previously documented definitions) as: “work undertaken by civic educators together with citizen communities to advance science, foster a broad scientific mentality, and/or encourage democratic engagement, which allows society to deal rationally with complex modern problems”. By explaining the rationale behind this definition, the authors also hope to raise awareness of the role that the meaning of words and phrases (semantics) plays in understanding and supporting citizen science. A second goal of this chapter is to explain how different organizations already use certain software solutions to organize knowledge about citizen science, how these systems can be classified and how they can facilitate or impede interoperability – the ability of humans and machines to pass information between each other.
This paper explores what the virtual biodiversity e-infrastructure will look like as it takes advantage of advances in 'Big Data' biodiversity informatics and e-research infrastructure, which allow integration of various taxon-level data types (genome, morphology, distribution and species interactions) within a phylogenetic and environmental framework. By overcoming the data scaling problem in ecology, this integrative framework will provide richer information and fast learning to enable a deeper understanding of biodiversity evolution and dynamics in a rapidly changing world. The Atlas of Living Australia is used as one example of the advantages of progressing towards this future. Living in this future will require the adoption of new ways of integrating scientific knowledge into societal decision making.This article is part of the themed issue 'From DNA barcodes to biomes'.
The Atlas of Living Australia (ALA: http://www.ala.org.au) provides the largest free and open repository of integrated biological and environmental information in a consistent format for the Australian region. As of June 2015, the ALA contained over 55 million records (10% of Global Biodiversity Information Facility’s (GBIF’s) total), consisting of 150,000+ native and alien species, nearly 500 layers of gridded and polygonal bio-environmental data, 39+ million pages of biological literature, and 45,000+ images of species and other integrated biological data. The development of the research interface to the ALA (http://spatial.ala.org.au) was the trigger to develop an architecture designed to tightly integrate environmental data for online use with biological data. Environmental layers are classed as environmental (gridded with continuous values) or contextual (polygonal with discrete class values). A suite of analysis and visualization tools have been developed to demonstrate the value of integrating the ALA’s biological and environmental data. This paper outlines the purpose and process of establishing the ALA and discusses the integration of environmental data relevant to biodiversity research in the Australian region and the vision for continually improved services for research, area management, education, and citizen science. The ALA’s environmental infrastructure addresses current needs but increased data types, volumes, and resolution suggests new directions are needed to provide quality services into the future. The experience of building the ALA has relevance for other agencies setting up similar infrastructure which supports integrated access to and use of their national biological and environmental information.
Understanding variations in habitat use through time contributes to identification of habitats critical for species survival. Here we used passive acoustic telemetry to examine the residency and site fidelity patterns of the reef manta ray Mobula alfredi at Lady Elliot Island (LEI), a key aggregation site in eastern Australia. Six acoustic receivers were moored around LEI between 2009 and 2012, and 34 acoustic transmitters were deployed on manta rays. All tagged animals returned to this site within their recording period, with some individuals visiting the area for up to 23 consecutive days. Using a set of mixed effect models, we analysed the hourly visitation patterns of manta rays in relation to temporal and environmental variables. Diel phase and sea temperature showed the strongest effects on the presence of manta rays, with weaker effects detected for wind direction, wind speed and moon illumination. Individuals occupied LEI habitat mostly during daylight hours and in calm weather conditions, which may be linked with behavioural thermoregulation, social interactions and cleaning activities. Their absence at night may be related to foraging strategies in deeper offshore waters. The effect of sea temperature reflects the greater seasonal occurrence of manta rays at LEI in winter, when temperatures are coolest, potentially in response to regional food availability. The high degree of manta ray site fidelity at aggregation sites underscores the importance of these areas as key daytime habitats for the species. We suggest that conservation measures should prioritise the protection of coastal aggregation habitats from overexploitation and degradation.
BACKGROUND: Many scientific disciplines rely on correct taxon delineations and identifications. So does a great part of the general public as well as decision makers. Researchers, students and enthusiastic amateurs often feel frustrated because information about species remains scattered, difficult to access, or difficult to decipher. Together, this affects almost anyone who wishes to identify species or verify identifications. Many remedies have been proposed, but we argue that the role of natural history collections remains insufficiently appreciated. We suggest using state-of-the-art mass imaging technology and to join forces to create a global natural history metacollection on the internet, providing access to the morphology of tens of millions of specimens and making them available for automated digital image analysis. DISCUSSION: Robotic high-resolution imaging technology and fast (high performance) computer-based image stitching make it now feasible to digitize entire collection drawers typically used for arthropod collections, or trays or containers used for other objects. Resolutions of 500 megapixels and much higher are already utilized to capture the contents of 40x50 cm collection drawers, providing amazing detail of specimens. Flanked by metadata entry, this helps to create access to tens of thousands of specimens in days. By setting priorities and combining the holdings of the most comprehensive collections for certain taxa, drawer digitizing offers the unique opportunity to create a global, virtual metacollection.The taxonomic and geographic coverage of such a collection could never be achieved by a single institution or individual. We argue that by joining forces, many new impulses will emerge for systematic biology, related fields and understanding of biodiversity in general.Digitizing drawers containing unidentified, little-curated specimens is a contribution towards the beginning of a new era of online curation. It also will help taxonomists and curators to discover and process the millions of "gems" of undescribed species hidden in museum accessions. SUMMARY: Our proposal suggests creating virtual, high-resolution image resources that will, for the first time in history, provide access for expert scientists as well as students and the general public to the enormous wealth of the world's natural history collections. We foresee that this will contribute to a better understanding, appreciation and increased use of biodiversity resources and the natural history collections serving this cause.