Turku Centre for Computer Science

facilityTurku, Finland

Research output, citation impact, and the most-cited recent papers from Turku Centre for Computer Science (Finland). Aggregated across the NobleBlocks index of 300M+ scholarly works.

Total works

4.7K

Citations

240.7K

h-index

180

i10-index

4.1K

Also known as

Turku Centre for Computer ScienceTurun tietotekniikan tutkimus- ja koulutuskeskusÅbo datatekniska forsknings- och utbildningscentrum

Top-cited papers from Turku Centre for Computer Science

First Sagittarius A* Event Horizon Telescope Results. I. The Shadow of the Supermassive Black Hole in the Center of the Milky Way

Kazunori Akiyama, A. Alberdi, W. Alef, Juan Carlos Algaba +4 more

2022· The Astrophysical Journal Letters1.7Kdoi:10.3847/2041-8213/ac6674

Abstract We present the first Event Horizon Telescope (EHT) observations of Sagittarius A* (Sgr A*), the Galactic center source associated with a supermassive black hole. These observations were conducted in 2017 using a global interferometric array of eight telescopes operating at a wavelength of λ = 1.3 mm. The EHT data resolve a compact emission region with intrahour variability. A variety of imaging and modeling analyses all support an image that is dominated by a bright, thick ring with a diameter of 51.8 ± 2.3 μ as (68% credible interval). The ring has modest azimuthal brightness asymmetry and a comparatively dim interior. Using a large suite of numerical simulations, we demonstrate that the EHT images of Sgr A* are consistent with the expected appearance of a Kerr black hole with mass ∼4 × 10 6 M ⊙ , which is inferred to exist at this location based on previous infrared observations of individual stellar orbits, as well as maser proper-motion studies. Our model comparisons disfavor scenarios where the black hole is viewed at high inclination ( i > 50°), as well as nonspinning black holes and those with retrograde accretion disks. Our results provide direct evidence for the presence of a supermassive black hole at the center of the Milky Way, and for the first time we connect the predictions from dynamical measurements of stellar orbits on scales of 10 3 –10 5 gravitational radii to event-horizon-scale images and variability. Furthermore, a comparison with the EHT results for the supermassive black hole M87* shows consistency with the predictions of general relativity spanning over three orders of magnitude in central mass.

TrackMate 7: integrating state-of-the-art segmentation algorithms into tracking pipelines

Dmitry Ershov, Minh-Son Phan, Joanna W. Pylvänäinen, Stéphane Rigaud +4 more

2022· Nature Methods1.2Kdoi:10.1038/s41592-022-01507-1

TrackMate is an automated tracking software used to analyze bioimages and is distributed as a Fiji plugin. Here, we introduce a new version of TrackMate. TrackMate 7 is built to address the broad spectrum of modern challenges researchers face by integrating state-of-the-art segmentation algorithms into tracking pipelines. We illustrate qualitatively and quantitatively that these new capabilities function effectively across a wide range of bio-imaging experiments. TrackMate 7 combines the benefits of machine and deep learning-based image segmentation with accurate object tracking to enable improved 2D and 3D tracking of diverse objects in biological research.

A practical guide to single-cell RNA-sequencing for biomedical research and clinical applications

Ashraful Haque, Jessica A. Engel, Sarah A. Teichmann, Tapio Lönnberg

2017· Genome Medicine1.1Kdoi:10.1186/s13073-017-0467-4

RNA sequencing (RNA-seq) is a genomic approach for the detection and quantitative analysis of messenger RNA molecules in a biological sample and is useful for studying cellular responses. RNA-seq has fueled much discovery and innovation in medicine over recent years. For practical reasons, the technique is usually conducted on samples comprising thousands to millions of cells. However, this has hindered direct assessment of the fundamental unit of biology-the cell. Since the first single-cell RNA-sequencing (scRNA-seq) study was published in 2009, many more have been conducted, mostly by specialist laboratories with unique skills in wet-lab single-cell genomics, bioinformatics, and computation. However, with the increasing commercial availability of scRNA-seq platforms, and the rapid ongoing maturation of bioinformatics approaches, a point has been reached where any biomedical researcher or clinician can use scRNA-seq to make exciting discoveries. In this review, we present a practical guide to help researchers design their first scRNA-seq studies, including introductory information on experimental hardware, protocol choice, quality control, data analysis and biological interpretation.

A large-scale evaluation of computational protein function prediction

Predrag Radivojac, Wyatt T. Clark, Tal Oron, Alexandra M. Schnoes +4 more

2013· Nature Methods1.1Kdoi:10.1038/nmeth.2340

Automated annotation of protein function is challenging. As the number of sequenced genomes rapidly grows, the overwhelming majority of protein products can only be annotated computationally. If computational predictions are to be relied upon, it is crucial that the accuracy of these methods be high. Here we report the results from the first large-scale community-based critical assessment of protein function annotation (CAFA) experiment. Fifty-four methods representing the state of the art for protein function prediction were evaluated on a target set of 866 proteins from 11 organisms. Two findings stand out: (i) today's best protein function prediction algorithms substantially outperform widely used first-generation methods, with large gains on all types of targets; and (ii) although the top methods perform well enough to guide experiments, there is considerable need for improvement of currently available tools.

Human gut colonisation may be initiated in utero by distinct microbial communities in the placenta and amniotic fluid

María Carmen Collado, Samuli Rautava, Juhani Aakko, Erika Isolauri +1 more

2016· Scientific Reports1.1Kdoi:10.1038/srep23129

Interaction with intestinal microbes in infancy has a profound impact on health and disease in later life through programming of immune and metabolic pathways. We collected maternal faeces, placenta, amniotic fluid, colostrum, meconium and infant faeces samples from 15 mother-infant pairs in an effort to rigorously investigate prenatal and neonatal microbial transfer and gut colonisation. To ensure sterile sampling, only deliveries at full term by elective caesarean section were studied. Microbiota composition and activity assessment by conventional bacterial culture, 16S rRNA gene pyrosequencing, quantitative PCR, and denaturing gradient gel electrophoresis revealed that the placenta and amniotic fluid harbour a distinct microbiota characterised by low richness, low diversity and the predominance of Proteobacteria. Shared features between the microbiota detected in the placenta and amniotic fluid and in infant meconium suggest microbial transfer at the foeto-maternal interface. At the age of 3-4 days, the infant gut microbiota composition begins to resemble that detected in colostrum. Based on these data, we propose that the stepwise microbial gut colonisation process may be initiated already prenatally by a distinct microbiota in the placenta and amniotic fluid. The link between the mother and the offspring is continued after birth by microbes present in breast milk.

Stress-Inducible Regulation of Heat Shock Factor 1 by the Deacetylase SIRT1

Sandy D. Westerheide, Julius Anckar, Stanley M. Stevens, Lea Sistonen +1 more

2009· Science713doi:10.1126/science.1165946

Heat shock factor 1 (HSF1) is essential for protecting cells from protein-damaging stress associated with misfolded proteins and regulates the insulin-signaling pathway and aging. Here, we show that human HSF1 is inducibly acetylated at a critical residue that negatively regulates DNA binding activity. Activation of the deacetylase and longevity factor SIRT1 prolonged HSF1 binding to the heat shock promoter Hsp70 by maintaining HSF1 in a deacetylated, DNA-binding competent state. Conversely, down-regulation of SIRT1 accelerated the attenuation of the heat shock response (HSR) and release of HSF1 from its cognate promoter elements. These results provide a mechanistic basis for the requirement of HSF1 in the regulation of life span and establish a role for SIRT1 in protein homeostasis and the HSR.

The reinterpretation of dreams: An evolutionary hypothesis of the function of dreaming

Antti Revonsuo

2000· Behavioral and Brain Sciences667doi:10.1017/s0140525x00004015

Several theories claim that dreaming is a random by-product of REM sleep physiology and that it does not serve any natural function. Phenomenal dream content, however, is not as disorganized as such views imply. The form and content of dreams is not random but organized and selective: during dreaming, the brain constructs a complex model of the world in which certain types of elements, when compared to waking life, are underrepresented whereas others are over represented. Furthermore, dream content is consistently and powerfully modulated by certain types of waking experiences. On the basis of this evidence, I put forward the hypothesis that the biological function of dreaming is to simulate threatening events, and to rehearse threat perception and threat avoidance. To evaluate this hypothesis, we need to consider the original evolutionary context of dreaming and the possible traces it has left in the dream content of the present human population. In the ancestral environment human life was short and full of threats. Any behavioral advantage in dealing with highly dangerous events would have increased the probability of reproductive success. A dream-production mechanism that tends to select threatening waking events and simulate them over and over again in various combinations would have been valuable for the development and maintenance of threat-avoidance skills. Empirical evidence from normative dream content, children's dreams, recurrent dreams, nightmares, post traumatic dreams, and the dreams of hunter-gatherers indicates that our dream-production mechanisms are in fact specialized in the simulation of threatening events, and thus provides support to the threat simulation hypothesis of the function of dreaming.

First Sagittarius A* Event Horizon Telescope Results. VI. Testing the Black Hole Metric

Kazunori Akiyama, A. Alberdi, W. Alef, Juan Carlos Algaba +4 more

2022· The Astrophysical Journal Letters633doi:10.3847/2041-8213/ac6756

Abstract Astrophysical black holes are expected to be described by the Kerr metric. This is the only stationary, vacuum, axisymmetric metric, without electromagnetic charge, that satisfies Einstein’s equations and does not have pathologies outside of the event horizon. We present new constraints on potential deviations from the Kerr prediction based on 2017 EHT observations of Sagittarius A* (Sgr A*). We calibrate the relationship between the geometrically defined black hole shadow and the observed size of the ring-like images using a library that includes both Kerr and non-Kerr simulations. We use the exquisite prior constraints on the mass-to-distance ratio for Sgr A* to show that the observed image size is within ∼10% of the Kerr predictions. We use these bounds to constrain metrics that are parametrically different from Kerr, as well as the charges of several known spacetimes. To consider alternatives to the presence of an event horizon, we explore the possibility that Sgr A* is a compact object with a surface that either absorbs and thermally reemits incident radiation or partially reflects it. Using the observed image size and the broadband spectrum of Sgr A*, we conclude that a thermal surface can be ruled out and a fully reflective one is unlikely. We compare our results to the broader landscape of gravitational tests. Together with the bounds found for stellar-mass black holes and the M87 black hole, our observations provide further support that the external spacetimes of all black holes are described by the Kerr metric, independent of their mass.

Galaxy And Mass Assembly (GAMA): end of survey report and data release 2

J. Liske, I. K. Baldry, Simon P. Driver, R. J. Tuffs +4 more

2015· Monthly Notices of the Royal Astronomical Society600doi:10.1093/mnras/stv1436

The Galaxy And Mass Assembly (GAMA) survey is one of the largest contemporary spectroscopic surveys of low redshift galaxies. Covering an area of 286 deg 2 (split among five survey regions) down to a limiting magnitude of r < 19.8 mag, we have collected spectra and reliable redshifts for 238 000 objects using the AAOmega spectrograph on the Anglo-Australian Telescope. In addition, we have assembled imaging data from a number of independent surveys in order to generate photometry spanning the wavelength range 1 nm-1 m. Here, we report on the recently completed spectroscopic survey and present a series of diagnostics to assess its final state and the quality of the redshift data. We also describe a number of survey aspects and procedures, or updates thereof, including changes to the input catalogue, redshifting and re-redshifting, and the derivation of ultraviolet, optical and near-infrared photometry. Finally, we present the second public release of GAMA data. In this release, we provide input catalogue and targeting information, spectra, redshifts, ultraviolet, optical and near-infrared photometry, single-component Srsic fits, stellar masses, H-derived star formation rates, environment information, and group properties for all galaxies with r < 19.0 mag in two of our survey regions, and for all galaxies with r < 19.4 mag in a third region (72 225 objects in total). The data base serving these data is available at http://www.gama-survey.org/.

Microanatomy of the Human Atherosclerotic Plaque by Single-Cell Transcriptomics

Marie A.C. Depuydt, Koen H.M. Prange, Lotte Slenders, Tiit Örd +4 more

2020· Circulation Research567doi:10.1161/circresaha.120.316770

Rationale: Atherosclerotic lesions are known for their cellular heterogeneity, yet the molecular complexity within the cells of human plaques has not been fully assessed. Objective: Using single-cell transcriptomics and chromatin accessibility, we gained a better understanding of the pathophysiology underlying human atherosclerosis. Methods and Results: We performed single-cell RNA and single-cell ATAC sequencing on human carotid atherosclerotic plaques to define the cells at play and determine their transcriptomic and epigenomic characteristics. We identified 14 distinct cell populations including endothelial cells, smooth muscle cells, mast cells, B cells, myeloid cells, and T cells and identified multiple cellular activation states and suggested cellular interconversions. Within the endothelial cell population, we defined subsets with angiogenic capacity plus clear signs of endothelial to mesenchymal transition. CD4 + and CD8 + T cells showed activation-based subclasses, each with a gradual decline from a cytotoxic to a more quiescent phenotype. Myeloid cells included 2 populations of proinflammatory macrophages showing IL (interleukin) 1B or TNF (tumor necrosis factor) expression as well as a foam cell-like population expressing TREM2 (triggering receptor expressed on myeloid cells 2) and displaying a fibrosis-promoting phenotype. ATACseq data identified specific transcription factors associated with the myeloid subpopulation and T cell cytokine profiles underlying mutual activation between both cell types. Finally, cardiovascular disease susceptibility genes identified using public genome-wide association studies data were particularly enriched in lesional macrophages, endothelial, and smooth muscle cells. Conclusions: This study provides a transcriptome-based cellular landscape of human atherosclerotic plaques and highlights cellular plasticity and intercellular communication at the site of disease. This detailed definition of cell communities at play in atherosclerosis will facilitate cell-based mapping of novel interventional targets with direct functional relevance for the treatment of human disease.

Democratising deep learning for microscopy with ZeroCostDL4Mic

Lucas von Chamier, Romain F. Laine, Johanna Jukkala, Christoph Spahn +4 more

2021· Nature Communications553doi:10.1038/s41467-021-22518-0

Deep Learning (DL) methods are powerful analytical tools for microscopy and can outperform conventional image processing pipelines. Despite the enthusiasm and innovations fuelled by DL technology, the need to access powerful and compatible resources to train DL networks leads to an accessibility barrier that novice users often find difficult to overcome. Here, we present ZeroCostDL4Mic, an entry-level platform simplifying DL access by leveraging the free, cloud-based computational resources of Google Colab. ZeroCostDL4Mic allows researchers with no coding expertise to train and apply key DL networks to perform tasks including segmentation (using U-Net and StarDist), object detection (using YOLOv2), denoising (using CARE and Noise2Void), super-resolution microscopy (using Deep-STORM), and image-to-image translation (using Label-free prediction - fnet, pix2pix and CycleGAN). Importantly, we provide suitable quantitative tools for each network to evaluate model performance, allowing model optimisation. We demonstrate the application of the platform to study multiple biological processes.

Timepix3: a 65K channel hybrid pixel readout chip with simultaneous ToA/ToT and sparse readout

T. Poikela, Juha Plosila, Tomi Westerlund, M. Campbell +4 more

2014· Journal of Instrumentation548doi:10.1088/1748-0221/9/05/c05013

The Timepix3, hybrid pixel detector (HPD) readout chip, a successor to the Timepix \cite{timepix2007} chip, can record time-of-arrival (ToA) and time-over-threshold (ToT) simultaneously in each pixel. ToA information is recorded in a 14-bit register at 40 MHz and can be refined by a further 4 bits with a nominal resolution of 1.5625 ns (640 MHz). ToT is recorded in a 10-bit overflow controlled counter at 40 MHz. Pixels can be programmed to record 14 bits of integral ToT and 10 bits of event counting, both at 40 MHz. The chip is designed in 130 nm CMOS and contains 256 × 256 pixel channels (55 × 55 μm2). The chip, which has more than 170 M transistors, has been conceived as a general-purpose readout chip for HPDs used in a wide range of applications. Common requirements of these applications are operation without a trigger signal, and sparse readout where only pixels containing event information are read out.

Planck2013 results. XV. CMB power spectra and likelihood

P. A. R. Ade, N. Aghanim, C. Armitage-Caplan, M. Arnaud +4 more

2014· Astronomy and Astrophysics546doi:10.1051/0004-6361/201321573

This paper presents the Planck 2013 likelihood, a complete statistical description of the two-point correlation function of the CMB temperature fluctuations that accounts for all known relevant uncertainties, both instrumental and astrophysical in nature. We use this likelihood to derive our best estimate of the CMB angular power spectrum from Planck over three decades in multipole moment, , covering 2 2500. The main source of uncertainty at < 1500 is cosmic variance. Uncertainties in small-scale foreground modelling and instrumental noise dominate the error budget at higher s. For < 50, our likelihood exploits all Planck frequency channels from 30 to 353 GHz, separating the cosmological CMB signal from diffuse Galactic foregrounds through a physically motivated Bayesian component separation technique. At 50, we employ a correlated Gaussian likelihood approximation based on a fine-grained set of angular cross-spectra derived from multiple detector combinations between the 100, 143, and 217 GHz frequency channels, marginalising over power spectrum foreground templates. We validate our likelihood through an extensive suite of consistency tests, and assess the impact of residual foreground and instrumental uncertainties on the final cosmological parameters. We find good internal agreement among the high-cross-spectra with residuals below a few K 2 at < 1000, in agreement with estimated calibration uncertainties. We compare our results with foreground-cleaned CMB maps derived from all Planck frequencies, as well as with cross-spectra derived from the 70 GHz Planck map, and find broad agreement in terms of spectrum residuals and cosmological parameters. We further show that the best-fit CDM cosmology is in excellent agreement with preliminary Planck EE and T E polarisation spectra. We find that the standard CDM cosmology is well constrained by Planck from the measurements at < 1500. One specific example is the spectral index of scalar perturbations, for which we report a 5.4 deviation from scale invariance, n s = 1. Increasing the multipole range beyond 1500 does not increase our accuracy for the CDM parameters, but instead allows us to study extensions beyond the standard model. We find no indication of significant departures from the CDM framework. Finally, we report a tension between the Planck best-fit CDM model and the low-spectrum in the form of a power deficit of 5-10% at < 40, with a statistical significance of 2.5-3. Without a theoretically motivated model for this power deficit, we do not elaborate further on its cosmological implications, but note that this is our most puzzling finding in an otherwise remarkably consistent data set.

A systematic review of the mental health changes of children and young people before and during the COVID-19 pandemic

Laura Kauhanen, Wan Mohd Azam Wan Mohd Yunus, Lotta Lempinen, Kirsi Peltonen +4 more

2022· European Child & Adolescent Psychiatry531doi:10.1007/s00787-022-02060-0

There is increasing knowledge that the COVID-19 pandemic has had an impact on mental health of children and young people. However, the global evidence of mental health changes before compared to during the COVID-19 pandemic focusing on children and young people has not been systematically reviewed. This systematic review examined longitudinal and repeated cross-sectional studies comparing before and during COVID-19 pandemic data to determine whether the mental health of children and young people had changed before and during the COVID-19 pandemic. The Web of Science, PubMed, Embase and PsycINFO databases were searched to identify peer-reviewed studies that had been published in English and focused on children and young people between 0 and 24 years of age. This identified 21 studies from 11 countries, covering more than 96,000 subjects from 3 to 24 years of age. Pre-pandemic and pandemic data were compared. Most studies reported longitudinal deterioration in the mental health of adolescents and young people, with increased depression, anxiety and psychological distress after the pandemic started. Other findings included deteriorated negative affect, mental well-being and increased loneliness. Comparing data for pandemic and pre-pandemic periods showed that the COVID-19 pandemic may negatively impact the mental health of children and young people. There is an urgent need for high-quality research to address the impact, risks and protective factors of the pandemic on their mental health, as this will provide a good foundation for dealing with future health emergencies and other crises.

Planck2013 results. XXIX. ThePlanckcatalogue of Sunyaev-Zeldovich sources

P. A. R. Ade, N. Aghanim, C. Armitage-Caplan, M. Arnaud +4 more

2014· Astronomy and Astrophysics524doi:10.1051/0004-6361/201321523

We describe the all-sky Planck catalogue of clusters and cluster candidates derived from Sunyaev-Zeldovich (SZ) effect detections using the first 15.5 months of Planck satellite observations. The catalogue contains 1227 entries, making it over six times the size of the Planck Early SZ (ESZ) sample and the largest SZ-selected catalogue to date. It contains 861 confirmed clusters, of which 178 have been confirmed as clusters, mostly through follow-up observations, and a further 683 are previously-known clusters. The remaining 366 have the status of cluster candidates, and we divide them into three classes according to the quality of evidence that they are likely to be true clusters. The Planck SZ catalogue is the deepest all-sky cluster catalogue, with redshifts up to about one, and spans the broadest cluster mass range from (0.1 to 1.6) 10 15 M . Confirmation of cluster candidates through comparison with existing surveys or cluster catalogues is extensively described, as is the statistical characterization of the catalogue in terms of completeness and statistical reliability. The outputs of the validation process are provided as additional information. This gives, in particular, an ensemble of 813 cluster redshifts, and for all these Planck clusters we also include a mass estimated from a newly-proposed SZ-mass proxy. A refined measure of the SZ Compton parameter for the clusters with X-ray counter-parts is provided, as is an X-ray flux for all the Planck clusters not previously detected in X-ray surveys.

Sudden Transition between Classical and Quantum Decoherence

Laura Mazzola, Jyrki Piilo, Sabrina Maniscalco

2010· Physical Review Letters518doi:10.1103/physrevlett.104.200401

We study the dynamics of quantum and classical correlations in the presence of nondissipative decoherence. We discover a class of initial states for which the quantum correlations, quantified by the quantum discord, are not destroyed by decoherence for times t<[symbol: see text]. In this initial time interval classical correlations decay. For t>[symbol: see text], on the other hand, classical correlations do not change in time and only quantum correlations are lost due to the interaction with the environment. Therefore, at the transition time [symbol: see text] the open system dynamics exhibits a sudden transition from classical to quantum decoherence regime.

First Sagittarius A* Event Horizon Telescope Results. V. Testing Astrophysical Models of the Galactic Center Black Hole

Kazunori Akiyama, A. Alberdi, W. Alef, Juan Carlos Algaba +4 more

2022· The Astrophysical Journal Letters516doi:10.3847/2041-8213/ac6672

Abstract In this paper we provide a first physical interpretation for the Event Horizon Telescope's (EHT) 2017 observations of Sgr A*. Our main approach is to compare resolved EHT data at 230 GHz and unresolved non-EHT observations from radio to X-ray wavelengths to predictions from a library of models based on time-dependent general relativistic magnetohydrodynamics simulations, including aligned, tilted, and stellar-wind-fed simulations; radiative transfer is performed assuming both thermal and nonthermal electron distribution functions. We test the models against 11 constraints drawn from EHT 230 GHz data and observations at 86 GHz, 2.2 μ m, and in the X-ray. All models fail at least one constraint. Light-curve variability provides a particularly severe constraint, failing nearly all strongly magnetized (magnetically arrested disk (MAD)) models and a large fraction of weakly magnetized models. A number of models fail only the variability constraints. We identify a promising cluster of these models, which are MAD and have inclination i ≤ 30°. They have accretion rate (5.2–9.5) × 10 −9 M ⊙ yr −1 , bolometric luminosity (6.8–9.2) × 10 35 erg s −1 , and outflow power (1.3–4.8) × 10 38 erg s −1 . We also find that all models with i ≥ 70° fail at least two constraints, as do all models with equal ion and electron temperature; exploratory, nonthermal model sets tend to have higher 2.2 μ m flux density; and the population of cold electrons is limited by X-ray constraints due to the risk of bremsstrahlung overproduction. Finally, we discuss physical and numerical limitations of the models, highlighting the possible importance of kinetic effects and duration of the simulations.

A modelling system for the exhaust emissions of marine traffic and its application in the Baltic Sea area

Jukka-Pekka Jalkanen, Anders Brink, Juha Kalli, Heidi Pettersson +2 more

2009· Atmospheric chemistry and physics495doi:10.5194/acp-9-9209-2009

Abstract. A method is presented for the evaluation of the exhaust emissions of marine traffic, based on the messages provided by the Automatic Identification System (AIS), which enable the identification and location determination of ships. The use of the AIS data facilitates the positioning of ship emissions with a high spatial resolution, which is limited only by the inaccuracies of the Global Positioning System (typically a few metres) that is used in vessel navigation. The emissions are computed based on the relationship of the instantaneous speed to the design speed, and the detailed technical information of the engines of the ships. The modelling of emissions is also based on a few basic principles of ship design, including the modelling of the propelling power of each vessel in terms of its speed. We have investigated the effect of waves on the consumption of fuel, and on the emissions to the atmosphere. The predictions of fuel consumption were compared with the actual values obtained from the shipowners. For a Roll on – Roll off cargo/passenger ship (RoPax), the predicted and reported values of annual fuel consumption agreed within an accuracy of 6%. According to the data analysis and model computations, the emissions of NOx, SOx and CO2 originating from ships in the Baltic Sea during the full calendar year of 2007 were in total 400 kt, 138 kt and 19 Mt, respectively. A breakdown of emissions by flag state, the type of ship and the year of construction is also presented. The modelling system can be used as a decision support tool in the case of issues concerning, e.g., the health effects caused by shipping emissions or the construction of emission-based fairway dues systems or emissions trading. The computation of emissions can be automated, which will save resources in constructing emission inventories. Both the methodologies and the emission computation program can be applied in any sea region in the world, provided that the AIS data from that specific region are available.

BioInfer: a corpus for information extraction in the biomedical domain

Sampo Pyysalo, Filip Ginter, Juho Heimonen, Jari Björne +3 more

2007· BMC Bioinformatics482doi:10.1186/1471-2105-8-50

BACKGROUND: Lately, there has been a great interest in the application of information extraction methods to the biomedical domain, in particular, to the extraction of relationships of genes, proteins, and RNA from scientific publications. The development and evaluation of such methods requires annotated domain corpora. RESULTS: We present BioInfer (Bio Information Extraction Resource), a new public resource providing an annotated corpus of biomedical English. We describe an annotation scheme capturing named entities and their relationships along with a dependency analysis of sentence syntax. We further present ontologies defining the types of entities and relationships annotated in the corpus. Currently, the corpus contains 1100 sentences from abstracts of biomedical research articles annotated for relationships, named entities, as well as syntactic dependencies. Supporting software is provided with the corpus. The corpus is unique in the domain in combining these annotation types for a single set of sentences, and in the level of detail of the relationship annotation. CONCLUSION: We introduce a corpus targeted at protein, gene, and RNA relationships which serves as a resource for the development of information extraction systems and their components such as parsers and domain analyzers. The corpus will be maintained and further developed with a current version being available at http://www.it.utu.fi/BioInfer.

The CAFA challenge reports improved protein function prediction and new functional annotations for hundreds of genes through experimental screens

Naihui Zhou, Yuxiang Jiang, Timothy Bergquist, Alexandra Lee +4 more

2019· Genome biology478doi:10.1186/s13059-019-1835-8

BACKGROUND: The Critical Assessment of Functional Annotation (CAFA) is an ongoing, global, community-driven effort to evaluate and improve the computational annotation of protein function. RESULTS: Here, we report on the results of the third CAFA challenge, CAFA3, that featured an expanded analysis over the previous CAFA rounds, both in terms of volume of data analyzed and the types of analysis performed. In a novel and major new development, computational predictions and assessment goals drove some of the experimental assays, resulting in new functional annotations for more than 1000 genes. Specifically, we performed experimental whole-genome mutation screening in Candida albicans and Pseudomonas aureginosa genomes, which provided us with genome-wide experimental data for genes associated with biofilm formation and motility. We further performed targeted assays on selected genes in Drosophila melanogaster, which we suspected of being involved in long-term memory. CONCLUSION: We conclude that while predictions of the molecular function and biological process annotations have slightly improved over time, those of the cellular component have not. Term-centric prediction of experimental annotations remains equally challenging; although the performance of the top methods is significantly better than the expectations set by baseline methods in C. albicans and D. melanogaster, it leaves considerable room and need for improvement. Finally, we report that the CAFA community now involves a broad range of participants with expertise in bioinformatics, biological experimentation, biocuration, and bio-ontologies, working together to improve functional annotation, computational function prediction, and our ability to manage big data in the era of large experimental screens.

Search all NobleBlocks papers mentioning “Turku Centre for Computer Science” →