Center for Systems Biology Dresden
facilityDresden, Saxony, Germany
Research output, citation impact, and the most-cited recent papers from Center for Systems Biology Dresden (Germany). Aggregated across the NobleBlocks index of 300M+ scholarly works.
Top-cited papers from Center for Systems Biology Dresden
The many functional partnerships and interactions that occur between proteins are at the core of cellular processing and their systematic characterization helps to provide context in molecular systems biology. However, known and predicted interactions are scattered over multiple resources, and the available data exhibit notable differences in terms of quality and completeness. The STRING database (http://string-db.org) aims to provide a critical assessment and integration of protein-protein interactions, including direct (physical) as well as indirect (functional) associations. The new version 10.0 of STRING covers more than 2000 organisms, which has necessitated novel, scalable algorithms for transferring interaction information between organisms. For this purpose, we have introduced hierarchical and self-consistent orthology annotations for all interacting proteins, grouping the proteins into families at various levels of phylogenetic resolution. Further improvements in version 10.0 include a completely redesigned prediction pipeline for inferring protein-protein associations from co-expression data, an API interface for the R computing environment and improved statistical analysis for enrichment tests in user-provided networks.
Abstract High-quality and complete reference genome assemblies are fundamental for the application of genomics to biology, disease, and biodiversity conservation. However, such assemblies are available for only a few non-microbial species 1–4 . To address this issue, the international Genome 10K (G10K) consortium 5,6 has worked over a five-year period to evaluate and develop cost-effective methods for assembling highly accurate and nearly complete reference genomes. Here we present lessons learned from generating assemblies for 16 species that represent six major vertebrate lineages. We confirm that long-read sequencing technologies are essential for maximizing genome quality, and that unresolved complex repeats and haplotype heterozygosity are major sources of assembly error when not handled correctly. Our assemblies correct substantial errors, add missing sequence in some of the best historical reference genomes, and reveal biological discoveries. These include the identification of many false gene duplications, increases in gene sizes, chromosome rearrangements that are specific to lineages, a repeated independent chromosome breakpoint in bat genomes, and a canonical GC-rich pattern in protein-coding genes and their regulatory regions. Adopting these lessons, we have embarked on the Vertebrate Genomes Project (VGP), an international effort to generate high-quality, complete reference genomes for all of the roughly 70,000 extant vertebrate species and to help to enable a new era of discovery across the life sciences.
autophagic responses. Here, we critically discuss current methods of assessing autophagy and the information they can, or cannot, provide. Our ultimate goal is to encourage intellectual and technical innovation in the field.
The field of image denoising is currently dominated by discriminative deep learning methods that are trained on pairs of noisy input and clean target images. Recently it has been shown that such methods can also be trained without clean targets. Instead, independent pairs of noisy images can be used, in an approach known as Noise2Noise (N2N). Here, we introduce Noise2Void (N2V), a training scheme that takes this idea one step further. It does not require noisy image pairs, nor clean target images. Consequently, N2V allows us to train directly on the body of data to be denoised and can therefore be applied when other methods cannot. Especially interesting is the application to biomedical image data, where the acquisition of training targets, clean or noisy, is frequently not possible. We compare the performance of N2V to approaches that have either clean target images and/or noisy image pairs available. Intuitively, N2V cannot be expected to outperform methods that have more information available during training. Still, we observe that the denoising performance of Noise2Void drops in moderation and compares favorably to training-free denoising methods.
RNA and membraneless organelles Membraneless compartments can form in cells through liquidliquid phase separation (see the Perspective by Polymenidou). But what prevents these cellular condensates from randomly fusing together? Using the RNA-binding protein (RBP) Whi3, Langdon et al. demonstrated that the secondary structure of different RNA components determines the distinct biophysical and biological properties of the two types of condensates that Whi3 forms. Several RBPs, such as FUS and TDP43, contain prion-like domains and are linked to neurodegenerative diseases. These RBPs are usually soluble in the nucleus but can form pathological aggregates in the cytoplasm. Maharana et al. showed that local RNA concentrations determine distinct phase separation behaviors in different subcellular locations. The higher RNA concentrations in the nucleus act as a buffer to prevent phase separation of RBPs; when mislocalized to the cytoplasm, lower RNA concentrations trigger aggregation. Science , this issue p. 922 , p. 918 ; see also p. 859
The coordinated expression of highly related homoeologous genes in polyploid species underlies the phenotypes of many of the world's major crops. Here we combine extensive gene expression datasets to produce a comprehensive, genome-wide analysis of homoeolog expression patterns in hexaploid bread wheat. Bias in homoeolog expression varies between tissues, with ~30% of wheat homoeologs showing nonbalanced expression. We found expression asymmetries along wheat chromosomes, with homoeologs showing the largest inter-tissue, inter-cultivar, and coding sequence variation, most often located in high-recombination distal ends of chromosomes. These transcriptionally dynamic genes potentially represent the first steps toward neo- or subfunctionalization of wheat homoeologs. Coexpression networks reveal extensive coordination of homoeologs throughout development and, alongside a detailed expression atlas, provide a framework to target candidate genes underpinning agronomic traits in wheat.
We have made rapid progress in recent years in identifying the genetic causes of many human diseases. However, despite this recent progress, our mechanistic understanding of these diseases is often incomplete. This is a problem because it limits our ability to develop effective disease treatments. To overcome this limitation, we need new concepts to describe and comprehend the complex mechanisms underlying human diseases. Condensate formation by phase separation emerges as a new principle to explain the organization of living cells. In this review, we present emerging evidence that aberrant forms of condensates are associated with many human diseases, including cancer, neurodegeneration, and infectious diseases. We examine disease mechanisms driven by aberrant condensates, and we point out opportunities for therapeutic interventions. We conclude that phase separation provides a useful new framework to understand and fight some of the most severe human diseases.
The allohexaploid bread wheat genome consists of three closely related subgenomes (A, B, and D), but a clear understanding of their phylogenetic history has been lacking. We used genome assemblies of bread wheat and five diploid relatives to analyze genome-wide samples of gene trees, as well as to estimate evolutionary relatedness and divergence times. We show that the A and B genomes diverged from a common ancestor ~7 million years ago and that these genomes gave rise to the D genome through homoploid hybrid speciation 1 to 2 million years later. Our findings imply that the present-day bread wheat genome is a product of multiple rounds of hybrid speciation (homoploid and polyploid) and lay the foundation for a new framework for understanding the wheat genome as a multilevel phylogenetic mosaic.
High-throughput sequencing for transcript profiling in plants has revealed that alternative splicing (AS) affects a much higher proportion of the transcriptome than was previously assumed. AS is involved in most plant processes and is particularly prevalent in plants exposed to environmental stress. The identification of mutations in predicted splicing factors and spliceosomal proteins that affect cell fate, the circadian clock, plant defense, and tolerance/sensitivity to abiotic stress all point to a fundamental role of splicing/AS in plant growth, development, and responses to external cues. Splicing factors affect the AS of multiple downstream target genes, thereby transferring signals to alter gene expression via splicing factor/AS networks. The last two to three years have seen an ever-increasing number of examples of functional AS. At a time when the identification of AS in individual genes and at a global level is exploding, this review aims to bring together such examples to illustrate the extent and importance of AS, which are not always obvious from individual publications. It also aims to ensure that plant scientists are aware that AS is likely to occur in the genes that they study and that dynamic changes in AS and its consequences need to be considered routinely.
The novel coronavirus severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) is the cause of COVID-19. The main receptor of SARS-CoV-2, angiotensin I converting enzyme 2 (ACE2), is now undergoing extensive scrutiny to understand the routes of transmission and sensitivity in different species. Here, we utilized a unique dataset of ACE2 sequences from 410 vertebrate species, including 252 mammals, to study the conservation of ACE2 and its potential to be used as a receptor by SARS-CoV-2. We designed a five-category binding score based on the conservation properties of 25 amino acids important for the binding between ACE2 and the SARS-CoV-2 spike protein. Only mammals fell into the medium to very high categories and only catarrhine primates into the very high category, suggesting that they are at high risk for SARS-CoV-2 infection. We employed a protein structural analysis to qualitatively assess whether amino acid changes at variable residues would be likely to disrupt ACE2/SARS-CoV-2 spike protein binding and found the number of predicted unfavorable changes significantly correlated with the binding score. Extending this analysis to human population data, we found only rare (frequency <0.001) variants in 10/25 binding sites. In addition, we found significant signals of selection and accelerated evolution in the ACE2 coding sequence across all mammals, and specific to the bat lineage. Our results, if confirmed by additional experimental data, may lead to the identification of intermediate host species for SARS-CoV-2, guide the selection of animal models of COVID-19, and assist the conservation of animals both in native habitats and in human care.
Rubisco is a large enzyme with a molecular mass of approximately 550 kD. The maximum rate of CO2 fixation (i.e. ribulose-1,5-bisphosphate [RuBP] carboxylation) at CO2 saturation is only 15 to 30 mol CO2 mol−1 Rubisco protein s−1 at 25°C. Affinity to CO2 is also low, and the K m, K c, at 25°C
Abstract Salamanders serve as important tetrapod models for developmental, regeneration and evolutionary studies. An extensive molecular toolkit makes the Mexican axolotl ( Ambystoma mexicanum ) a key representative salamander for molecular investigations. Here we report the sequencing and assembly of the 32-gigabase-pair axolotl genome using an approach that combined long-read sequencing, optical mapping and development of a new genome assembler (MARVEL). We observed a size expansion of introns and intergenic regions, largely attributable to multiplication of long terminal repeat retroelements. We provide evidence that intron size in developmental genes is under constraint and that species-restricted genes may contribute to limb regeneration. The axolotl genome assembly does not contain the essential developmental gene Pax3 . However, mutation of the axolotl Pax3 paralogue Pax7 resulted in an axolotl phenotype that was similar to those seen in Pax3 −/− and Pax7 −/− mutant mice. The axolotl genome provides a rich biological resource for developmental and evolutionary studies.
The SuperPred web server connects chemical similarity of drug-like compounds with molecular targets and the therapeutic approach based on the similar property principle. Since the first release of this server, the number of known compound-target interactions has increased from 7000 to 665,000, which allows not only a better prediction quality but also the estimation of a confidence. Apart from the addition of quantitative binding data and the statistical consideration of the similarity distribution in all drug classes, new approaches were implemented to improve the target prediction. The 3D similarity as well as the occurrence of fragments and the concordance of physico-chemical properties is also taken into account. In addition, the effect of different fingerprints on the prediction was examined. The retrospective prediction of a drug class (ATC code of the WHO) allows the evaluation of methods and descriptors for a well-characterized set of approved drugs. The prediction is improved by 7.5% to a total accuracy of 75.1%. For query compounds with sufficient structural similarity, the web server allows prognoses about the medical indication area of novel compounds and to find new leads for known targets. SuperPred is publicly available without registration at: http://prediction.charite.de.
Deep Learning (DL) methods are powerful analytical tools for microscopy and can outperform conventional image processing pipelines. Despite the enthusiasm and innovations fuelled by DL technology, the need to access powerful and compatible resources to train DL networks leads to an accessibility barrier that novice users often find difficult to overcome. Here, we present ZeroCostDL4Mic, an entry-level platform simplifying DL access by leveraging the free, cloud-based computational resources of Google Colab. ZeroCostDL4Mic allows researchers with no coding expertise to train and apply key DL networks to perform tasks including segmentation (using U-Net and StarDist), object detection (using YOLOv2), denoising (using CARE and Noise2Void), super-resolution microscopy (using Deep-STORM), and image-to-image translation (using Label-free prediction - fnet, pix2pix and CycleGAN). Importantly, we provide suitable quantitative tools for each network to evaluate model performance, allowing model optimisation. We demonstrate the application of the platform to study multiple biological processes.
Liquid-liquid phase separation of proteins underpins the formation of membraneless compartments in living cells. Elucidating the molecular driving forces underlying protein phase transitions is therefore a key objective for understanding biological function and malfunction. Here we show that cellular proteins, which form condensates at low salt concentrations, including FUS, TDP-43, Brd4, Sox2, and Annexin A11, can reenter a phase-separated regime at high salt concentrations. By bringing together experiments and simulations, we demonstrate that this reentrant phase transition in the high-salt regime is driven by hydrophobic and non-ionic interactions, and is mechanistically distinct from the low-salt regime, where condensates are additionally stabilized by electrostatic forces. Our work thus sheds light on the cooperation of hydrophobic and non-ionic interactions as general driving forces in the condensation process, with important implications for aberrant function, druggability, and material properties of biomolecular condensates.
Animal genomes are folded into loops and topologically associating domains (TADs) by CTCF and loop-extruding cohesins, but the live dynamics of loop formation and stability remain unknown. Here, we directly visualized chromatin looping at the Fbn2 TAD in mouse embryonic stem cells using super-resolution live-cell imaging and quantified looping dynamics by Bayesian inference. Unexpectedly, the Fbn2 loop was both rare and dynamic, with a looped fraction of approximately 3 to 6.5% and a median loop lifetime of approximately 10 to 30 minutes. Our results establish that the Fbn2 TAD is highly dynamic, and about 92% of the time, cohesin-extruded loops exist within the TAD without bridging both CTCF boundaries. This suggests that single CTCF boundaries, rather than the fully CTCF-CTCF looped state, may be the primary regulators of functional interactions.
The nucleus contains diverse phase-separated condensates that compartmentalize and concentrate biomolecules with distinct physicochemical properties. Here, we investigated whether condensates concentrate small-molecule cancer therapeutics such that their pharmacodynamic properties are altered. We found that antineoplastic drugs become concentrated in specific protein condensates in vitro and that this occurs through physicochemical properties independent of the drug target. This behavior was also observed in tumor cells, where drug partitioning influenced drug activity. Altering the properties of the condensate was found to affect the concentration and activity of drugs. These results suggest that selective partitioning and concentration of small molecules within condensates contributes to drug pharmacodynamics and that further understanding of this phenomenon may facilitate advances in disease therapy.
Abstract Following DNA damage caused by exogenous sources, such as ionizing radiation, the tumour suppressor p53 mediates cell cycle arrest via expression of the CDK inhibitor, p21. However, the role of p21 in maintaining genomic stability in the absence of exogenous DNA-damaging agents is unclear. Here, using live single-cell measurements of p21 protein in proliferating cultures, we show that naturally occurring DNA damage incurred over S-phase causes p53-dependent accumulation of p21 during mother G2- and daughter G1-phases. High p21 levels mediate G1 arrest via CDK inhibition, yet lower levels have no impact on G1 progression, and the ubiquitin ligases CRL4 Cdt2 and SCF Skp2 couple to degrade p21 prior to the G1/S transition. Mathematical modelling reveals that a bistable switch, created by CRL4 Cdt2 , promotes irreversible S-phase entry by keeping p21 levels low, preventing premature S-phase exit upon DNA damage. Thus, we characterize how p21 regulates the proliferation-quiescence decision to maintain genomic stability.
Non-centrosomal microtubule bundles play important roles in cellular organization and function. Although many diverse proteins are known that can bundle microtubules, biochemical mechanisms by which cells could locally control the nucleation and formation of microtubule bundles are understudied. Here, we demonstrate that the concentration of tubulin into a condensed, liquid-like compartment composed of the unstructured neuronal protein tau is sufficient to nucleate microtubule bundles. We show that, under conditions of macro-molecular crowding, tau forms liquid-like drops. Tubulin partitions into these drops, efficiently increasing tubulin concentration and driving the nucleation of microtubules. These growing microtubules form bundles, which deform the drops while remaining enclosed by diffusible tau molecules exhibiting a liquid-like behavior. Our data suggest that condensed compartments of microtubule bundling proteins could promote the local formation of microtubule bundles in neurons by acting as non-centrosomal microtubule nucleation centers and that liquid-like tau encapsulation could provide both stability and plasticity to long axonal microtubule bundles.
Abstract Bats possess extraordinary adaptations, including flight, echolocation, extreme longevity and unique immunity. High-quality genomes are crucial for understanding the molecular basis and evolution of these traits. Here we incorporated long-read sequencing and state-of-the-art scaffolding protocols 1 to generate, to our knowledge, the first reference-quality genomes of six bat species ( Rhinolophus ferrumequinum , Rousettus aegyptiacus , Phyllostomus discolor , Myotis myotis , Pipistrellus kuhlii and Molossus molossus ). We integrated gene projections from our ‘Tool to infer Orthologs from Genome Alignments’ (TOGA) software with de novo and homology gene predictions as well as short- and long-read transcriptomics to generate highly complete gene annotations. To resolve the phylogenetic position of bats within Laurasiatheria, we applied several phylogenetic methods to comprehensive sets of orthologous protein-coding and noncoding regions of the genome, and identified a basal origin for bats within Scrotifera. Our genome-wide screens revealed positive selection on hearing-related genes in the ancestral branch of bats, which is indicative of laryngeal echolocation being an ancestral trait in this clade. We found selection and loss of immunity-related genes (including pro-inflammatory NF-κB regulators) and expansions of anti-viral APOBEC3 genes, which highlights molecular mechanisms that may contribute to the exceptional immunity of bats. Genomic integrations of diverse viruses provide a genomic record of historical tolerance to viral infection in bats. Finally, we found and experimentally validated bat-specific variation in microRNAs, which may regulate bat-specific gene-expression programs. Our reference-quality bat genomes provide the resources required to uncover and validate the genomic basis of adaptations of bats, and stimulate new avenues of research that are directly relevant to human health and disease 1 .