Center for Theoretical Biological Physics

facilityHouston, Texas, United States

Research output, citation impact, and the most-cited recent papers from Center for Theoretical Biological Physics (United States). Aggregated across the NobleBlocks index of 300M+ scholarly works.

Total works

3.6K

Citations

452.0K

h-index

275

i10-index

4.5K

Also known as

Center for Theoretical Biological Physics

Top-cited papers from Center for Theoretical Biological Physics

Juicer Provides a One-Click System for Analyzing Loop-Resolution Hi-C Experiments

Neva C. Durand, Muhammad S. Shamim, Ido Machol, Suhas S.P. Rao +3 more

2016· Cell Systems4.2Kdoi:10.1016/j.cels.2016.07.002

Hi-C experiments explore the 3D structure of the genome, generating terabases of data to create high-resolution contact maps. Here, we introduce Juicer, an open-source tool for analyzing terabase-scale Hi-C datasets. Juicer allows users without a computational background to transform raw sequence data into normalized contact maps with one click. Juicer produces a hic file containing compressed contact matrices at many resolutions, facilitating visualization and analysis at multiple scales. Structural features, such as loops and domains, are automatically annotated. Juicer is available as open source software at http://aidenlab.org/juicer/.

De novo assembly of the <i>Aedes aegypti</i> genome using Hi-C yields chromosome-length scaffolds

Olga Dudchenko, Sanjit Singh Batra, Arina D. Omer, Sarah K. Nyquist +4 more

2017· Science2.9Kdoi:10.1126/science.aal3327

Hi-C for mosquito genomes Most genomes sequenced today are determined through the generation of short sequenced bits of DNA that are computationally pieced together like a jigsaw puzzle. This has resulted in the need for funds and additional data to fill in gaps in order to fully assemble the many chromosomes that make up a eukaryotic genome. Dudchenko et al. used the Hi-C method, which measures the distance between contact points within and between chromosomes for scaffold validation, together with correction and ordering to more completely determine the arrangement of short sequencing reads for genome mapping. They validated their approach through the de novo generation of a complete human genome. A comparative analysis of mosquito genomes was made possible by improving the Culex quinquefasciatus genome assembly and generating the genome of Aedes aegypti , the vector of Zika virus. Science , this issue p. 92

Juicebox Provides a Visualization System for Hi-C Contact Maps with Unlimited Zoom

Neva C. Durand, James Robinson, Muhammad S. Shamim, Ido Machol +3 more

2016· Cell Systems2.5Kdoi:10.1016/j.cels.2015.07.012

Hi-C experiments study how genomes fold in 3D, generating contact maps containing features as small as 20 bp and as large as 200 Mb. Here we introduce Juicebox, a tool for exploring Hi-C and other contact map data. Juicebox allows users to zoom in and out of Hi-C maps interactively, just as a user of Google Earth might zoom in and out of a geographic map. Maps can be compared to one another, or to 1D tracks or 2D feature sets.

Thirty years of density functional theory in computational chemistry: an overview and extensive assessment of 200 density functionals

Narbe Mardirossian, Martin Head‐Gordon

2017· Molecular Physics2.3Kdoi:10.1080/00268976.2017.1333644

In the past 30 years, Kohn–Sham density functional theory has emerged as the most popular electronic structuremethod in computational chemistry. To assess the ever-increasing number of approximate exchange-correlation functionals, this review benchmarks a total of 200 density functionals on a molecular database (MGCDB84) of nearly 5000 data points. The database employed, provided as Supplemental Data, is comprised of 84 data-sets and contains non-covalent interactions, isomerisation energies, thermochemistry, and barrier heights. In addition, the evolution of nonempirical and semi-empirical density functional design is reviewed, and guidelines are provided for the proper and effective use of density functionals. The most promising functional considered is ωB97M-V, a range-separated hybrid meta-GGA with VV10 nonlocal correlation, designed using a combinatorial approach. From the local GGAs, B97-D3, revPBE-D3, and BLYP-D3 are recommended, while from the local meta-GGAs, B97M-rV is the leading choice, followed by MS1-D3 and M06-LD3. The best hybrid GGAs are ωB97X-V, ωB97X-D3, and ωB97X-D, while useful hybrid meta-GGAs (besides ωB97M-V) include ωM05-D, M06-2X-D3, and MN15. Ultimately, today’s state-of-the-art functionals are close to achieving the level of accuracy desired for a broad range of chemical applications, and the principal remaining limitations are associated with systems that exhibit significant selfinteraction/ delocalisation errors and/or strong correlation effects.

String theory

Brian Greene, David R. Morrison, Joseph Polchinski

1998· Proceedings of the National Academy of Sciences2.0Kdoi:10.1073/pnas.95.19.11039

Proceedings of the National Academy of Sciences (PNAS), a peer reviewed journal of the National Academy of Sciences (NAS) - an authoritative source of high-impact, original research that broadly spans the biological, physical, and social sciences.

Chromatin extrusion explains key features of loop and domain formation in wild-type and engineered genomes

Adrian L. Sanborn, Suhas S.P. Rao, Su-Chen Huang, Neva C. Durand +4 more

2015· Proceedings of the National Academy of Sciences1.9Kdoi:10.1073/pnas.1518552112

We recently used in situ Hi-C to create kilobase-resolution 3D maps of mammalian genomes. Here, we combine these maps with new Hi-C, microscopy, and genome-editing experiments to study the physical structure of chromatin fibers, domains, and loops. We find that the observed contact domains are inconsistent with the equilibrium state for an ordinary condensed polymer. Combining Hi-C data and novel mathematical theorems, we show that contact domains are also not consistent with a fractal globule. Instead, we use physical simulations to study two models of genome folding. In one, intermonomer attraction during polymer condensation leads to formation of an anisotropic "tension globule." In the other, CCCTC-binding factor (CTCF) and cohesin act together to extrude unknotted loops during interphase. Both models are consistent with the observed contact domains and with the observation that contact domains tend to form inside loops. However, the extrusion model explains a far wider array of observations, such as why loops tend not to overlap and why the CTCF-binding motifs at pairs of loop anchors lie in the convergent orientation. Finally, we perform 13 genome-editing experiments examining the effect of altering CTCF-binding sites on chromatin folding. The convergent rule correctly predicts the affected loops in every case. Moreover, the extrusion model accurately predicts in silico the 3D maps resulting from each experiment using only the location of CTCF-binding sites in the WT. Thus, we show that it is possible to disrupt, restore, and move loops and domains using targeted mutations as small as a single base pair.

Density-Functional Theory of the Energy Gap

L. J. Sham, M. Schlüter

1983· Physical Review Letters1.9Kdoi:10.1103/physrevlett.51.1888

The energy-band gap of an insulator is obtained from the eigenvalues of the one-particle density-functional equation for the ground state and a finite correction due to the discontinuity of the functional derivative of the exchange and correlation energy. This correction is expressed in terms of the improper self-energy and the density-functional exchange-correlation potential. It is evaluated for a two-plane-wave model including exchange only.

Direct-coupling analysis of residue coevolution captures native contacts across many protein families

Faruck Morcos, Andrea Pagnani, Bryan Lunt, Arianna Bertolino +4 more

2011· Proceedings of the National Academy of Sciences1.6Kdoi:10.1073/pnas.1111471108

The similarity in the three-dimensional structures of homologous proteins imposes strong constraints on their sequence variability. It has long been suggested that the resulting correlations among amino acid compositions at different sequence positions can be exploited to infer spatial contacts within the tertiary protein structure. Crucial to this inference is the ability to disentangle direct and indirect correlations, as accomplished by the recently introduced direct-coupling analysis (DCA). Here we develop a computationally efficient implementation of DCA, which allows us to evaluate the accuracy of contact prediction by DCA for a large number of protein domains, based purely on sequence information. DCA is shown to yield a large number of correctly predicted contacts, recapitulating the global structure of the contact map for the majority of the protein domains examined. Furthermore, our analysis captures clear signals beyond intradomain residue contacts, arising, e.g., from alternative protein conformations, ligand-mediated residue couplings, and interdomain interactions in protein oligomers. Our findings suggest that contacts predicted by DCA can be used as a reliable guide to facilitate computational predictions of alternative protein conformations, protein complex formation, and even the de novo prediction of protein domain structures, contingent on the existence of a large number of homologous sequences which are being rapidly made available due to advances in genome sequencing.

Interdependence of Cell Growth and Gene Expression: Origins and Consequences

Matthew P. Scott, Carl W. Gunderson, Eduard M. Mateescu, Zhongge Zhang +1 more

2010· Science1.6Kdoi:10.1126/science.1192588

In bacteria, the rate of cell proliferation and the level of gene expression are intimately intertwined. Elucidating these relations is important both for understanding the physiological functions of endogenous genetic circuits and for designing robust synthetic systems. We describe a phenomenological study that reveals intrinsic constraints governing the allocation of resources toward protein synthesis and other aspects of cell growth. A theory incorporating these constraints can accurately predict how cell proliferation and gene expression affect one another, quantitatively accounting for the effect of translation-inhibiting antibiotics on gene expression and the effect of gratuitous protein expression on cell growth. The use of such empirical relations, analogous to phenomenological laws, may facilitate our understanding and manipulation of complex biological systems before underlying regulatory circuits are elucidated.

Accelerated molecular dynamics: A promising and efficient simulation method for biomolecules

Donald Hamelberg, John Mongan, J. Andrew McCammon

2004· The Journal of Chemical Physics1.6Kdoi:10.1063/1.1755656

Many interesting dynamic properties of biological molecules cannot be simulated directly using molecular dynamics because of nanosecond time scale limitations. These systems are trapped in potential energy minima with high free energy barriers for large numbers of computational steps. The dynamic evolution of many molecular systems occurs through a series of rare events as the system moves from one potential energy basin to another. Therefore, we have proposed a robust bias potential function that can be used in an efficient accelerated molecular dynamics approach to simulate the transition of high energy barriers without any advance knowledge of the location of either the potential energy wells or saddle points. In this method, the potential energy landscape is altered by adding a bias potential to the true potential such that the escape rates from potential wells are enhanced, which accelerates and extends the time scale in molecular dynamics simulations. Our definition of the bias potential echoes the underlying shape of the potential energy landscape on the modified surface, thus allowing for the potential energy minima to be well defined, and hence properly sampled during the simulation. We have shown that our approach, which can be extended to biomolecules, samples the conformational space more efficiently than normal molecular dynamics simulations, and converges to the correct canonical distribution.

Molecular dynamics simulations and drug discovery

Jacob D. Durrant, J. Andrew McCammon

2011· BMC Biology1.4Kdoi:10.1186/1741-7007-9-71

This review discusses the many roles atomistic computer simulations of macromolecular (for example, protein) receptors and their associated small-molecule ligands can play in drug discovery, including the identification of cryptic or allosteric binding sites, the enhancement of traditional virtual-screening methodologies, and the direct prediction of small-molecule binding energies. The limitations of current simulation methodologies, including the high computational costs and approximations of molecular forces required, are also discussed. With constant improvements in both computer power and algorithm design, the future of computer-aided drug design is promising; molecular dynamics simulations are likely to play an increasingly important role.

Molecular Dynamics: Survey of Methods for Simulating the Activity of Proteins

Stewart A. Adcock, J. Andrew McCammon

2006· Chemical Reviews1.3Kdoi:10.1021/cr040426m

ADVERTISEMENT RETURN TO ISSUEPREVArticleNEXTMolecular Dynamics: Survey of Methods for Simulating the Activity of ProteinsStewart A. Adcock and J. Andrew McCammonView Author Information NSF Center for Theoretical Biological Physics, Department of Chemistry and Biochemistry, University of California at San Diego, 9500 Gilman Drive, La Jolla, California 92093-0365 Cite this: Chem. Rev. 2006, 106, 5, 1589–1615Publication Date (Web):February 9, 2006Publication History Received15 September 2004Published online9 February 2006Published inissue 1 May 2006https://pubs.acs.org/doi/10.1021/cr040426mhttps://doi.org/10.1021/cr040426mresearch-articleACS PublicationsCopyright © 2006 American Chemical SocietyRequest reuse permissionsArticle Views13005Altmetric-Citations913LEARN ABOUT THESE METRICSArticle Views are the COUNTER-compliant sum of full text article downloads since November 2008 (both PDF and HTML) across all institutions and individuals. These metrics are regularly updated to reflect usage leading up to the last few days.Citations are the number of other articles citing this article, calculated by Crossref and updated daily. Find more information about Crossref citation counts.The Altmetric Attention Score is a quantitative measure of the attention that a research article has received online. Clicking on the donut icon will load a page at altmetric.com with additional details about the score and the social media presence for the given article. Find more information on the Altmetric Attention Score and how the score is calculated. Share Add toView InAdd Full Text with ReferenceAdd Description ExportRISCitationCitation and abstractCitation and referencesMore Options Share onFacebookTwitterWechatLinked InRedditEmail Other access optionsGet e-Alertsclose SUBJECTS:Algorithms,Computer simulations,Molecular dynamics,Molecular mechanics,Solvents Get e-Alerts

<i>ω</i>B97M-V: A combinatorially optimized, range-separated hybrid, meta-GGA density functional with VV10 nonlocal correlation

Narbe Mardirossian, Martin Head‐Gordon

2016· The Journal of Chemical Physics1.2Kdoi:10.1063/1.4952647

A combinatorially optimized, range-separated hybrid, meta-GGA density functional with VV10 nonlocal correlation is presented. The final 12-parameter functional form is selected from approximately 10 × 10(9) candidate fits that are trained on a training set of 870 data points and tested on a primary test set of 2964 data points. The resulting density functional, ωB97M-V, is further tested for transferability on a secondary test set of 1152 data points. For comparison, ωB97M-V is benchmarked against 11 leading density functionals including M06-2X, ωB97X-D, M08-HX, M11, ωM05-D, ωB97X-V, and MN15. Encouragingly, the overall performance of ωB97M-V on nearly 5000 data points clearly surpasses that of all of the tested density functionals. In order to facilitate the use of ωB97M-V, its basis set dependence and integration grid sensitivity are thoroughly assessed, and recommendations that take into account both efficiency and accuracy are provided.

Identification of direct residue contacts in protein–protein interaction by message passing

Martin Weigt, Robert A. White, Hendrik Szurmant, James A. Hoch +1 more

2008· Proceedings of the National Academy of Sciences1.1Kdoi:10.1073/pnas.0805923106

Understanding the molecular determinants of specificity in protein-protein interaction is an outstanding challenge of postgenome biology. The availability of large protein databases generated from sequences of hundreds of bacterial genomes enables various statistical approaches to this problem. In this context covariance-based methods have been used to identify correlation between amino acid positions in interacting proteins. However, these methods have an important shortcoming, in that they cannot distinguish between directly and indirectly correlated residues. We developed a method that combines covariance analysis with global inference analysis, adopted from use in statistical physics. Applied to a set of >2,500 representatives of the bacterial two-component signal transduction system, the combination of covariance with global inference successfully and robustly identified residue pairs that are proximal in space without resorting to ad hoc tuning parameters, both for heterointeractions between sensor kinase (SK) and response regulator (RR) proteins and for homointeractions between RR proteins. The spectacular success of this approach illustrates the effectiveness of the global inference approach in identifying direct interaction based on sequence information alone. We expect this method to be applicable soon to interaction surfaces between proteins present in only 1 copy per genome as the number of sequenced genomes continues to expand. Use of this method could significantly increase the potential targets for therapeutic intervention, shed light on the mechanism of protein-protein interaction, and establish the foundation for the accurate prediction of interacting protein partners.

WATER MEDIATION IN PROTEIN FOLDING AND MOLECULAR RECOGNITION

Yaakov Levy, José N. Onuchic

2006· Annual Review of Biophysics and Biomolecular Structure1.1Kdoi:10.1146/annurev.biophys.35.040405.102134

Water is essential for life in many ways, and without it biomolecules might no longer truly be biomolecules. In particular, water is important to the structure, stability, dynamics, and function of biological macromolecules. In protein folding, water mediates the collapse of the chain and the search for the native topology through a funneled energy landscape. Water actively participates in molecular recognition by mediating the interactions between binding partners and contributes to either enthalpic or entropic stabilization. Accordingly, water must be included in recognition and structure prediction codes to capture specificity. Thus water should not be treated as an inert environment, but rather as an integral and active component of biomolecular systems, where it has both dynamic and structural roles. Focusing on water sheds light on the physics and function of biological machinery and self-assembly and may advance our understanding of the natural design of proteins and nucleic acids.

CaImAn an open source tool for scalable calcium imaging data analysis

Andrea Giovannucci, Johannes Friedrich, Pat Gunn, Jérémie Kalfon +4 more

2019· eLife1.0Kdoi:10.7554/elife.38173

Advances in fluorescence microscopy enable monitoring larger brain areas in-vivo with finer time resolution. The resulting data rates require reproducible analysis pipelines that are reliable, fully automated, and scalable to datasets generated over the course of months. We present CaImAn, an open-source library for calcium imaging data analysis. CaImAn provides automatic and scalable methods to address problems common to pre-processing, including motion correction, neural activity identification, and registration across different sessions of data collection. It does this while requiring minimal user intervention, with good scalability on computers ranging from laptops to high-performance computing clusters. CaImAn is suitable for two-photon and one-photon imaging, and also enables real-time analysis on streaming data. To benchmark the performance of CaImAn we collected and combined a corpus of manual annotations from multiple labelers on nine mouse two-photon datasets. We demonstrate that CaImAn achieves near-human performance in detecting locations of active neurons.

Machine Learning for Molecular Simulation

Frank Noé, Alexandre Tkatchenko, Klaus-Robert Müller, Cecilia Clementi

2020· Annual Review of Physical Chemistry878doi:10.1146/annurev-physchem-042018-052331

Machine learning (ML) is transforming all areas of science. The complex and time-consuming calculations in molecular simulations are particularly suitable for an ML revolution and have already been profoundly affected by the application of existing ML methods. Here we review recent ML methods for molecular simulation, with particular focus on (deep) neural networks for the prediction of quantum-mechanical energies and forces, on coarse-grained molecular dynamics, on the extraction of free energy surfaces and kinetics, and on generative network approaches to sample molecular equilibrium structures and compute thermodynamics. To explain these methods and illustrate open methodological problems, we review some important principles of molecular physics and describe how they can be incorporated into ML structures. Finally, we identify and describe a list of open challenges for the interface between ML and molecular simulation.

Electroencephalogram signatures of loss and recovery of consciousness from propofol

Patrick L. Purdon, Eric T. Pierce, Eran A. Mukamel, Michael J. Prerau +4 more

2013· Proceedings of the National Academy of Sciences846doi:10.1073/pnas.1221180110

Unconsciousness is a fundamental component of general anesthesia (GA), but anesthesiologists have no reliable ways to be certain that a patient is unconscious. To develop EEG signatures that track loss and recovery of consciousness under GA, we recorded high-density EEGs in humans during gradual induction of and emergence from unconsciousness with propofol. The subjects executed an auditory task at 4-s intervals consisting of interleaved verbal and click stimuli to identify loss and recovery of consciousness. During induction, subjects lost responsiveness to the less salient clicks before losing responsiveness to the more salient verbal stimuli; during emergence they recovered responsiveness to the verbal stimuli before recovering responsiveness to the clicks. The median frequency and bandwidth of the frontal EEG power tracked the probability of response to the verbal stimuli during the transitions in consciousness. Loss of consciousness was marked simultaneously by an increase in low-frequency EEG power (<1 Hz), the loss of spatially coherent occipital alpha oscillations (8-12 Hz), and the appearance of spatially coherent frontal alpha oscillations. These dynamics reversed with recovery of consciousness. The low-frequency phase modulated alpha amplitude in two distinct patterns. During profound unconsciousness, alpha amplitudes were maximal at low-frequency peaks, whereas during the transition into and out of unconsciousness, alpha amplitudes were maximal at low-frequency nadirs. This latter phase-amplitude relationship predicted recovery of consciousness. Our results provide insights into the mechanisms of propofol-induced unconsciousness, establish EEG signatures of this brain state that track transitions in consciousness precisely, and suggest strategies for monitoring the brain activity of patients receiving GA.

Efficient and accurate extraction of in vivo calcium signals from microendoscopic video data

Pengcheng Zhou, Shanna L. Resendez, Jose Rodríguez-Romaguera, Jessica Jimenez +4 more

2018· eLife828doi:10.7554/elife.28728

In vivo calcium imaging through microendoscopic lenses enables imaging of previously inaccessible neuronal populations deep within the brains of freely moving animals. However, it is computationally challenging to extract single-neuronal activity from microendoscopic data, because of the very large background fluctuations and high spatial overlaps intrinsic to this recording modality. Here, we describe a new constrained matrix factorization approach to accurately separate the background and then demix and denoise the neuronal signals of interest. We compared the proposed method against previous independent components analysis and constrained nonnegative matrix factorization approaches. On both simulated and experimental data recorded from mice, our method substantially improved the quality of extracted cellular signals and detected more well-isolated neural signals, especially in noisy data regimes. These advances can in turn significantly enhance the statistical power of downstream analyses, and ultimately improve scientific conclusions derived from microendoscopic data.

On schemes of combinatorial transcription logic

Nicolas E. Buchler, Ulrich Gerland, Terence Hwa

2003· Proceedings of the National Academy of Sciences708doi:10.1073/pnas.0930314100

Cells receive a wide variety of cellular and environmental signals, which are often processed combinatorially to generate specific genetic responses. Here we explore theoretically the potentials and limitations of combinatorial signal integration at the level of cis-regulatory transcription control. Our analysis suggests that many complex transcription-control functions of the type encountered in higher eukaryotes are already implementable within the much simpler bacterial transcription system. Using a quantitative model of bacterial transcription and invoking only specific protein-DNA interaction and weak glue-like interaction between regulatory proteins, we show explicit schemes to implement regulatory logic functions of increasing complexity by appropriately selecting the strengths and arranging the relative positions of the relevant protein-binding DNA sequences in the cis-regulatory region. The architectures that emerge are naturally modular and evolvable. Our results suggest that the transcription regulatory apparatus is a "programmable" computing machine, belonging formally to the class of Boltzmann machines. Crucial to our results is the ability to regulate gene expression at a distance. In bacteria, this can be achieved for isolated genes via DNA looping controlled by the dimerization of DNA-bound proteins. However, if adopted extensively in the genome, long-distance interaction can cause unintentional intergenic cross talk, a detrimental side effect difficult to overcome by the known bacterial transcription-regulation systems. This may be a key factor limiting the genome-wide adoption of complex transcription control in bacteria. Implications of our findings for combinatorial transcription control in eukaryotes are discussed.

Search all NobleBlocks papers mentioning “Center for Theoretical Biological Physics” →