
New York University Shanghai
UniversityShanghai, China
Research output, citation impact, and the most-cited recent papers from New York University Shanghai (China). Aggregated across the NobleBlocks index of 300M+ scholarly works.
Top-cited papers from New York University Shanghai
Molecular mechanics Poisson-Boltzmann surface area (MM/PBSA) and molecular mechanics generalized Born surface area (MM/GBSA) are arguably very popular methods for binding free energy prediction since they are more accurate than most scoring functions of molecular docking and less computationally demanding than alchemical free energy methods. MM/PBSA and MM/GBSA have been widely used in biomolecular studies such as protein folding, protein-ligand binding, protein-protein interaction, etc. In this review, methods to adjust the polar solvation energy and to improve the performance of MM/PBSA and MM/GBSA calculations are reviewed and discussed. The latest applications of MM/GBSA and MM/PBSA in drug design are also presented. This review intends to provide readers with guidance for practically applying MM/PBSA and MM/GBSA in drug design and related research fields.
. We further tested MN15 for 10 transition-metal coordination energies, the entire S66x8 database of noncovalent interactions, 21 transition-metal reaction barrier heights, 69 electronic excitation energies of organic molecules, 31 semiconductor band gaps, seven transition-metal dimer bond lengths, and 193 bond lengths of 47 organic molecules. The MN15 functional not only performs very well for our training set, which has 481 pieces of data, but also performs very well for our test set, which has 823 data that are not in our training set. The test set includes both ground-state properties and molecular excitation energies. For the latter MN15 achieves simultaneous accuracy for both valence and Rydberg electronic excitations when used with linear-response time-dependent density functional theory, with an MUE of less than 0.3 eV for both types of excitations.
Fine-grained classification is challenging because categories can only be discriminated by subtle and local differences. Variances in the pose, scale or rotation usually make the problem more difficult. Most fine-grained classification systems follow the pipeline of finding foreground object or object parts (where) to extract discriminative features (what). In this paper, we propose to apply visual attention to fine-grained classification task using deep neural network. Our pipeline integrates three types of attention: the bottom-up attention that propose candidate patches, the object-level top-down attention that selects relevant patches to a certain object, and the part-level top-down attention that localizes discriminative parts. We combine these attentions to train domain-specific deep nets, then use it to improve both the what and where aspects. Importantly, we avoid using expensive annotations like bounding box or part information from end-to-end. The weak supervision constraint makes our work easier to generalize. We have verified the effectiveness of the method on the subsets of ILSVRC2012 dataset and CUB200 2011 dataset. Our pipeline delivered significant improvements and achieved the best accuracy under the weakest supervision condition. The performance is competitive against other methods that rely on additional annotations.
A number of human cancers harbor somatic point mutations in the genes encoding isocitrate dehydrogenases 1 and 2 (IDH1 and IDH2). These mutations alter residues in the enzyme active sites and confer a gain-of-function in cancer cells, resulting in the accumulation and secretion of the oncometabolite (R)-2-hydroxyglutarate (2HG). We developed a small molecule, AGI-6780, that potently and selectively inhibits the tumor-associated mutant IDH2/R140Q. A crystal structure of AGI-6780 complexed with IDH2/R140Q revealed that the inhibitor binds in an allosteric manner at the dimer interface. The results of steady-state enzymology analysis were consistent with allostery and slow-tight binding by AGI-6780. Treatment with AGI-6780 induced differentiation of TF-1 erythroleukemia and primary human acute myelogenous leukemia cells in vitro. These data provide proof-of-concept that inhibitors targeting mutant IDH2/R140Q could have potential applications as a differentiation therapy for cancer.
We conduct a genome-wide association study (GWAS) of educational attainment (EA) in a sample of ~3 million individuals and identify 3,952 approximately uncorrelated genome-wide-significant single-nucleotide polymorphisms (SNPs). A genome-wide polygenic predictor, or polygenic index (PGI), explains 12-16% of EA variance and contributes to risk prediction for ten diseases. Direct effects (i.e., controlling for parental PGIs) explain roughly half the PGI's magnitude of association with EA and other phenotypes. The correlation between mate-pair PGIs is far too large to be consistent with phenotypic assortment alone, implying additional assortment on PGI-associated factors. In an additional GWAS of dominance deviations from the additive model, we identify no genome-wide-significant SNPs, and a separate X-chromosome additive GWAS identifies 57.
Last year, at least 30,000 scientific papers used the Kohn-Sham scheme of density functional theory to solve electronic structure problems in a wide variety of scientific fields. Machine learning holds the promise of learning the energy functional via examples, bypassing the need to solve the Kohn-Sham equations. This should yield substantial savings in computer time, allowing larger systems and/or longer time-scales to be tackled, but attempts to machine-learn this functional have been limited by the need to find its derivative. The present work overcomes this difficulty by directly learning the density-potential and energy-density maps for test systems and various molecules. We perform the first molecular dynamics simulation with a machine-learned density functional on malonaldehyde and are able to capture the intramolecular proton transfer process. Learning density models now allows the construction of accurate density functionals for realistic molecular systems.Machine learning allows electronic structure calculations to access larger system sizes and, in dynamical simulations, longer time scales. Here, the authors perform such a simulation using a machine-learned density functional that avoids direct solution of the Kohn-Sham equations.
The advent of inexpensive consumer virtual reality equipment enables many more researchers to study perception with naturally moving observers. One such system, the HTC Vive, offers a large field-of-view, high-resolution head mounted display together with a room-scale tracking system for less than a thousand U.S. dollars. If the position and orientation tracking of this system is of sufficient accuracy and precision, it could be suitable for much research that is currently done with far more expensive systems. Here we present a quantitative test of the HTC Vive's position and orientation tracking as well as its end-to-end system latency. We report that while the precision of the Vive's tracking measurements is high and its system latency (22 ms) is low, its position and orientation measurements are provided in a coordinate system that is tilted with respect to the physical ground plane. Because large changes in offset were found whenever tracking was briefly lost, it cannot be corrected for with a one-time calibration procedure. We conclude that the varying offset between the virtual and the physical tracking space makes the HTC Vive at present unsuitable for scientific experiments that require accurate visual stimulation of self-motion through a virtual world. It may however be suited for other experiments that do not have this requirement.
Efficient and reliable calculation of protein-ligand binding free energy is a grand challenge in computational biology and is of critical importance in drug design and many other molecular recognition problems. The main challenge lies in the calculation of entropic contribution to protein-ligand binding or interaction systems. In this report, we present a new interaction entropy method which is theoretically rigorous, computationally efficient, and numerically reliable for calculating entropic contribution to free energy in protein-ligand binding and other interaction processes. Drastically different from the widely employed but extremely expensive normal mode method for calculating entropy change in protein-ligand binding, the new method calculates the entropic component (interaction entropy or -TΔS) of the binding free energy directly from molecular dynamics simulation without any extra computational cost. Extensive study of over a dozen randomly selected protein-ligand binding systems demonstrated that this interaction entropy method is both computationally efficient and numerically reliable and is vastly superior to the standard normal mode approach. This interaction entropy paradigm introduces a novel and intuitive conceptual understanding of the entropic effect in protein-ligand binding and other general interaction systems as well as a practical method for highly efficient calculation of this effect.
Working memory (WM) is a cognitive function for temporary maintenance and manipulation of information, which requires conversion of stimulus-driven signals into internal representations that are maintained across seconds-long mnemonic delays. Within primate prefrontal cortex (PFC), a critical node of the brain's WM network, neurons show stimulus-selective persistent activity during WM, but many of them exhibit strong temporal dynamics and heterogeneity, raising the questions of whether, and how, neuronal populations in PFC maintain stable mnemonic representations of stimuli during WM. Here we show that despite complex and heterogeneous temporal dynamics in single-neuron activity, PFC activity is endowed with a population-level coding of the mnemonic stimulus that is stable and robust throughout WM maintenance. We applied population-level analyses to hundreds of recorded single neurons from lateral PFC of monkeys performing two seminal tasks that demand parametric WM: oculomotor delayed response and vibrotactile delayed discrimination. We found that the high-dimensional state space of PFC population activity contains a low-dimensional subspace in which stimulus representations are stable across time during the cue and delay epochs, enabling robust and generalizable decoding compared with time-optimized subspaces. To explore potential mechanisms, we applied these same population-level analyses to theoretical neural circuit models of WM activity. Three previously proposed models failed to capture the key population-level features observed empirically. We propose network connectivity properties, implemented in a linear network model, which can underlie these features. This work uncovers stable population-level WM representations in PFC, despite strong temporal neural dynamics, thereby providing insights into neural circuit mechanisms supporting WM.
Nearly twenty years ago, Dunitz and Bernstein described a selection of intriguing cases of polymorphs that disappear. The inability to obtain a crystal form that has previously been prepared is indeed a frustrating and potentially serious problem for solid-state scientists. This Review discusses recent occurrences and examples of disappearing polymorphs (as well as the emergence of elusive crystal forms) to demonstrate the enduring relevance of this troublesome, but always captivating, phenomenon in solid-state research. A number of these instances have been central issues in patent litigations. This Review, therefore, also highlights the complex relationship between crystal chemistry and the law.
Neuropsychiatric conditions like schizophrenia display a complex neurobiology, which has long been associated with distributed brain dysfunction. However, no investigation has tested whether schizophrenia shows alterations in global brain signal (GS), a signal derived from functional MRI and often discarded as a meaningless baseline in many studies. To evaluate GS alterations associated with schizophrenia, we studied two large chronic patient samples (n = 90, n = 71), comparing them to healthy subjects (n = 220) and patients diagnosed with bipolar disorder (n = 73). We identified and replicated increased cortical power and variance in schizophrenia, an effect predictive of symptoms yet obscured by GS removal. Voxel-wise signal variance was also increased in schizophrenia, independent of GS effects. Both findings were absent in bipolar patients, confirming diagnostic specificity. Biologically informed computational modeling of shared and nonshared signal propagation through the brain suggests that these findings may be explained by altered net strength of overall brain connectivity in schizophrenia.
Entropy effects play an important role in drug-target interactions, but the entropic contribution to ligand-binding affinity is often neglected by end-point binding free energy calculation methods, such as MM/GBSA and MM/PBSA, due to the expensive computational cost of normal mode analysis (NMA). Here, we systematically investigated entropy effects on the prediction power of MM/GBSA and MM/PBSA using >1500 protein-ligand systems and six representative AMBER force fields. Two computationally efficient methods, including NMA based on truncated structures and the interaction entropy approach, were used to estimate the entropic contributions to ligand-target binding free energies. In terms of the overall accuracy, we found that, for the minimized structures, in most cases the inclusion of the conformational entropies predicted by truncated NMA (enthalpynmode_min_9Å) compromises the overall accuracy of MM/GBSA and MM/PBSA compared with the enthalpies calculated based on the minimized structures (enthalpymin). However, for the MD trajectories, the binding free energies can be improved by the inclusion of the conformation entropies predicted by either truncated-NMA for a relatively high dielectric constant (εin = 4) or the interaction entropy method for εin = 1-4. In terms of reproducing the absolute binding free energies, the binding free energies estimated by including the truncated-NMA entropies based on the MD trajectories (ΔGnmode_md_9Å) give the lowest average absolute deviations against the experimental data among all the tested strategies for both MM/GBSA and MM/PBSA. Although the inclusion of the truncated NMA based on the MD trajectories (ΔGnmode_md_9Å) for a relatively high dielectric constant gave the overall best result and the lowest average absolute deviations against the experimental data (for the ff03 force field), it needs too much computational time. Alternatively, considering that the interaction entropy method does not incur any additional computational cost and can give comparable (at high dielectric constant, εin = 4) or even better (at low dielectric constant, εin = 1-2) results than the truncated-NMA entropy (ΔGnmode_md_9Å), the interaction entropy approach is recommended to estimate the entropic component for MM/GBSA and MM/PBSA based on MD trajectories, especially for a diverse dataset. Furthermore, we compared the predictions of MM/GBSA with six different AMBER force fields. The results show that the ff03 force field (ff03 for proteins and gaff with AM1-BCC charges for ligands) performs the best, but the predictions given by the tested force fields are comparable, implying that the MM/GBSA predictions are not very sensitive to force fields.
Abstract Kohn-Sham density functional theory (DFT) is a standard tool in most branches of chemistry, but accuracies for many molecules are limited to 2-3 kcal ⋅ mol −1 with presently-available functionals. Ab initio methods, such as coupled-cluster, routinely produce much higher accuracy, but computational costs limit their application to small molecules. In this paper, we leverage machine learning to calculate coupled-cluster energies from DFT densities, reaching quantum chemical accuracy (errors below 1 kcal ⋅ mol −1 ) on test data. Moreover, density-based Δ -learning (learning only the correction to a standard DFT calculation, termed Δ -DFT ) significantly reduces the amount of training data required, particularly when molecular symmetries are included. The robustness of Δ -DFT is highlighted by correcting “on the fly” DFT-based molecular dynamics (MD) simulations of resorcinol (C 6 H 4 (OH) 2 ) to obtain MD trajectories with coupled-cluster accuracy. We conclude, therefore, that Δ -DFT facilitates running gas-phase MD simulations with quantum chemical accuracy, even for strained geometries and conformer changes where standard DFT fails.
Abstract Recently, chimeric antigen receptor (CAR)-T cell therapy has shown great promise in treating haematological malignancies 1–7 . However, CAR-T cell therapy currently has several limitations 8–12 . Here we successfully developed a two-in-one approach to generate non-viral, gene-specific targeted CAR-T cells through CRISPR–Cas9. Using the optimized protocol, we demonstrated feasibility in a preclinical study by inserting an anti-CD19 CAR cassette into the AAVS1 safe-harbour locus. Furthermore, an innovative type of anti-CD19 CAR-T cell with PD1 integration was developed and showed superior ability to eradicate tumour cells in xenograft models. In adoptive therapy for relapsed/refractory aggressive B cell non-Hodgkin lymphoma (ClinicalTrials.gov, NCT04213469 ), we observed a high rate (87.5%) of complete remission and durable responses without serious adverse events in eight patients. Notably, these enhanced CAR-T cells were effective even at a low infusion dose and with a low percentage of CAR + cells. Single-cell analysis showed that the electroporation method resulted in a high percentage of memory T cells in infusion products, and PD1 interference enhanced anti-tumour immune functions, further validating the advantages of non-viral, PD1 -integrated CAR-T cells. Collectively, our results demonstrate the high safety and efficacy of non-viral, gene-specific integrated CAR-T cells, thus providing an innovative technology for CAR-T cell therapy.
The development of new protein–ligand scoring functions using machine learning algorithms, such as random forest, has been of significant interest. By efficiently utilizing expanded feature sets and a large set of experimental data, random forest based scoring functions (RFbScore) can achieve better correlations to experimental protein–ligand binding data with known crystal structures; however, more extensive tests indicate that such enhancement in scoring power comes with significant under-performance in docking and screening power tests compared to traditional scoring functions. In this work, to improve scoring-docking-screening powers of protein–ligand docking functions simultaneously, we have introduced a ΔvinaRF parameterization and feature selection framework based on random forest. Our developed scoring function ΔvinaRF20, which employs 20 descriptors in addition to the AutoDock Vina score, can achieve superior performance in all power tests of both CASF-2013 and CASF-2007 benchmarks compared to classical scoring functions. The ΔvinaRF20 scoring function and its code are freely available on the web at: https://www.nyu.edu/projects/yzhang/DeltaVina. © 2016 Wiley Periodicals, Inc.
Quantum states of quasiparticles in solids are dictated by symmetry. We have experimentally demonstrated quantum states of Dirac electrons in a two-dimensional quasicrystal without translational symmetry. A dodecagonal quasicrystalline order was realized by epitaxial growth of twisted bilayer graphene rotated exactly 30°. We grew the graphene quasicrystal up to a millimeter scale on a silicon carbide surface while maintaining the single rotation angle over an entire sample and successfully isolated the quasicrystal from a substrate, demonstrating its structural and chemical stability under ambient conditions. Multiple Dirac cones replicated with the 12-fold rotational symmetry were observed in angle-resolved photoemission spectra, which revealed anomalous strong interlayer coupling with quasi-periodicity. Our study provides a way to explore physical properties of relativistic fermions with controllable quasicrystalline orders.
Science's changing demographics raise new questions about research team diversity and research outcomes. We study mixed-gender research teams, examining 6.6 million papers published across the medical sciences since 2000 and establishing several core findings. First, the fraction of publications by mixed-gender teams has grown rapidly, yet mixed-gender teams continue to be underrepresented compared to the expectations of a null model. Second, despite their underrepresentation, the publications of mixed-gender teams are substantially more novel and impactful than the publications of same-gender teams of equivalent size. Third, the greater the gender balance on a team, the better the team scores on these performance measures. Fourth, these patterns generalize across medical subfields. Finally, the novelty and impact advantages seen with mixed-gender teams persist when considering numerous controls and potential related features, including fixed effects for the individual researchers, team structures, and network positioning, suggesting that a team's gender balance is an underrecognized yet powerful correlate of novel and impactful scientific discoveries.
The ability to simultaneously record from large numbers of neurons in behaving animals has ushered in a new era for the study of the neural circuit mechanisms underlying cognitive functions. One promising approach to uncovering the dynamical and computational principles governing population responses is to analyze model recurrent neural networks (RNNs) that have been optimized to perform the same tasks as behaving animals. Because the optimization of network parameters specifies the desired output but not the manner in which to achieve this output, "trained" networks serve as a source of mechanistic hypotheses and a testing ground for data analyses that link neural computation to behavior. Complete access to the activity and connectivity of the circuit, and the ability to manipulate them arbitrarily, make trained networks a convenient proxy for biological circuits and a valuable platform for theoretical investigation. However, existing RNNs lack basic biological features such as the distinction between excitatory and inhibitory units (Dale's principle), which are essential if RNNs are to provide insights into the operation of biological circuits. Moreover, trained networks can achieve the same behavioral performance but differ substantially in their structure and dynamics, highlighting the need for a simple and flexible framework for the exploratory training of RNNs. Here, we describe a framework for gradient descent-based training of excitatory-inhibitory RNNs that can incorporate a variety of biological knowledge. We provide an implementation based on the machine learning library Theano, whose automatic differentiation capabilities facilitate modifications and extensions. We validate this framework by applying it to well-known experimental paradigms such as perceptual decision-making, context-dependent integration, multisensory integration, parametric working memory, and motor sequence generation. Our results demonstrate the wide range of neural activity patterns and behavior that can be modeled, and suggest a unified setting in which diverse cognitive computations and mechanisms can be studied.
A new fluorescent polyimide covalent organic framework (PI-COF) has been successfully synthesized through solvothermal route using tetra(4-aminophenyl) porphyrin and perylenetracarboxylic dianhydride, which possesses porous crystalline and excellent thermal stability (>500 °C). Furthermore, few-layered PI covalent organic nanosheets (PI-CONs) can be easily obtained from the fluorescent PI-COF through a facile liquid phase exfoliation approach, which were confirmed by atomic force microscopy and transmission electron microscopy analysis. It is interesting that the fluorescent intensity of PI-CONs is obviously enhanced relative to that of PI-COF. The PI-CONs have been successfully utilized as an efficient fluorescent probe for the highly sensitive and selective detection of 2,4,6-trinitrophenol (TNP). The mechanism might be attributed to the combination of electron transfer and inner filter effect based on DFT calculations and spectral overlap data. The system exhibits a good linear response toward TNP over the range from 0.5 to 10 μM with a detection limit of 0.25 μM.
Combustion is a complex chemical system which involves thousands of chemical reactions and generates hundreds of molecular species and radicals during the process. In this work, a neural network-based molecular dynamics (MD) simulation is carried out to simulate the benchmark combustion of methane. During MD simulation, detailed reaction processes leading to the creation of specific molecular species including various intermediate radicals and the products are intimately revealed and characterized. Overall, a total of 798 different chemical reactions were recorded and some new chemical reaction pathways were discovered. We believe that the present work heralds the dawn of a new era in which neural network-based reactive MD simulation can be practically applied to simulating important complex reaction systems at ab initio level, which provides atomic-level understanding of chemical reaction processes as well as discovery of new reaction pathways at an unprecedented level of detail beyond what laboratory experiments could accomplish.