Genomics Institute of the Novartis Research Foundation
facilitySan Diego, California, United States
Research output, citation impact, and the most-cited recent papers from Genomics Institute of the Novartis Research Foundation (United States). Aggregated across the NobleBlocks index of 300M+ scholarly works.
Top-cited papers from Genomics Institute of the Novartis Research Foundation
A critical component in the interpretation of systems-level studies is the inference of enriched biological pathways and protein complexes contained within OMICs datasets. Successful analysis requires the integration of a broad set of current biological databases and the application of a robust analytical pipeline to produce readily interpretable results. Metascape is a web-based portal designed to provide a comprehensive gene list annotation and analysis resource for experimental biologists. In terms of design features, Metascape combines functional enrichment, interactome analysis, gene annotation, and membership search to leverage over 40 independent knowledgebases within one integrated portal. Additionally, it facilitates comparative analyses of datasets across multiple independent and orthogonal experiments. Metascape provides a significantly simplified user experience through a one-click Express Analysis interface to generate interpretable outputs. Taken together, Metascape is an effective and efficient tool for experimental biologists to comprehensively analyze and interpret OMICs-based studies in the big data era.
The tissue-specific pattern of mRNA expression can indicate important clues about gene function. High-density oligonucleotide arrays offer the opportunity to examine patterns of gene expression on a genome scale. Toward this end, we have designed custom arrays that interrogate the expression of the vast majority of protein-encoding human and mouse genes and have used them to profile a panel of 79 human and 61 mouse tissues. The resulting data set provides the expression patterns for thousands of predicted genes, as well as known and poorly characterized genes, from mice and humans. We have explored this data set for global trends in gene expression, evaluated commonly used lines of evidence in gene prediction methodologies, and investigated patterns indicative of chromosomal organization of transcription. We describe hundreds of regions of correlated transcription and show that some are subject to both tissue and parental allele-specific expression, suggesting a link between spatial expression and imprinting.
Mechanical stimuli drive many physiological processes, including touch and pain sensation, hearing, and blood pressure regulation. Mechanically activated (MA) cation channel activities have been recorded in many cells, but the responsible molecules have not been identified. We characterized a rapidly adapting MA current in a mouse neuroblastoma cell line. Expression profiling and RNA interference knockdown of candidate genes identified Piezo1 (Fam38A) to be required for MA currents in these cells. Piezo1 and related Piezo2 (Fam38B) are vertebrate multipass transmembrane proteins with homologs in invertebrates, plants, and protozoa. Overexpression of mouse Piezo1 or Piezo2 induced two kinetically distinct MA currents. Piezos are expressed in several tissues, and knockdown of Piezo2 in dorsal root ganglia neurons specifically reduced rapidly adapting MA currents. We propose that Piezos are components of MA cation channels.
Although it has long been recognized that the enteric community of bacteria that inhabit the human distal intestinal track broadly impacts human health, the biochemical details that underlie these effects remain largely undefined. Here, we report a broad MS-based metabolomics study that demonstrates a surprisingly large effect of the gut "microbiome" on mammalian blood metabolites. Plasma extracts from germ-free mice were compared with samples from conventional (conv) animals by using various MS-based methods. Hundreds of features were detected in only 1 sample set, with the majority of these being unique to the conv animals, whereas approximately 10% of all features observed in both sample sets showed significant changes in their relative signal intensity. Amino acid metabolites were particularly affected. For example, the bacterial-mediated production of bioactive indole-containing metabolites derived from tryptophan such as indoxyl sulfate and the antioxidant indole-3-propionic acid (IPA) was impacted. Production of IPA was shown to be completely dependent on the presence of gut microflora and could be established by colonization with the bacterium Clostridium sporogenes. Multiple organic acids containing phenyl groups were also greatly increased in the presence of gut microbes. A broad, drug-like phase II metabolic response of the host to metabolites generated by the microbiome was observed, suggesting that the gut microflora has a direct impact on the drug metabolism capacity of the host. Together, these results suggest a significant interplay between bacterial and mammalian metabolism.
The Bioperl project is an international open-source collaboration of biologists, bioinformaticians, and computer scientists that has evolved over the past 7 yr into the most comprehensive library of Perl modules available for managing and manipulating life-science information. Bioperl provides an easy-to-use, stable, and consistent programming interface for bioinformatics application programmers. The Bioperl modules have been successfully and repeatedly used to reduce otherwise complex tasks to only a few lines of code. The Bioperl object model has been proven to be flexible enough to support enterprise-level applications such as EnsEMBL, while maintaining an easy learning curve for novice Perl programmers. Bioperl is capable of executing analyses and processing results from programs such as BLAST, ClustalW, or the EMBOSS suite. Interoperation with modules written in Python and Java is supported through the evolving BioCORBA bridge. Bioperl provides access to data stores such as GenBank and SwissProt via a flexible series of sequence input/output modules, and to the emerging common sequence data storage format of the Open Bioinformatics Database Access project. This study describes the overall architecture of the toolkit, the problem domains that it addresses, and gives specific examples of how the toolkit can be used to solve common life-sciences problems. We conclude with a discussion of how the open-source nature of the project has contributed to the development effort. [Supplemental material is available online at www.genome.org . Bioperl is available as open-source software free of charge and is licensed under the Perl Artistic License ( http://www.perl.com/pub/a/language/misc/Artistic.html ). It is available for download at http://www.bioperl.org . Support inquiries should be addressed to bioperl-l@bioperl.org .]
Like most organisms, plants have endogenous biological clocks that coordinate internal events with the external environment. We used high-density oligonucleotide microarrays to examine gene expression in Arabidopsis and found that 6% of the more than 8000 genes on the array exhibited circadian changes in steady-state messenger RNA levels. Clusters of circadian-regulated genes were found in pathways involved in plant responses to light and other key metabolic pathways. Computational analysis of cycling genes allowed the identification of a highly conserved promoter motif that we found to be required for circadian control of gene expression. Our study presents a comprehensive view of the temporal compartmentalization of physiological pathways by the circadian clock in a eukaryote.
Only a small proportion of the mouse genome is transcribed into mature messenger RNA transcripts. There is an international collaborative effort to identify all full-length mRNA transcripts from the mouse, and to ensure that each is represented in a physical collection of clones. Here we report the manual annotation of 60,770 full-length mouse complementary DNA sequences. These are clustered into 33,409 'transcriptional units', contributing 90.1% of a newly established mouse transcriptome database. Of these transcriptional units, 4,258 are new protein-coding and 11,665 are new non-coding messages, indicating that non-coding RNA is a major component of the transcriptome. 41% of all transcriptional units showed evidence of alternative splicing. In protein-coding transcripts, 79% of splice variations altered the protein product. Whole-transcriptome analyses resulted in the identification of 2,431 sense-antisense pairs. The present work, completely supported by physical clones, provides the most comprehensive survey of a mammalian transcriptome so far, and is a valuable resource for functional genomics.
A unique transfer RNA (tRNA)/aminoacyl-tRNA synthetase pair has been generated that expands the number of genetically encoded amino acids in Escherichia coli. When introduced into E. coli, this pair leads to the in vivo incorporation of the synthetic amino acid O-methyl-l-tyrosine into protein in response to an amber nonsense codon. The fidelity of translation is greater than 99%, as determined by analysis of dihydrofolate reductase containing the unnatural amino acid. This approach should provide a general method for increasing the genetic repertoire of living cells to include a variety of amino acids with novel structural, chemical, and physical properties not found in the common 20 amino acids.
High-throughput gene expression profiling has become an important tool for investigating transcriptional activity in a variety of biological samples. To date, the vast majority of these experiments have focused on specific biological processes and perturbations. Here, we have generated and analyzed gene expression from a set of samples spanning a broad range of biological conditions. Specifically, we profiled gene expression from 91 human and mouse samples across a diverse array of tissues, organs, and cell lines. Because these samples predominantly come from the normal physiological state in the human and mouse, this dataset represents a preliminary, but substantial, description of the normal mammalian transcriptome. We have used this dataset to illustrate methods of mining these data, and to reveal insights into molecular and physiological gene function, mechanisms of transcriptional regulation, disease etiology, and comparative genomics. Finally, to allow the scientific community to use this resource, we have built a free and publicly accessible website (http://expression.gnf.org) that integrates data visualization and curation of current gene annotations.
OBJECTIVE: The authors tested whether the anti-interleukin (IL)-17A monoclonal antibody secukinumab was safe and effective for the treatment of active Crohn's disease. DESIGN: In a double-blind, randomised, placebo-controlled proof-of-concept study, 59 patients with moderate to severe Crohn's disease (Crohn's Disease Activity Index (CDAI) ≥220 to ≤450) were assigned in a 2:1 ratio to 2×10 mg/kg intravenous secukinumab or placebo. The primary end point, addressed by bayesian statistics augmented with historical placebo information, was the probability that secukinumab reduces the CDAI by ≥50 points more than placebo at week 6. Ancillary analyses explored associations of 35 candidate genetic polymorphisms and faecal calprotectin response. RESULTS: 59 patients (39 secukinumab, 20 placebo, mean baseline CDAI 307 and 301, respectively) were recruited. 18/59 (31%) patients discontinued prematurely (12/39 (31%) secukinumab, 6/20 (30%) placebo), 10/59 (17%) due to insufficient therapeutic effect (8/39 (21%) secukinumab, 2/20 (10%) placebo). Fourteen serious adverse events occurred in 10 patients (seven secukinumab, three placebo); 20 infections, including four local fungal infections, were seen on secukinumab versus none on placebo. Primary end point analysis estimated <0.1% probability (CDAI (SD) =33.9 (19.7), 95% credible interval -4.9 to 72.9) that secukinumab reduces CDAI by ≥50 points more than placebo. Secondary area under the curve analysis (weeks 4-10) showed a significant difference (mean ΔCDAI=49; 95% CI (2 to 96), p=0.043) in favour of placebo. Post hoc subgroup analysis showed that unfavourable responses on secukinumab were driven by patients with elevated inflammatory markers (CRP≥10 mg/l and/or faecal calprotectin≥200 ng/ml; mean ΔCDAI=62; 95% CI (-1 to 125), p=0.054 in favour of placebo). Absence of the minor allele of tumour necrosis factor-like ligand 1A was strongly associated with lack of response measured by baseline-adjusted changes in calprotectin at week 6 (p=0.00035 Bonferroni-corrected). CONCLUSIONS: Blockade of IL-17A was ineffective and higher rates of adverse events were noted compared with placebo. CLINICAL TRIAL REGISTRATION: This trial was registered at ClinicalTrial.gov with the number NCT01009281.
Specialized peripheral sensory neurons known as nociceptors alert us to potentially damaging stimuli at the skin by detecting extremes in temperature and pressure and injury-related chemicals, and transducing these stimuli into long-ranging electrical signals that are relayed to higher brain centers. The activation of functionally distinct cutaneous nociceptor populations and the processing of information they convey provide a rich diversity of pain qualities. Current work in this field is providing researchers with a more thorough understanding of nociceptor cell biology at molecular and systems levels and insight that will allow the targeted design of novel pain therapeutics.
Online gene annotation resources are indispensable for analysis of genomics data. However, the landscape of these online resources is highly fragmented, and scientists often visit dozens of these sites for each gene in a candidate gene list. Here, we introduce BioGPS http://biogps.gnf.org, a centralized gene portal for aggregating distributed gene annotation resources. Moreover, BioGPS embraces the principle of community intelligence, enabling any user to easily and directly contribute to the BioGPS platform.
The completion of the genome sequence for Plasmodium falciparum, the species responsible for most malaria human deaths, has the potential to reveal hundreds of new drug targets and proteins involved in pathogenesis. However, only approximately 35% of the genes code for proteins with an identifiable function. The absence of routine genetic tools for studying Plasmodium parasites suggests that this number is unlikely to change quickly if conventional serial methods are used to characterize encoded proteins. Here, we use a high-density oligonucleotide array to generate expression profiles of human and mosquito stages of the malaria parasite's life cycle. Genes with highly correlated levels and temporal patterns of expression were often involved in similar functions or cellular processes.
Recent reports of increased tolerance to artemisinin derivatives--the most recently adopted class of antimalarials--have prompted a need for new treatments. The spirotetrahydro-beta-carbolines, or spiroindolones, are potent drugs that kill the blood stages of Plasmodium falciparum and Plasmodium vivax clinical isolates at low nanomolar concentration. Spiroindolones rapidly inhibit protein synthesis in P. falciparum, an effect that is ablated in parasites bearing nonsynonymous mutations in the gene encoding the P-type cation-transporter ATPase4 (PfATP4). The optimized spiroindolone NITD609 shows pharmacokinetic properties compatible with once-daily oral dosing and has single-dose efficacy in a rodent malaria model.
Neuropathological and brain imaging studies suggest that schizophrenia may result from neurodevelopmental defects. Cytoarchitectural studies indicate cellular abnormalities suggestive of a disruption in neuronal connectivity in schizophrenia, particularly in the dorsolateral prefrontal cortex. Yet, the molecular mechanisms underlying these findings remain unclear. To identify molecular substrates associated with schizophrenia, DNA microarray analysis was used to assay gene expression levels in postmortem dorsolateral prefrontal cortex of schizophrenic and control patients. Genes determined to have altered expression levels in schizophrenics relative to controls are involved in a number of biological processes, including synaptic plasticity, neuronal development, neurotransmission, and signal transduction. Most notable was the differential expression of myelination-related genes suggesting a disruption in oligodendrocyte function in schizophrenia.
We have examined the progression of hepatitis C virus (HCV) infections by gene expression analysis of liver biopsies in acutely infected chimpanzees that developed persistent infection, transient viral clearance, or sustained clearance. Both common responses and outcome-specific changes in expression were observed. All chimpanzees showed gene expression patterns consistent with an IFN-alpha response that correlated with the magnitude and duration of infection. Transient and sustained viral clearance were uniquely associated with induction of IFN-gamma-induced genes and other genes involved in antigen processing and presentation and the adaptive immune response. During the early stages of infection, host genes involved in lipid metabolism were also differentially regulated. We also show that drugs that affect these biosynthetic pathways can regulate HCV replication in HCV replicon systems. Our results reveal genome-wide transcriptional changes that reflect the establishment, spread, and control of infection, and they reveal potentially unique antiviral programs associated with clearance of HCV infection.
Circadian timing is generated through a unique series of autoregulatory interactions termed the molecular clock. Behavioral rhythms subject to the molecular clock are well characterized. We demonstrate a role for Bmal1 and Clock in the regulation of glucose homeostasis. Inactivation of the known clock components Bmal1 (Mop3) and Clock suppress the diurnal variation in glucose and triglycerides. Gluconeogenesis is abolished by deletion of Bmal1 and is depressed in Clock mutants, but the counterregulatory response of corticosterone and glucagon to insulin-induced hypoglycaemia is retained. Furthermore, a high-fat diet modulates carbohydrate metabolism by amplifying circadian variation in glucose tolerance and insulin sensitivity, and mutation of Clock restores the chow-fed phenotype. Bmal1 and Clock, genes that function in the core molecular clock, exert profound control over recovery from insulin-induced hypoglycaemia. Furthermore, asynchronous dietary cues may modify glucose homeostasis via their interactions with peripheral molecular clocks.
Although practiced clinically for more than 40 years, the use of hematopoietic stem cell (HSC) transplants remains limited by the ability to expand these cells ex vivo. An unbiased screen with primary human HSCs identified a purine derivative, StemRegenin 1 (SR1), that promotes the ex vivo expansion of CD34+ cells. Culture of HSCs with SR1 led to a 50-fold increase in cells expressing CD34 and a 17-fold increase in cells that retain the ability to engraft immunodeficient mice. Mechanistic studies show that SR1 acts by antagonizing the aryl hydrocarbon receptor (AHR). The identification of SR1 and AHR modulation as a means to induce ex vivo HSC expansion should facilitate the clinical use of HSC therapy.
Hormones and nutrients often induce genetic programs via signaling pathways that interface with gene-specific activators. Activation of the cAMP pathway, for example, stimulates cellular gene expression by means of the PKA-mediated phosphorylation of cAMP-response element binding protein (CREB) at Ser-133. Here, we use genome-wide approaches to characterize target genes that are regulated by CREB in different cellular contexts. CREB was found to occupy approximately 4,000 promoter sites in vivo, depending on the presence and methylation state of consensus cAMP response elements near the promoter. The profiles for CREB occupancy were very similar in different human tissues, and exposure to a cAMP agonist stimulated CREB phosphorylation over a majority of these sites. Only a small proportion of CREB target genes was induced by cAMP in any cell type, however, due in part to the preferential recruitment of the coactivator CREB-binding protein to those promoters. These results indicate that CREB phosphorylation alone is not a reliable predictor of target gene activation and that additional CREB regulatory partners are required for recruitment of the transcriptional apparatus to the promoter.
Mechanical and thermal cues stimulate a specialized group of sensory neurons that terminate in the skin. Three members of the transient receptor potential (TRP) family of channels are expressed in subsets of these neurons and are activated at distinct physiological temperatures. Here, we describe the cloning and characterization of a novel thermosensitive TRP channel. TRPV3 has a unique threshold: It is activated at innocuous (warm) temperatures and shows an increased response at noxious temperatures. TRPV3 is specifically expressed in keratinocytes; hence, skin cells are capable of detecting heat via molecules similar to those in heat-sensing neurons.