Genomics England
otherLondon, United Kingdom
Research output, citation impact, and the most-cited recent papers from Genomics England (United Kingdom). Aggregated across the NobleBlocks index of 300M+ scholarly works.
Top-cited papers from Genomics England
Bromodomains (BRDs) are protein interaction modules that specifically recognize ε-N-lysine acetylation motifs, a key event in the reading process of epigenetic marks. The 61 BRDs in the human genome cluster into eight families based on structure/sequence similarity. Here, we present 29 high-resolution crystal structures, covering all BRD families. Comprehensive crossfamily structural analysis identifies conserved and family-specific structural features that are necessary for specific acetylation-dependent substrate recognition. Screening of more than 30 representative BRDs against systematic histone-peptide arrays identifies new BRD substrates and reveals a strong influence of flanking posttranslational modifications, such as acetylation and phosphorylation, suggesting that BRDs recognize combinations of marks rather than singly acetylated sequences. We further uncovered a structural mechanism for the simultaneous binding and recognition of diverse diacetyl-containing peptides by BRD4. These data provide a foundation for structure-based drug design of specific inhibitors for this emerging target family.
Plastic pollution is a pervasive and growing problem. To estimate the effectiveness of interventions to reduce plastic pollution, we modeled stocks and flows of municipal solid waste and four sources of microplastics through the global plastic system for five scenarios between 2016 and 2040. Implementing all feasible interventions reduced plastic pollution by 40% from 2016 rates and 78% relative to "business as usual" in 2040. Even with immediate and concerted action, 710 million metric tons of plastic waste cumulatively entered aquatic and terrestrial ecosystems. To avoid a massive build-up of plastic in the environment, coordinated global action is urgently needed to reduce plastic consumption; increase rates of reuse, waste collection, and recycling; expand safe disposal systems; and accelerate innovation in the plastic value chain.
Host-mediated lung inflammation is present 1 , and drives mortality 2 , in the critical illness caused by coronavirus disease 2019 (COVID-19). Host genetic variants associated with critical illness may identify mechanistic targets for therapeutic development 3 . Here we report the results of the GenOMICC (Genetics Of Mortality In Critical Care) genome-wide association study in 2,244 critically ill patients with COVID-19 from 208 UK intensive care units. We have identified and replicated the following new genome-wide significant associations: on chromosome 12q24.13 (rs10735079, P = 1.65 10 -8 ) in a gene cluster that encodes antiviral restriction enzyme activators (OAS1, OAS2 and OAS3); on chromosome 19p13.2 (rs74956615, P = 2.3 10 -8 ) near the gene that encodes tyrosine kinase 2 (TYK2); on chromosome 19p13.3 (rs2109069, P = 3.98 10 -12 ) within the gene that encodes dipeptidyl peptidase 9 (DPP9); and on chromosome 21q22.1 (rs2236757, P = 4.99 10 -8 ) in the interferon receptor gene IFNAR2. We identified potential targets for repurposing of licensed medications: using Mendelian randomization, we found evidence that low expression of IFNAR2, or high expression of TYK2, are associated with life-threatening disease; and transcriptome-wide association in lung tissue revealed that high expression of the monocyte-macrophage chemotactic receptor CCR2 is associated with severe COVID-19. Our results identify robust genetic signals relating to key host antiviral defence mechanisms and mediators of inflammatory organ damage in COVID-19. Both mechanisms may be amenable to targeted treatment with existing drugs. However, large-scale randomized clinical trials will be essential before any change to clinical practice.
SUMMARY: The Illumina Infinium HumanMethylationEPIC BeadChip is the new platform for high-throughput DNA methylation analysis, effectively doubling the coverage compared to the older 450 K array. Here we present a significantly updated and improved version of the Bioconductor package ChAMP, which can be used to analyze EPIC and 450k data. Many enhanced functionalities have been added, including correction for cell-type heterogeneity, network analysis and a series of interactive graphical user interfaces. AVAILABILITY AND IMPLEMENTATION: ChAMP is a BioC package available from https://bioconductor.org/packages/release/bioc/html/ChAMP.html. CONTACT: a.teschendorff@ucl.ac.uk or s.beck@ucl.ac.uk or a.feber@ucl.ac.uk. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
Heavy alcohol consumption is an established risk factor for hypertension; the mechanism by which alcohol consumption impact blood pressure (BP) regulation remains unknown. We hypothesized that a genome-wide association study accounting for gene-alcohol consumption interaction for BP might identify additional BP loci and contribute to the understanding of alcohol-related BP regulation. We conducted a large two-stage investigation incorporating joint testing of main genetic effects and single nucleotide variant (SNV)-alcohol consumption interactions. In Stage 1, genome-wide discovery meta-analyses in ≈131K individuals across several ancestry groups yielded 3,514 SNVs (245 loci) with suggestive evidence of association (P < 1.0 x 10-5). In Stage 2, these SNVs were tested for independent external replication in ≈440K individuals across multiple ancestries. We identified and replicated (at Bonferroni correction threshold) five novel BP loci (380 SNVs in 21 genes) and 49 previously reported BP loci (2,159 SNVs in 109 genes) in European ancestry, and in multi-ancestry meta-analyses (P < 5.0 x 10-8). For African ancestry samples, we detected 18 potentially novel BP loci (P < 5.0 x 10-8) in Stage 1 that warrant further replication. Additionally, correlated meta-analysis identified eight novel BP loci (11 genes). Several genes in these loci (e.g., PINX1, GATA4, BLK, FTO and GABBR2) have been previously reported to be associated with alcohol consumption. These findings provide insights into the role of alcohol consumption in the genetic architecture of hypertension.
Abstract The genetic make-up of an individual contributes to the susceptibility and response to viral infection. Although environmental, clinical and social factors have a role in the chance of exposure to SARS-CoV-2 and the severity of COVID-19 1,2 , host genetics may also be important. Identifying host-specific genetic factors may reveal biological mechanisms of therapeutic relevance and clarify causal relationships of modifiable environmental risk factors for SARS-CoV-2 infection and outcomes. We formed a global network of researchers to investigate the role of human genetics in SARS-CoV-2 infection and COVID-19 severity. Here we describe the results of three genome-wide association meta-analyses that consist of up to 49,562 patients with COVID-19 from 46 studies across 19 countries. We report 13 genome-wide significant loci that are associated with SARS-CoV-2 infection or severe manifestations of COVID-19. Several of these loci correspond to previously documented associations to lung or autoimmune and inflammatory diseases 3–7 . They also represent potentially actionable mechanisms in response to infection. Mendelian randomization analyses support a causal role for smoking and body-mass index for severe COVID-19 although not for type II diabetes. The identification of novel host genetic factors associated with COVID-19 was made possible by the community of human genetics researchers coming together to prioritize the sharing of data, results, resources and analytical frameworks. This working model of international collaboration underscores what is possible for future genetic discoveries in emerging pandemics, or indeed for any complex human disease.
INTRODUCTION The cerebral cortex underlies our complex cognitive capabilities. Variations in human cortical surface area and thickness are associated with neurological, psychological, and behavioral traits and can be measured in vivo by magnetic resonance imaging (MRI). Studies in model organisms have identified genes that influence cortical structure, but little is known about common genetic variants that affect human cortical structure. RATIONALE To identify genetic variants associated with human cortical structure at both global and regional levels, we conducted a genome-wide association meta-analysis of brain MRI data from 51,665 individuals across 60 cohorts. We analyzed the surface area and average thickness of the whole cortex and 34 cortical regions with known functional specializations. RESULTS We identified 369 nominally genome-wide significant loci ( P < 5 × 10 −8 ) associated with cortical structure in a discovery sample of 33,992 participants of European ancestry. Of the 360 loci for which replication data were available, 241 loci influencing surface area and 66 influencing thickness remained significant after replication, with 237 loci passing multiple testing correction ( P < 8.3 × 10 −10 ; 187 influencing surface area and 50 influencing thickness). Common genetic variants explained 34% (SE = 3%) of the variation in total surface area and 26% (SE = 2%) in average thickness; surface area and thickness showed a negative genetic correlation ( r G = −0.32, SE = 0.05, P = 6.5 × 10 −12 ), which suggests that genetic influences have opposing effects on surface area and thickness. Bioinformatic analyses showed that total surface area is influenced by genetic variants that alter gene regulatory activity in neural progenitor cells during fetal development. By contrast, average thickness is influenced by active regulatory elements in adult brain samples, which may reflect processes that occur after mid-fetal development, such as myelination, branching, or pruning. When considered together, these results support the radial unit hypothesis that different developmental mechanisms promote surface area expansion and increases in thickness. To identify specific genetic influences on individual cortical regions, we controlled for global measures (total surface area or average thickness) in the regional analyses. After multiple testing correction, we identified 175 loci that influence regional surface area and 46 that influence regional thickness. Loci that affect regional surface area cluster near genes involved in the Wnt signaling pathway, which is known to influence areal identity. We observed significant positive genetic correlations and evidence of bidirectional causation of total surface area with both general cognitive functioning and educational attainment. We found additional positive genetic correlations between total surface area and Parkinson’s disease but did not find evidence of causation. Negative genetic correlations were evident between total surface area and insomnia, attention deficit hyperactivity disorder, depressive symptoms, major depressive disorder, and neuroticism. CONCLUSION This large-scale collaborative work enhances our understanding of the genetic architecture of the human cerebral cortex and its regional patterning. The highly polygenic architecture of the cortex suggests that distinct genes are involved in the development of specific cortical areas. Moreover, we find evidence that brain structure is a key phenotype along the causal pathway that leads from genetic variation to differences in general cognitive function. Identifying genetic influences on human cortical structure. ( A ) Measurement of cortical surface area and thickness from MRI. ( B ) Genomic locations of common genetic variants that influence global and regional cortical structure. ( C ) Our results support the radial unit hypothesis that the expansion of cortical surface area is driven by proliferating neural progenitor cells. ( D ) Cortical surface area shows genetic correlation with psychiatric and cognitive traits. Error bars indicate SE. IMAGE CREDITS: (A) K. COURTNEY; (C) M. R. GLASS
BACKGROUND: The prevailing paradigm of host-parasite evolution is that arms races lead to increasing specialisation via genetic adaptation. Insect herbivores are no exception and the majority have evolved to colonise a small number of closely related host species. Remarkably, the green peach aphid, Myzus persicae, colonises plant species across 40 families and single M. persicae clonal lineages can colonise distantly related plants. This remarkable ability makes M. persicae a highly destructive pest of many important crop species. RESULTS: To investigate the exceptional phenotypic plasticity of M. persicae, we sequenced the M. persicae genome and assessed how one clonal lineage responds to host plant species of different families. We show that genetically identical individuals are able to colonise distantly related host species through the differential regulation of genes belonging to aphid-expanded gene families. Multigene clusters collectively upregulate in single aphids within two days upon host switch. Furthermore, we demonstrate the functional significance of this rapid transcriptional change using RNA interference (RNAi)-mediated knock-down of genes belonging to the cathepsin B gene family. Knock-down of cathepsin B genes reduced aphid fitness, but only on the host that induced upregulation of these genes. CONCLUSIONS: Previous research has focused on the role of genetic adaptation of parasites to their hosts. Here we show that the generalist aphid pest M. persicae is able to colonise diverse host plant species in the absence of genetic specialisation. This is achieved through rapid transcriptional plasticity of genes that have duplicated during aphid evolution.
To characterize type 2 diabetes (T2D)-associated variation across the allele frequency spectrum, we conducted a meta-analysis of genome-wide association data from 26,676 T2D case and 132,532 control subjects of European ancestry after imputation using the 1000 Genomes multiethnic reference panel. Promising association signals were followed up in additional data sets (of 14,545 or 7,397 T2D case and 38,994 or 71,604 control subjects). We identified 13 novel T2D-associated loci (P &lt; 5 × 10−8), including variants near the GLP2R, GIP, and HLA-DQA1 genes. Our analysis brought the total number of independent T2D associations to 128 distinct signals at 113 loci. Despite substantially increased sample size and more complete coverage of low-frequency variation, all novel associations were driven by common single nucleotide variants. Credible sets of potentially causal variants were generally larger than those based on imputation with earlier reference panels, consistent with resolution of causal signals to common risk haplotypes. Stratification of T2D-associated loci based on T2D-related quantitative trait associations revealed tissue-specific enrichment of regulatory annotations in pancreatic islet enhancers for loci influencing insulin secretion and in adipocytes, monocytes, and hepatocytes for insulin action–associated loci. These findings highlight the predominant role played by common variants of modest effect and the diversity of biological mechanisms influencing T2D pathophysiology.
Deep phenotyping has been defined as the precise and comprehensive analysis of phenotypic abnormalities in which the individual components of the phenotype are observed and described. The three components of the Human Phenotype Ontology (HPO; www.human-phenotype-ontology.org) project are the phenotype vocabulary, disease-phenotype annotations and the algorithms that operate on these. These components are being used for computational deep phenotyping and precision medicine as well as integration of clinical data into translational research. The HPO is being increasingly adopted as a standard for phenotypic abnormalities by diverse groups such as international rare disease organizations, registries, clinical labs, biomedical resources, and clinical software tools and will thereby contribute toward nascent efforts at global data exchange for identifying disease etiologies. This update article reviews the progress of the HPO project since the debut Nucleic Acids Research database article in 2014, including specific areas of expansion such as common (complex) disease, new algorithms for phenotype driven genomic discovery and diagnostics, integration of cross-species mapping efforts with the Mammalian Phenotype Ontology, an improved quality control pipeline, and the addition of patient-friendly terminology.
T cells) from up to 197 individuals. We assess, quantitatively, the relative contribution of cis-genetic and epigenetic factors to transcription and evaluate their impact as potential sources of confounding in epigenome-wide association studies. Further, we characterize highly coordinated genetic effects on gene expression, methylation, and histone variation through quantitative trait locus (QTL) mapping and allele-specific (AS) analyses. Finally, we demonstrate colocalization of molecular trait QTLs at 345 unique immune disease loci. This expansive, high-resolution atlas of multi-omics changes yields insights into cell-type-specific correlation between diverse genomic inputs, more generalizable correlations between these inputs, and defines molecular events that may underpin complex disease risk.
The Human Phenotype Ontology (HPO)-a standardized vocabulary of phenotypic abnormalities associated with 7000+ diseases-is used by thousands of researchers, clinicians, informaticians and electronic health record systems around the world. Its detailed descriptions of clinical abnormalities and computable disease definitions have made HPO the de facto standard for deep phenotyping in the field of rare disease. The HPO's interoperability with other ontologies has enabled it to be used to improve diagnostic accuracy by incorporating model organism data. It also plays a key role in the popular Exomiser tool, which identifies potential disease-causing variants from whole-exome or whole-genome sequencing data. Since the HPO was first introduced in 2008, its users have become both more numerous and more diverse. To meet these emerging needs, the project has added new content, language translations, mappings and computational tooling, as well as integrations with external community data. The HPO continues to collaborate with clinical adopters to improve specific areas of the ontology and extend standardized disease descriptions. The newly redesigned HPO website (www.human-phenotype-ontology.org) simplifies browsing terms and exploring clinical features, diseases, and human genes.
We conduct a genome-wide association study (GWAS) of educational attainment (EA) in a sample of ~3 million individuals and identify 3,952 approximately uncorrelated genome-wide-significant single-nucleotide polymorphisms (SNPs). A genome-wide polygenic predictor, or polygenic index (PGI), explains 12-16% of EA variance and contributes to risk prediction for ten diseases. Direct effects (i.e., controlling for parental PGIs) explain roughly half the PGI's magnitude of association with EA and other phenotypes. The correlation between mate-pair PGIs is far too large to be consistent with phenotypic assortment alone, implying additional assortment on PGI-associated factors. In an additional GWAS of dominance deviations from the additive model, we identify no genome-wide-significant SNPs, and a separate X-chromosome additive GWAS identifies 57.
BACKGROUND: A new variant of SARS-CoV-2, B.1.1.7, emerged as the dominant cause of COVID-19 disease in the UK from November, 2020. We report a post-hoc analysis of the efficacy of the adenoviral vector vaccine, ChAdOx1 nCoV-19 (AZD1222), against this variant. METHODS: Volunteers (aged ≥18 years) who were enrolled in phase 2/3 vaccine efficacy studies in the UK, and who were randomly assigned (1:1) to receive ChAdOx1 nCoV-19 or a meningococcal conjugate control (MenACWY) vaccine, provided upper airway swabs on a weekly basis and also if they developed symptoms of COVID-19 disease (a cough, a fever of 37·8°C or higher, shortness of breath, anosmia, or ageusia). Swabs were tested by nucleic acid amplification test (NAAT) for SARS-CoV-2 and positive samples were sequenced through the COVID-19 Genomics UK consortium. Neutralising antibody responses were measured using a live-virus microneutralisation assay against the B.1.1.7 lineage and a canonical non-B.1.1.7 lineage (Victoria). The efficacy analysis included symptomatic COVID-19 in seronegative participants with a NAAT positive swab more than 14 days after a second dose of vaccine. Participants were analysed according to vaccine received. Vaccine efficacy was calculated as 1 - relative risk (ChAdOx1 nCoV-19 vs MenACWY groups) derived from a robust Poisson regression model. This study is continuing and is registered with ClinicalTrials.gov, NCT04400838, and ISRCTN, 15281137. FINDINGS: Participants in efficacy cohorts were recruited between May 31 and Nov 13, 2020, and received booster doses between Aug 3 and Dec 30, 2020. Of 8534 participants in the primary efficacy cohort, 6636 (78%) were aged 18-55 years and 5065 (59%) were female. Between Oct 1, 2020, and Jan 14, 2021, 520 participants developed SARS-CoV-2 infection. 1466 NAAT positive nose and throat swabs were collected from these participants during the trial. Of these, 401 swabs from 311 participants were successfully sequenced. Laboratory virus neutralisation activity by vaccine-induced antibodies was lower against the B.1.1.7 variant than against the Victoria lineage (geometric mean ratio 8·9, 95% CI 7·2-11·0). Clinical vaccine efficacy against symptomatic NAAT positive infection was 70·4% (95% CI 43·6-84·5) for B.1.1.7 and 81·5% (67·9-89·4) for non-B.1.1.7 lineages. INTERPRETATION: ChAdOx1 nCoV-19 showed reduced neutralisation activity against the B.1.1.7 variant compared with a non-B.1.1.7 variant in vitro, but the vaccine showed efficacy against the B.1.1.7 variant of SARS-CoV-2. FUNDING: UK Research and Innovation, National Institute for Health Research (NIHR), Coalition for Epidemic Preparedness Innovations, NIHR Oxford Biomedical Research Centre, Thames Valley and South Midlands NIHR Clinical Research Network, and AstraZeneca.
Significance: Metabolic syndrome is a frequent precursor of type 2 diabetes mellitus (T2D), a disease that currently affects ∼8% of the adult population worldwide. Pancreatic beta-cell dysfunction and loss are central to the disease process, although understanding of the underlying molecular mechanisms is still fragmentary. Recent Advances: Oversupply of nutrients, including glucose and fatty acids, and the subsequent overstimulation of beta cells, are believed to be an important contributor to insulin secretory failure in T2D. Hypoxia has also recently been implicated in beta-cell damage. Accumulating evidence points to a role for oxidative stress in both processes. Although the production of reactive oxygen species (ROS) results from enhanced mitochondrial respiration during stimulation with glucose and other fuels, the expression of antioxidant defense genes is unusually low (or disallowed) in beta cells. Critical Issues: Not all subjects with metabolic syndrome and hyperglycemia go on to develop full-blown diabetes, implying an important role in disease risk for gene–environment interactions. Possession of common risk alleles at the SLC30A8 locus, encoding the beta-cell granule zinc transporter ZnT8, may affect cytosolic Zn 2+ concentrations and thus susceptibility to hypoxia and oxidative stress. Future Directions: Loss of normal beta-cell function, rather than total mass, is increasingly considered to be the major driver for impaired insulin secretion in diabetes. Better understanding of the role of oxidative changes, its modulation by genes involved in disease risk, and effects on beta-cell identity may facilitate the development of new therapeutic strategies to this disease. Antioxid. Redox Signal . 26, 501–518.
OBJECTIVE: To evaluate the extent to which circulating biomarker and supplements of vitamin D are associated with mortality from cardiovascular, cancer, or other conditions, under various circumstances. DESIGN: Systematic review and meta-analysis of observational studies and randomised controlled trials. DATA SOURCES: Medline, Embase, Cochrane Library, and reference lists of relevant studies to August 2013; correspondance with investigators. STUDY SELECTION: Observational cohort studies and randomised controlled trials in adults, which reported associations between vitamin D (measured as circulating 25-hydroxyvitamin D concentration or vitamin D supplement given singly) and cause specific mortality outcomes. DATA EXTRACTION: Data were extracted by two independent investigators, and a consensus was reached with involvement of a third. Study specific relative risks from 73 cohort studies (849,412 participants) and 22 randomised controlled trials (vitamin D given alone versus placebo or no treatment; 30,716 participants) were meta-analysed using random effects models and were grouped by study and population characteristics. RESULTS: In the primary prevention observational studies, comparing bottom versus top thirds of baseline circulating 25-hydroxyvitamin D distribution, pooled relative risks were 1.35 (95% confidence interval 1.13 to 1.61) for death from cardiovascular disease, 1.14 (1.01 to 1.29) for death from cancer, 1.30 (1.07 to 1.59) for non-vascular, non-cancer death, and 1.35 (1.22 to 1.49) for all cause mortality. Subgroup analyses in the observational studies indicated that risk of mortality was significantly higher in studies with lower baseline use of vitamin D supplements. In randomised controlled trials, relative risks for all cause mortality were 0.89 (0.80 to 0.99) for vitamin D3 supplementation and 1.04 (0.97 to 1.11) for vitamin D2 supplementation. The effects observed for vitamin D3 supplementation remained unchanged when grouped by various characteristics. However, for vitamin D2 supplementation, increased risks of mortality were observed in studies with lower intervention doses and shorter average intervention periods. CONCLUSIONS: Evidence from observational studies indicates inverse associations of circulating 25-hydroxyvitamin D with risks of death due to cardiovascular disease, cancer, and other causes. Supplementation with vitamin D3 significantly reduces overall mortality among older adults; however, before any widespread supplementation, further investigations will be required to establish the optimal dose and duration and whether vitamin D3 and D2 have different effects on mortality risk.
COVID-19, caused by SARS-CoV-2, lacks effective therapeutics. Additionally, no antiviral drugs or vaccines were developed against the closely related coronavirus, SARS-CoV-1 or MERS-CoV, despite previous zoonotic outbreaks. To identify starting points for such therapeutics, we performed a large-scale screen of electrophile and non-covalent fragments through a combined mass spectrometry and X-ray approach against the SARS-CoV-2 main protease, one of two cysteine viral proteases essential for viral replication. Our crystallographic screen identified 71 hits that span the entire active site, as well as 3 hits at the dimer interface. These structures reveal routes to rapidly develop more potent inhibitors through merging of covalent and non-covalent fragment hits; one series of low-reactivity, tractable covalent fragments were progressed to discover improved binders. These combined hits offer unprecedented structural and reactivity information for on-going structure-based drug design against SARS-CoV-2 main protease.
The NLRP3 inflammasome controls interleukin-1β maturation in antigen-presenting cells, but a direct role for NLRP3 in human adaptive immune cells has not been described. We found that the NLRP3 inflammasome assembles in human CD4(+) T cells and initiates caspase-1-dependent interleukin-1β secretion, thereby promoting interferon-γ production and T helper 1 (T(H)1) differentiation in an autocrine fashion. NLRP3 assembly requires intracellular C5 activation and stimulation of C5a receptor 1 (C5aR1), which is negatively regulated by surface-expressed C5aR2. Aberrant NLRP3 activity in T cells affects inflammatory responses in human autoinflammatory disease and in mouse models of inflammation and infection. Our results demonstrate that NLRP3 inflammasome activity is not confined to "innate immune cells" but is an integral component of normal adaptive T(H)1 responses.
Protein tyrosine phosphatases (PTPs) play a critical role in regulating cellular functions by selectively dephosphorylating their substrates. Here we present 22 human PTP crystal structures that, together with prior structural knowledge, enable a comprehensive analysis of the classical PTP family. Despite their largely conserved fold, surface properties of PTPs are strikingly diverse. A potential secondary substrate-binding pocket is frequently found in phosphatases, and this has implications for both substrate recognition and development of selective inhibitors. Structural comparison identified four diverse catalytic loop (WPD) conformations and suggested a mechanism for loop closure. Enzymatic assays revealed vast differences in PTP catalytic activity and identified PTPD1, PTPD2, and HDPTP as catalytically inert protein phosphatases. We propose a "head-to-toe" dimerization model for RPTPgamma/zeta that is distinct from the "inhibitory wedge" model and that provides a molecular basis for inhibitory regulation. This phosphatome resource gives an expanded insight into intrafamily PTP diversity, catalytic activity, substrate recognition, and autoregulatory self-association.
Heterogeneity within the self-renewal durability of adult hematopoietic stem cells (HSCs) challenges our understanding of the molecular framework underlying HSC function. Gene expression studies have been hampered by the presence of multiple HSC subtypes and contaminating non-HSCs in bulk HSC populations. To gain deeper insight into the gene expression program of murine HSCs, we combined single-cell functional assays with flow cytometric index sorting and single-cell gene expression assays. Through bioinformatic integration of these datasets, we designed an unbiased sorting strategy that separates non-HSCs away from HSCs, and single-cell transplantation experiments using the enriched population were combined with RNA-seq data to identify key molecules that associate with long-term durable self-renewal, producing a single-cell molecular dataset that is linked to functional stem cell activity. Finally, we demonstrated the broader applicability of this approach for linking key molecules with defined cellular functions in another stem cell system.