KBase
governmentBerkeley, United States
Research output, citation impact, and the most-cited recent papers from KBase. Aggregated across the NobleBlocks index of 300M+ scholarly works.
Top-cited papers from KBase
Sorghum, an African grass related to sugar cane and maize, is grown for food, feed, fibre and fuel. We present an initial analysis of the ∼730-megabase Sorghum bicolor (L.) Moench genome, placing ∼98% of genes in their chromosomal context using whole-genome shotgun sequence validated by genetic, physical and syntenic information. Genetic recombination is largely confined to about one-third of the sorghum genome with gene order and density similar to those of rice. Retrotransposon accumulation in recombinationally recalcitrant heterochromatin explains the ∼75% larger genome size of sorghum compared with rice. Although gene and repetitive DNA distributions have been preserved since palaeopolyploidization ∼70 million years ago, most duplicated gene sets lost one member before the sorghum–rice divergence. Concerted evolution makes one duplicated chromosomal segment appear to be only a few million years old. About 24% of genes are grass-specific and 7% are sorghum-specific. Recent gene and microRNA duplications may contribute to sorghum’s drought tolerance. The Sorghum bicolor genome sequence is published this week. Sorghum is a cereal grown widely as food, animal feed, fibre and fuel. Tolerant to hot, dry conditions, it is a staple for large populations in the West African Sahel region. Comparisons of the genome with those of maize and rice shed light on the evolution of grasses and of C4 photosynthesis, which is particularly efficient at assimilating carbon at high temperatures. In addition, protein coding genes and miRNAs that could contribute to sorghum's drought tolerance may also be found. Sorghum yield improvement has lagged behind that of other crops and the availability of the genome sequence could provide a vital boost to work on its improvement. Sorghum is an African grass that is grown for food, animal feed and fuel. The current paper presents an initial analysis of the ∼730 megabase genome of Sorghum bicolor. Genome analysis and its comparison with maize and rice shed light on grass genome evolution and also provide insights into the evolution of C4 photosynthesis, as well as protein coding genes and miRNAs that might contribute to sorghum's drought tolerance.
The relative activity of lipoprotein lipase (LPL) in different tissues controls the partitioning of lipoprotein-derived fatty acids between sites of fat storage (adipose tissue) and oxidation (heart and skeletal muscle). Here we used a reverse genetic strategy to test the hypothesis that 4 angiopoietin-like proteins (ANGPTL3, -4, -5, and -6) play key roles in triglyceride (TG) metabolism in humans. We re-sequenced the coding regions of the genes encoding these proteins and identified multiple rare nonsynonymous (NS) sequence variations that were associated with low plasma TG levels but not with other metabolic phenotypes. Functional studies revealed that all mutant alleles of ANGPTL3 and ANGPTL4 that were associated with low plasma TG levels interfered either with the synthesis or secretion of the protein or with the ability of the ANGPTL protein to inhibit LPL. A total of 1% of the Dallas Heart Study population and 4% of those participants with a plasma TG in the lowest quartile had a rare loss-of-function mutation in ANGPTL3, ANGPTL4, or ANGPTL5. Thus, ANGPTL3, ANGPTL4, and ANGPTL5, but not ANGPTL6, play nonredundant roles in TG metabolism, and multiple alleles at these loci cumulatively contribute to variability in plasma TG levels in humans.
To broaden our understanding of the evolution of gene regulation mechanisms, we generated occupancy profiles for 34 orthologous transcription factors (TFs) in human–mouse erythroid progenitor, lymphoblast and embryonic stem-cell lines. By combining the genome-wide transcription factor occupancy repertoires, associated epigenetic signals, and co-association patterns, here we deduce several evolutionary principles of gene regulatory features operating since the mouse and human lineages diverged. The genomic distribution profiles, primary binding motifs, chromatin states, and DNA methylation preferences are well conserved for TF-occupied sequences. However, the extent to which orthologous DNA segments are bound by orthologous TFs varies both among TFs and with genomic location: binding at promoters is more highly conserved than binding at distal elements. Notably, occupancy-conserved TF-occupied sequences tend to be pleiotropic; they function in several tissues and also co-associate with many TFs. Single nucleotide variants at sites with potential regulatory functions are enriched in occupancy-conserved TF-occupied sequences. As part of the mouse ENCODE project, genome-wide transcription factor (TF) occupancy repertoires and co-association patterns in mice and humans are studied; many aspects are conserved but the extent to which orthologous DNA segments are bound by TFs in mice and humans varies both among TFs and genomic location, and TF-occupied sequences whose occupancy is conserved tend to be pleiotropic and enriched for single nucleotide variants with known regulatory potential. As part of the mouse ENCODE project Mike Snyder and colleagues studied the genome-wide transcription factor (TF) occupancy repertoires, associated epigenetic signals, and TF co-association patterns in mice and humans to broaden our understanding of the evolution of gene regulation mechanisms in mammals. The results indicate that although many aspects of TF occupied sequences are conserved in both species, the extent to which orthologous DNA segments are bound by orthologous TFs in human and mouse varies both among TFs and with genomic location. Importantly, TF occupied sequences with conserved occupancy tend to be pleiotropic; they are also enriched for single nucleotide variants (SNVs) that are known to have regulatory potential or are associated with known phenotypes.
The oomycete vegetable pathogen Phytophthora capsici has shown remarkable adaptation to fungicides and new hosts. Like other members of this destructive genus, P. capsici has an explosive epidemiology, rapidly producing massive numbers of asexual spores on infected hosts. In addition, P. capsici can remain dormant for years as sexually recombined oospores, making it difficult to produce crops at infested sites, and allowing outcrossing populations to maintain significant genetic variation. Genome sequencing, development of a high-density genetic map, and integrative genomic or genetic characterization of P. capsici field isolates and intercross progeny revealed significant mitotic loss of heterozygosity (LOH) in diverse isolates. LOH was detected in clonally propagated field isolates and sexual progeny, cumulatively affecting >30% of the genome. LOH altered genotypes for more than 11,000 single-nucleotide variant sites and showed a strong association with changes in mating type and pathogenicity. Overall, it appears that LOH may provide a rapid mechanism for fixing alleles and may be an important component of adaptability for P. capsici.
Forward genetic mutational studies, adaptive evolution, and phenotypic screening are powerful tools for creating new variant organisms with desirable traits. However, mutations generated in the process cannot be easily identified with traditional genetic tools. We show that new high-throughput, massively parallel sequencing technologies can completely and accurately characterize a mutant genome relative to a previously sequenced parental (reference) strain. We studied a mutant strain of Pichia stipitis, a yeast capable of converting xylose to ethanol. This unusually efficient mutant strain was developed through repeated rounds of chemical mutagenesis, strain selection, transformation, and genetic manipulation over a period of seven years. We resequenced this strain on three different sequencing platforms. Surprisingly, we found fewer than a dozen mutations in open reading frames. All three sequencing technologies were able to identify each single nucleotide mutation given at least 10-15-fold nominal sequence coverage. Our results show that detecting mutations in evolved and engineered organisms is rapid and cost-effective at the whole-genome level using new sequencing technologies. Identification of specific mutations in strains with altered phenotypes will add insight into specific gene functions and guide further metabolic engineering efforts.
Published
Evolution of lignocellulose decomposition was one of the most ecologically important innovations in fungi. White-rot fungi in the Agaricomycetes (mushrooms and relatives) are the most effective microorganisms in degrading both cellulose and lignin components of woody plant cell walls (PCW). However, the precise evolutionary origins of lignocellulose decomposition are poorly understood, largely because certain early-diverging clades of Agaricomycetes and its sister group, the Dacrymycetes, have yet to be sampled, or have been undersampled, in comparative genomic studies. Here, we present new genome sequences of ten saprotrophic fungi, including members of the Dacrymycetes and early-diverging clades of Agaricomycetes (Cantharellales, Sebacinales, Auriculariales, and Trechisporales), which we use to refine the origins and evolutionary history of the enzymatic toolkit of lignocellulose decomposition. We reconstructed the origin of ligninolytic enzymes, focusing on class II peroxidases (AA2), as well as enzymes that attack crystalline cellulose. Despite previous reports of white rot appearing as early as the Dacrymycetes, our results suggest that white-rot fungi evolved later in the Agaricomycetes, with the first class II peroxidases reconstructed in the ancestor of the Auriculariales and residual Agaricomycetes. The exemplars of the most ancient clades of Agaricomycetes that we sampled all lack class II peroxidases, and are thus concluded to use a combination of plesiomorphic and derived PCW degrading enzymes that predate the evolution of white rot.
The United States Environmental Protection Agency (U.S. EPA) must characterize potential risks to human health and the environment associated with manufacture and use of thousands of chemicals. High-throughput screening (HTS) for biological activity allows the ToxCast research program to prioritize chemical inventories for potential hazard. Similar capabilities for estimating exposure potential would support rapid risk-based prioritization for chemicals with limited information; here, we propose a framework for high-throughput exposure assessment. To demonstrate application, an analysis was conducted that predicts human exposure potential for chemicals and estimates uncertainty in these predictions by comparison to biomonitoring data. We evaluated 1936 chemicals using far-field mass balance human exposure models (USEtox and RAIDAR) and an indicator for indoor and/or consumer use. These predictions were compared to exposures inferred by Bayesian analysis from urine concentrations for 82 chemicals reported in the National Health and Nutrition Examination Survey (NHANES). Joint regression on all factors provided a calibrated consensus prediction, the variance of which serves as an empirical determination of uncertainty for prioritization on absolute exposure potential. Information on use was found to be most predictive; generally, chemicals above the limit of detection in NHANES had consumer/indoor use. Coupled with hazard HTS, exposure HTS can place risk earlier in decision processes. High-priority chemicals become targets for further data collection.
We have determined the complete sequence of the mitochondrial genome of the scaphopod mollusk Graptacme eborea (14,492 nts) and completed the sequence of the mitochondrial genome of the bivalve mollusk Mytilus edulis (16,740 nts). (The name Graptacme eborea is a revision of the species formerly known as Dentalium eboreum.) G. eborea mtDNA contains the 37 genes that are typically found and has the genes divided about evenly between the two strands, but M. edulis contains an extra trnM and is missing atp8, and it has all genes on the same strand. Each has a highly rearranged gene order relative to each other and to all other studied mtDNAs. G. eborea mtDNA has almost no strand skew, but the coding strand of M. edulis mtDNA is very rich in G and T. This is reflected in differential codon usage patterns and even in amino acid compositions. G. eborea mtDNA has fewer noncoding nucleotides than any other mtDNA studied to date, with the largest noncoding region only 24 nt long. Phylogenetic analysis using 2,420 aligned amino acid positions of concatenated proteins weakly supports an association of the scaphopod with gastropods to the exclusion of Bivalvia, Cephalopoda, and Polyplacophora, but it is generally unable to convincingly resolve the relationships among major groups of the Lophotrochozoa, in contrast to the good resolution seen for several other major metazoan groups.
Cavitation, known as the formation of vapor bubbles when liquids are under tension, is of great interest both in condensed matter science as well as in diverse applications such as botany, hydraulic engineering, and medicine. Although widely studied in bulk and microscale-confined liquids, cavitation in the nanoscale is generally believed to be energetically unfavorable and has never been experimentally demonstrated. Here we report evaporation-induced cavitation in water-filled hydrophilic nanochannels under enormous negative pressures up to -7 MPa. As opposed to receding menisci observed in microchannel evaporation, the menisci in nanochannels are pinned at the entrance while vapor bubbles form and expand inside. Evaporation in the channels is found to be aided by advective liquid transport, which leads to an evaporation rate that is an order of magnitude higher than that governed by Fickian vapor diffusion in macro- and microscale evaporation. The vapor bubbles also exhibit unusual motion as well as translational stability and symmetry, which occur because of a balance between two competing mass fluxes driven by thermocapillarity and evaporation. Our studies expand our understanding of cavitation and provide new insights for phase-change phenomena at the nanoscale.
Development of daptomycin (DAP) resistance in Enterococcus faecalis has recently been associated with mutations in genes encoding proteins with two main functions: (i) control of the cell envelope stress response to antibiotics and antimicrobial peptides (LiaFSR system) and (ii) cell membrane phospholipid metabolism (glycerophosphoryl diester phosphodiesterase and cardiolipin synthase [cls]). However, the genetic bases for DAP resistance in Enterococcus faecium are unclear. We performed whole-genome comparative analysis of a clinical strain pair, DAP-susceptible E. faecium S447 and its DAP-resistant derivative R446, which was recovered from a single patient during DAP therapy. By comparative whole-genome sequencing, DAP resistance in R446 was associated with changes in 8 genes. Two of these genes encoded proteins involved in phospholipid metabolism: (i) an R218Q substitution in Cls and (ii) an A292G reversion in a putative cyclopropane fatty acid synthase enzyme. The DAP-resistant derivative R446 also exhibited an S333L substitution in the putative histidine kinase YycG, a member of the YycFG system, which, similar to LiaFSR, has been involved in cell envelope homeostasis and DAP resistance in other Gram-positive cocci. Additional changes identified in E. faecium R446 (DAP resistant) included two putative proteins involved in transport (one for carbohydrate and one for sulfate) and three enzymes predicted to play a role in general metabolism. Exchange of the "susceptible" cls allele from S447 for the "resistant" one belonging to R446 did not affect DAP susceptibility. Our results suggest that, apart from the LiaFSR system, the essential YycFG system is likely to be an important mediator of DAP resistance in some E. faecium strains.
A robust, noncatenated, and permanently microporous metal-organic framework (MOF) material has been synthesized by combining a new nonplanar ligand, 4,4',4'',4'''-benzene-1,2,4,5-tetrayltetrabenzoic acid, with a zinc(II) source under solvothermal conditions. The new material features cavities that are readily modified via activation and functionalization of framework nodes (as opposed to struts). A preliminary investigation of the "empty cavity" version of the material and six cavity-modified versions reveals that modification can substantially modulate the MOF's internal surface area, pore volume, and ability to sorb molecular hydrogen.
Mobility engineering is one of the most important challenges that determine the optoelectronic performance of two-dimensional (2D) materials. So far, charged-impurity scattering and electrical-contact barriers have been suppressed through high-κ dielectrics and seamless contact engineering, giving rise to carrier-mobility improvement in exfoliated 2D semiconducting MoS2. Here we demonstrate a facile and scalable technique to effectively suppress both Coulomb scattering and electron-phonon scattering via the HfO2 overlayer, resulting in a large mobility improvement in CVD-grown monolayer MoS2, in excess of 60 cm2 V-1 s-1. Surface passivation and suppression of Coulomb scattering can partially contribute to the mobility increase. Interestingly, we correlate the mobility increase with phonon quenching through Raman and temperature-dependent mobility measurements. The experimental method is facile, industrially scalable, and renders phonon engineering an additional leverage towards further improvements in 2D semiconductor mobility and device performance.
Mammalian genomes carry hundreds of Krüppel -type zinc finger (ZNF) genes, most of which reside in familial clusters. ZNF genes encoding Krüppel -associated box (KRAB) motifs are especially prone to this type of tandem organization. Despite their prevalence, little is known about the functions or evolutionary histories of these clustered gene families. Here we describe a homologous pair of human and mouse KRAB-ZNF gene clusters containing 21 human and 10 mouse genes, respectively. Evolutionary analysis uncovered only three pairs of putative orthologs and two cases where a single gene in one species is related to multiple genes in the other; several human genes have no obvious homolog in mouse. We deduce that duplication and loss of ancestral cluster members occurred independently in the primate and rodent lineages after divergence, yielding substantially different ZNF gene repertoires in humans and mice. Differences in expression patterns and sequence divergence within the DNA binding regions of predicted proteins suggest that the duplicated genes have acquired novel functions over evolutionary time. Since KRAB-ZNF proteins are predicted to function as transcriptional regulators, the elaboration of new lineage-specific genes in this and other clustered ZNF families is likely to have had a significant impact on species-specific aspects of biology.
Mixed matrix membranes are being studied for their potential use in post-combustion carbon capture on the premise that they could dramatically lower costs relative to mature technologies available today.
The electronic, optical and thermodynamic properties of ABO<sub>3</sub> (A = La,Sr, B = Fe,Co) perovskites are investigated using first-principles calculations.
Ligand effects are of major interest in catalytic reactions owing to their potential critical role in determining the reaction activity and selectivity. Herein, we report ligand effects in the CO2 electrochemical reduction reaction at the atomic level with three unique Au25 nanoclusters comprising the same kernel but different protecting ligands (-XR, where X = S or Se, and R represents the carbon tail). It is observed that a change in the carbon tail shows no obvious impact on the catalytic selectivity and activity, but the anchoring atom (X = S or Se) strongly affects the electrocatalytic selectivity. Specifically, the S site acts as the active site and sustains CO selectivity, while the Se site shows a higher tendency of hydrogen evolution. Density functional theory (DFT) calculations reveal that the energy penalty associated with the *COOH formation is lower on the S site by 0.26 eV compared to that on the Se site. Additionally, the formation energy of the product (*CO) is lower on the sulfur-based Au nanocluster by 0.43 eV. We attribute these energetic differences to the higher electron density on the sulfur sites of the Au nanocluster, resulting in a modified bonding character of the reaction intermediates that reduce the energetic penalty for the *COOH and *CO formation. Overall, this work demonstrates that S/Se atoms at the metal-ligand interface can play an important role in determining the overall electrocatalytic performance of Au nanoclusters.
Near-field scanning optical microscopy images of solid wall, circular, and elliptical microscale corrals show standing wave patterns confined inside the structures with a wavelength close to that of the incident light. The patterns inside the corrals can be tuned by changing the size and material of the walls, the wavelength of incident light, and polarization direction for elliptical corrals. Finite-difference time-domain calculations of the corral structures agree with the experimental observations and reveal that the electric and magnetic field intensities are out of phase inside the corral. A theoretical modal analysis indicates that the fields inside the corrals can be attributed to p- and s-polarized waveguide modes, and that the superposition of the propagating and evanescent modes can explain the phase differences between the fields. These experimental and theoretical results demonstrate that electromagnetic fields on a dielectric surface can be controlled in a predictable manner.
ABO3-δ (A = La, Sr, B = Fe, Co) perovskites are useful in a wide range of applications, including their recent exploration for application in high-temperature optical oxygen sensing for energy conversion devices such as solid oxide fuel cells. To elucidate the dependence of functional properties and oxygen vacancy formation on defect chemistry and composition, first principles calculations are presented. The obtained results show that oxygen vacancy (VO) formation energies are in the order of LaFeO3 > LaCoO3 > SrFeO3 > SrCoO3. Furthermore, the influence of VO on the electronic and optical properties is investigated for the high temperature stable phases (T = 1100 K). For the LaFeO3 insulator, the VO donated electrons are all localized on the down-spin d3z2-r2 orbitals of the nearest Fe ions. These defect states located in the band gap induce a drop in the energy onset of absorption as pristine bulk → V2+O → V1+O → V0O, and especially, an extra absorption peak appears between 0.5 and 1.5 eV due to V0O and V1+O formation. In the rest of the crystals that expressed a metallic feature, the VO donated electrons partially localize on the down-spin d3z2-r2 orbital and partially delocalize through the lattice, by which the absorption peaks (0.5-2.0 eV for LaCoO3, 0.0-0.5 eV for SrFeO3 and SrCoO3) from the electronic excitation near the Fermi level are enhanced. A high VO concentration of oxygen divacancy in SrFeO3 and SrCoO3 could enhance charge localization on down-spin d3z2-r2 orbitals, resulting in a remarkable increase of optical absorption at 1.5-3.0 eV.
This report examines how wind and solar integration studies have evolved, what analysis techniques work, what common mistakes are still made, what improvements are likely to be made in the near future, and why calculating integration costs is such a difficult problem and should be undertaken carefully, if at all.