Avinashilingam Institute for Home Science and Higher Education for Women
Citations
All
Search in:AllTitleAbstractAuthor name
Publications
(20K+)
Patents
Grants
Pathways
Clinical trials
Publication
Journal: Science
February/10/2015
Abstract
Resolving the molecular details of proteome variation in the different tissues and organs of the human body will greatly increase our knowledge of human biology and disease. Here, we present a map of the human tissue proteome based on an integrated omics approach that involves quantitative transcriptomics at the tissue and organ level, combined with tissue microarray-based immunohistochemistry, to achieve spatial localization of proteins down to the single-cell level. Our tissue-based analysis detected more than 90% of the putative protein-coding genes. We used this approach to explore the human secretome, the membrane proteome, the druggable proteome, the cancer proteome, and the metabolic functions in 32 different tissues and organs. All the data are integrated in an interactive Web-based database that allows exploration of individual proteins, as well as navigation of global expression patterns, in all major tissues and organs in the human body.
Publication
Journal: Nucleic Acids Research
March/16/2014
Abstract
Pfam, available via servers in the UK (http://pfam.sanger.ac.uk/) and the USA (http://pfam.janelia.org/), is a widely used database of protein families, containing 14 831 manually curated entries in the current release, version 27.0. Since the last update article 2 years ago, we have generated 1182 new families and maintained sequence coverage of the UniProt Knowledgebase (UniProtKB) at nearly 80%, despite a 50% increase in the size of the underlying sequence database. Since our 2012 article describing Pfam, we have also undertaken a comprehensive review of the features that are provided by Pfam over and above the basic family data. For each feature, we determined the relevance, computational burden, usage statistics and the functionality of the feature in a website context. As a consequence of this review, we have removed some features, enhanced others and developed new ones to meet the changing demands of computational biology. Here, we describe the changes to Pfam content. Notably, we now provide family alignments based on four different representative proteome sequence data sets and a new interactive DNA search interface. We also discuss the mapping between Pfam and known 3D structures.
Publication
Journal: Nature Neuroscience
January/7/2010
Abstract
The Cre/lox system is widely used in mice to achieve cell-type-specific gene expression. However, a strong and universally responding system to express genes under Cre control is still lacking. We have generated a set of Cre reporter mice with strong, ubiquitous expression of fluorescent proteins of different spectra. The robust native fluorescence of these reporters enables direct visualization of fine dendritic structures and axonal projections of the labeled neurons, which is useful in mapping neuronal circuitry, imaging and tracking specific cell populations in vivo. Using these reporters and a high-throughput in situ hybridization platform, we are systematically profiling Cre-directed gene expression throughout the mouse brain in several Cre-driver lines, including new Cre lines targeting different cell types in the cortex. Our expression data are displayed in a public online database to help researchers assess the utility of various Cre-driver lines for cell-type-specific genetic manipulation.
Publication
Journal: Nature Protocols
February/23/2014
Abstract
De novo assembly of RNA-seq data enables researchers to study transcriptomes without the need for a genome sequence; this approach can be usefully applied, for instance, in research on 'non-model organisms' of ecological and evolutionary importance, cancer samples or the microbiome. In this protocol we describe the use of the Trinity platform for de novo transcriptome assembly from RNA-seq data in non-model organisms. We also present Trinity-supported companion utilities for downstream applications, including RSEM for transcript abundance estimation, R/Bioconductor packages for identifying differentially expressed transcripts across samples and approaches to identify protein-coding genes. In the procedure, we provide a workflow for genome-independent transcriptome analysis leveraging the Trinity platform. The software, documentation and demonstrations are freely available from http://trinityrnaseq.sourceforge.net. The run time of this protocol is highly dependent on the size and complexity of data to be analyzed. The example data set analyzed in the procedure detailed herein can be processed in less than 5 h.
Publication
Journal: Autophagy
October/18/2016
Publication
Journal: Nature Reviews Molecular Cell Biology
September/23/2014
Abstract
MicroRNAs (miRNAs) are small non-coding RNAs that function as guide molecules in RNA silencing. Targeting most protein-coding transcripts, miRNAs are involved in nearly all developmental and pathological processes in animals. The biogenesis of miRNAs is under tight temporal and spatial control, and their dysregulation is associated with many human diseases, particularly cancer. In animals, miRNAs are ∼22 nucleotides in length, and they are produced by two RNase III proteins--Drosha and Dicer. miRNA biogenesis is regulated at multiple levels, including at the level of miRNA transcription; its processing by Drosha and Dicer in the nucleus and cytoplasm, respectively; its modification by RNA editing, RNA methylation, uridylation and adenylation; Argonaute loading; and RNA decay. Non-canonical pathways for miRNA biogenesis, including those that are independent of Drosha or Dicer, are also emerging.
Publication
Journal: Bioinformatics
September/19/2013
Abstract
BACKGROUND
Molecular simulation has historically been a low-throughput technique, but faster computers and increasing amounts of genomic and structural data are changing this by enabling large-scale automated simulation of, for instance, many conformers or mutants of biomolecules with or without a range of ligands. At the same time, advances in performance and scaling now make it possible to model complex biomolecular interaction and function in a manner directly testable by experiment. These applications share a need for fast and efficient software that can be deployed on massive scale in clusters, web servers, distributed computing or cloud resources.
RESULTS
Here, we present a range of new simulation algorithms and features developed during the past 4 years, leading up to the GROMACS 4.5 software package. The software now automatically handles wide classes of biomolecules, such as proteins, nucleic acids and lipids, and comes with all commonly used force fields for these molecules built-in. GROMACS supports several implicit solvent models, as well as new free-energy algorithms, and the software now uses multithreading for efficient parallelization even on low-end systems, including windows-based workstations. Together with hand-tuned assembly kernels and state-of-the-art parallelization, this provides extremely high performance and cost efficiency for high-throughput as well as massively parallel simulations.
BACKGROUND
GROMACS is an open source and free software available from http://www.gromacs.org.
BACKGROUND
Supplementary data are available at Bioinformatics online.
Publication
Journal: Nature
February/26/2015
Abstract
Obesity is heritable and predisposes to many diseases. To understand the genetic basis of obesity better, here we conduct a genome-wide association study and Metabochip meta-analysis of body mass index (BMI), a measure commonly used to define obesity and assess adiposity, in up to 339,224 individuals. This analysis identifies 97 BMI-associated loci (P < 5 × 10(-8)), 56 of which are novel. Five loci demonstrate clear evidence of several independent association signals, and many loci have significant effects on other metabolic phenotypes. The 97 loci account for ∼2.7% of BMI variation, and genome-wide estimates suggest that common variation accounts for >20% of BMI variation. Pathway analyses provide strong support for a role of the central nervous system in obesity susceptibility and implicate new genes and pathways, including those related to synaptic function, glutamate signalling, insulin secretion/action, energy metabolism, lipid biology and adipogenesis.
Publication
Journal: Nature
December/22/2009
Abstract
Genomes are organized into high-level three-dimensional structures, and DNA elements separated by long genomic distances can in principle interact functionally. Many transcription factors bind to regulatory DNA elements distant from gene promoters. Although distal binding sites have been shown to regulate transcription by long-range chromatin interactions at a few loci, chromatin interactions and their impact on transcription regulation have not been investigated in a genome-wide manner. Here we describe the development of a new strategy, chromatin interaction analysis by paired-end tag sequencing (ChIA-PET) for the de novo detection of global chromatin interactions, with which we have comprehensively mapped the chromatin interaction network bound by oestrogen receptor alpha (ER-alpha) in the human genome. We found that most high-confidence remote ER-alpha-binding sites are anchored at gene promoters through long-range chromatin interactions, suggesting that ER-alpha functions by extensive chromatin looping to bring genes together for coordinated transcriptional regulation. We propose that chromatin interactions constitute a primary mechanism for regulating transcription in mammalian genomes.
Publication
Journal: Scientific data
September/5/2016
Abstract
There is an urgent need to improve the infrastructure supporting the reuse of scholarly data. A diverse set of stakeholders-representing academia, industry, funding agencies, and scholarly publishers-have come together to design and jointly endorse a concise and measureable set of principles that we refer to as the FAIR Data Principles. The intent is that these may act as a guideline for those wishing to enhance the reusability of their data holdings. Distinct from peer initiatives that focus on the human scholar, the FAIR Principles put specific emphasis on enhancing the ability of machines to automatically find and use the data, in addition to supporting its reuse by individuals. This Comment is the first formal publication of the FAIR Principles, and includes the rationale behind them, and some exemplar implementations in the community.
Publication
Journal: Nature
November/4/2012
Abstract
Neuroanatomically precise, genome-wide maps of transcript distributions are critical resources to complement genomic sequence data and to correlate functional and genetic brain architecture. Here we describe the generation and analysis of a transcriptional atlas of the adult human brain, comprising extensive histological analysis and comprehensive microarray profiling of ∼900 neuroanatomically precise subdivisions in two individuals. Transcriptional regulation varies enormously by anatomical location, with different regions and their constituent cell types displaying robust molecular signatures that are highly conserved between individuals. Analysis of differential gene expression and gene co-expression relationships demonstrates that brain-wide variation strongly reflects the distributions of major cell classes such as neurons, oligodendrocytes, astrocytes and microglia. Local neighbourhood relationships between fine anatomical subdivisions are associated with discrete neuronal subtypes and genes involved with synaptic transmission. The neocortex displays a relatively homogeneous transcriptional pattern, but with distinct features associated selectively with primary sensorimotor cortices and with enriched frontal lobe expression. Notably, the spatial topography of the neocortex is strongly reflected in its molecular topography-the closer two cortical regions, the more similar their transcriptomes. This freely accessible online data resource forms a high-resolution transcriptional baseline for neurogenetic studies of normal and abnormal human brain function.
Publication
Journal: Nature Genetics
January/19/2015
Abstract
Using genome-wide data from 253,288 individuals, we identified 697 variants at genome-wide significance that together explained one-fifth of the heritability for adult height. By testing different numbers of variants in independent studies, we show that the most strongly associated ∼2,000, ∼3,700 and ∼9,500 SNPs explained ∼21%, ∼24% and ∼29% of phenotypic variance. Furthermore, all common variants together captured 60% of heritability. The 697 variants clustered in 423 loci were enriched for genes, pathways and tissue types known to be involved in growth and together implicated genes and pathways not highlighted in earlier efforts, such as signaling by fibroblast growth factors, WNT/β-catenin and chondroitin sulfate-related genes. We identified several genes and pathways not previously connected with human skeletal growth, including mTOR, osteoglycin and binding of hyaluronic acid. Our results indicate a genetic architecture for human height that is characterized by a very large but finite number (thousands) of causal variants.
Publication
Journal: Nature Genetics
April/22/2007
Abstract
Embryonic stem cells rely on Polycomb group proteins to reversibly repress genes required for differentiation. We report that stem cell Polycomb group targets are up to 12-fold more likely to have cancer-specific promoter DNA hypermethylation than non-targets, supporting a stem cell origin of cancer in which reversible gene repression is replaced by permanent silencing, locking the cell into a perpetual state of self-renewal and thereby predisposing to subsequent malignant transformation.
Publication
Journal: Science
December/30/2015
Abstract
Antibodies targeting CTLA-4 have been successfully used as cancer immunotherapy. We find that the antitumor effects of CTLA-4 blockade depend on distinct Bacteroides species. In mice and patients, T cell responses specific for B. thetaiotaomicron or B. fragilis were associated with the efficacy of CTLA-4 blockade. Tumors in antibiotic-treated or germ-free mice did not respond to CTLA blockade. This defect was overcome by gavage with B. fragilis, by immunization with B. fragilis polysaccharides, or by adoptive transfer of B. fragilis-specific T cells. Fecal microbial transplantation from humans to mice confirmed that treatment of melanoma patients with antibodies against CTLA-4 favored the outgrowth of B. fragilis with anticancer properties. This study reveals a key role for Bacteroidales in the immunostimulatory effects of CTLA-4 blockade.
Publication
Journal: Nature
May/13/2014
Abstract
Comprehensive knowledge of the brain's wiring diagram is fundamental for understanding how the nervous system processes information at both local and global scales. However, with the singular exception of the C. elegans microscale connectome, there are no complete connectivity data sets in other species. Here we report a brain-wide, cellular-level, mesoscale connectome for the mouse. The Allen Mouse Brain Connectivity Atlas uses enhanced green fluorescent protein (EGFP)-expressing adeno-associated viral vectors to trace axonal projections from defined regions and cell types, and high-throughput serial two-photon tomography to image the EGFP-labelled axons throughout the brain. This systematic and standardized approach allows spatial registration of individual experiments into a common three dimensional (3D) reference space, resulting in a whole-brain connectivity matrix. A computational model yields insights into connectional strength distribution, symmetry and other network properties. Virtual tractography illustrates 3D topography among interconnected regions. Cortico-thalamic pathway analysis demonstrates segregation and integration of parallel pathways. The Allen Mouse Brain Connectivity Atlas is a freely available, foundational resource for structural and functional investigations into the neural circuits that support behavioural and cognitive processes in health and disease.
Publication
Journal: Science
October/22/2017
Abstract
Cancer is one of the leading causes of death, and there is great interest in understanding the underlying molecular mechanisms involved in the pathogenesis and progression of individual tumors. We used systems-level approaches to analyze the genome-wide transcriptome of the protein-coding genes of 17 major cancer types with respect to clinical outcome. A general pattern emerged: Shorter patient survival was associated with up-regulation of genes involved in cell growth and with down-regulation of genes involved in cellular differentiation. Using genome-scale metabolic models, we show that cancer patients have widespread metabolic heterogeneity, highlighting the need for precise and personalized medicine for cancer treatment. All data are presented in an interactive open-access database (www.proteinatlas.org/pathology) to allow genome-wide exploration of the impact of individual proteins on clinical outcomes.
Publication
Journal: Molecular and Cellular Proteomics
October/8/2014
Abstract
Global classification of the human proteins with regards to spatial expression patterns across organs and tissues is important for studies of human biology and disease. Here, we used a quantitative transcriptomics analysis (RNA-Seq) to classify the tissue-specific expression of genes across a representative set of all major human organs and tissues and combined this analysis with antibody-based profiling of the same tissues. To present the data, we launch a new version of the Human Protein Atlas that integrates RNA and protein expression data corresponding to ∼80% of the human protein-coding genes with access to the primary data for both the RNA and the protein analysis on an individual gene level. We present a classification of all human protein-coding genes with regards to tissue-specificity and spatial expression pattern. The integrative human expression map can be used as a starting point to explore the molecular constituents of the human body.
Publication
Journal: Bioinformatics
August/14/2017
Abstract
Fast and accurate quality control is essential for studies involving next-generation sequencing data. Whilst numerous tools exist to quantify QC metrics, there is no common approach to flexibly integrate these across tools and large sample sets. Assessing analysis results across an entire project can be time consuming and error prone; batch effects and outlier samples can easily be missed in the early stages of analysis.
We present MultiQC, a tool to create a single report visualising output from multiple tools across many samples, enabling global trends and biases to be quickly identified. MultiQC can plot data from many common bioinformatics tools and is built to allow easy extension and customization.
MultiQC is available with an GNU GPLv3 license on GitHub, the Python Package Index and Bioconda. Documentation and example reports are available at http://multiqc.info
phil.ewels@scilifelab.se.
Publication
Journal: Nature
January/27/2014
Abstract
We present a high-quality genome sequence of a Neanderthal woman from Siberia. We show that her parents were related at the level of half-siblings and that mating among close relatives was common among her recent ancestors. We also sequenced the genome of a Neanderthal from the Caucasus to low coverage. An analysis of the relationships and population history of available archaic genomes and 25 present-day human genomes shows that several gene flow events occurred among Neanderthals, Denisovans and early modern humans, possibly including gene flow into Denisovans from an unknown archaic group. Thus, interbreeding, albeit of low magnitude, occurred among many hominin groups in the Late Pleistocene. In addition, the high-quality Neanderthal genome allows us to establish a definitive list of substitutions that became fixed in modern humans after their separation from the ancestors of Neanderthals and Denisovans.
Publication
Journal: Nature Reviews Immunology
July/27/2014
Abstract
Monocytes and macrophages have crucial and distinct roles in tissue homeostasis and immunity, but they also contribute to a broad spectrum of pathologies and are thus attractive therapeutic targets. Potential intervention strategies that aim to manipulate these cells will require an in-depth understanding of their origins and the mechanisms that ensure their homeostasis. Recent evidence shows that monocytes do not substantially contribute to most tissue macrophage populations in the steady state or during certain types of inflammation. Rather, most tissue macrophage populations in mice are derived from embryonic precursors, are seeded before birth and can maintain themselves in adults by self-renewal. In this Review, we discuss the evidence that has dramatically changed our understanding of monocyte and macrophage development, and the maintenance of these cells in the steady state.
Publication
Journal: Journal of the American College of Cardiology
August/15/2017
Abstract
BACKGROUND
The burden of cardiovascular diseases (CVDs) remains unclear in many regions of the world.
OBJECTIVE
The GBD (Global Burden of Disease) 2015 study integrated data on disease incidence, prevalence, and mortality to produce consistent, up-to-date estimates for cardiovascular burden.
METHODS
CVD mortality was estimated from vital registration and verbal autopsy data. CVD prevalence was estimated using modeling software and data from health surveys, prospective cohorts, health system administrative data, and registries. Years lived with disability (YLD) were estimated by multiplying prevalence by disability weights. Years of life lost (YLL) were estimated by multiplying age-specific CVD deaths by a reference life expectancy. A sociodemographic index (SDI) was created for each location based on income per capita, educational attainment, and fertility.
RESULTS
In 2015, there were an estimated 422.7 million cases of CVD (95% uncertainty interval: 415.53 to 427.87 million cases) and 17.92 million CVD deaths (95% uncertainty interval: 17.59 to 18.28 million CVD deaths). Declines in the age-standardized CVD death rate occurred between 1990 and 2015 in all high-income and some middle-income countries. Ischemic heart disease was the leading cause of CVD health lost globally, as well as in each world region, followed by stroke. As SDI increased beyond 0.25, the highest CVD mortality shifted from women to men. CVD mortality decreased sharply for both sexes in countries with an SDI >0.75.
CONCLUSIONS
CVDs remain a major cause of health loss for all regions of the world. Sociodemographic change over the past 25 years has been associated with dramatic declines in CVD in regions with very high SDI, but only a gradual decrease or no change in most regions. Future updates of the GBD study can be used to guide policymakers who are focused on reducing the overall burden of noncommunicable disease and achieving specific global health targets for CVD.
Publication
Journal: Nature
November/17/1993
Abstract
Do biological motors move with regular steps? To address this question, we constructed instrumentation with the spatial and temporal sensitivity to resolve movement on a molecular scale. We deposited silica beads carrying single molecules of the motor protein kinesin on microtubules using optical tweezers and analysed their motion under controlled loads by interferometry. We find that kinesin moves with 8-nm steps.
Publication
Journal: Genome Biology
September/24/2009
Abstract
BACKGROUND
The genome of the domestic cow, Bos taurus, was sequenced using a mixture of hierarchical and whole-genome shotgun sequencing methods.
RESULTS
We have assembled the 35 million sequence reads and applied a variety of assembly improvement techniques, creating an assembly of 2.86 billion base pairs that has multiple improvements over previous assemblies: it is more complete, covering more of the genome; thousands of gaps have been closed; many erroneous inversions, deletions, and translocations have been corrected; and thousands of single-nucleotide errors have been corrected. Our evaluation using independent metrics demonstrates that the resulting assembly is substantially more accurate and complete than alternative versions.
CONCLUSIONS
By using independent mapping data and conserved synteny between the cow and human genomes, we were able to construct an assembly with excellent large-scale contiguity in which a large majority (approximately 91%) of the genome has been placed onto the 30 B. taurus chromosomes. We constructed a new cow-human synteny map that expands upon previous maps. We also identified for the first time a portion of the B. taurus Y chromosome.
Publication
Journal: Cell
December/22/1997
Abstract
Leukocyte trafficking at the endothelium requires both cellular adhesion molecules and chemotactic factors. Fractalkine, a novel transmembrane molecule with a CX3C-motif chemokine domain atop a mucin stalk, induces both adhesion and migration of leukocytes. Here we identify a seven-transmembrane high-affinity receptor for fractalkine and show that it mediates both the adhesive and migratory functions of fractalkine. The receptor, now termed CX3CR1, requires pertussis toxin-sensitive G protein signaling to induce migration but not to support adhesion, which also occurs without other adhesion molecules but requires the architecture of a chemokine domain atop the mucin stalk. Natural killer cells predominantly express CX3CR1 and respond to fractalkine in both migration and adhesion. Thus, fractalkine and CX3CR1 represent new types of leukocyte trafficking regulators, performing both adhesive and chemotactic functions.
load more...