Genes
Publications
(57K+)
Publication
Journal: Nature
October/16/2012
Abstract
Eukaryotic cells make many types of primary and processed RNAs that are found either in specific subcellular compartments or throughout the cells. A complete catalogue of these RNAs is not yet available and their characteristic subcellular localizations are also poorly understood. Because RNA represents the direct output of the genetic information encoded by genomes and a significant proportion of a cell's regulatory capabilities are focused on its synthesis, processing, transport, modification and translation, the generation of such a catalogue is crucial for understanding genome function. Here we report evidence that three-quarters of the human genome is capable of being transcribed, as well as observations about the range and levels of expression, localization, processing fates, regulatory regions and modifications of almost all currently annotated and thousands of previously unannotated RNAs. These observations, taken together, prompt a redefinition of the concept of a gene.
Publication
Journal: Nucleic Acids Research
January/19/2004
Abstract
A grand challenge in the post-genomic era is a complete computer representation of the cell and the organism, which will enable computational prediction of higher-level complexity of cellular processes and organism behavior from genomic information. Toward this end we have been developing a knowledge-based approach for network prediction, which is to predict, given a complete set of genes in the genome, the protein interaction networks that are responsible for various cellular processes. KEGG at http://www.genome.ad.jp/kegg/ is the reference knowledge base that integrates current knowledge on molecular interaction networks such as pathways and complexes (PATHWAY database), information about genes and proteins generated by genome projects (GENES/SSDB/KO databases) and information about biochemical compounds and reactions (COMPOUND/GLYCAN/REACTION databases). These three types of database actually represent three graph objects, called the protein network, the gene universe and the chemical universe. New efforts are being made to abstract knowledge, both computationally and manually, about ortholog clusters in the KO (KEGG Orthology) database, and to collect and analyze carbohydrate structures in the GLYCAN database.
Publication
Journal: Journal of Molecular Evolution
January/22/1986
Abstract
A new statistical method for estimating divergence dates of species from DNA sequence data by a molecular clock approach is developed. This method takes into account effectively the information contained in a set of DNA sequence data. The molecular clock of mitochondrial DNA (mtDNA) was calibrated by setting the date of divergence between primates and ungulates at the Cretaceous-Tertiary boundary (65 million years ago), when the extinction of dinosaurs occurred. A generalized least-squares method was applied in fitting a model to mtDNA sequence data, and the clock gave dates of 92.3 +/- 11.7, 13.3 +/- 1.5, 10.9 +/- 1.2, 3.7 +/- 0.6, and 2.7 +/- 0.6 million years ago (where the second of each pair of numbers is the standard deviation) for the separation of mouse, gibbon, orangutan, gorilla, and chimpanzee, respectively, from the line leading to humans. Although there is some uncertainty in the clock, this dating may pose a problem for the widely believed hypothesis that the bipedal creature Australopithecus afarensis, which lived some 3.7 million years ago at Laetoli in Tanzania and at Hadar in Ethiopia, was ancestral to man and evolved after the human-ape splitting. Another, likelier possibility is that mtDNA was transferred through hybridization between a proto-human and a proto-chimpanzee after the former had developed bipedalism.
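The calibration step in this abstract can be illustrated with a deliberately simplified strict-clock calculation. This is not the generalized least-squares method the abstract describes: it swaps in plain proportional scaling, and the function names and distance values are hypothetical, chosen only to show the arithmetic.

```python
def calibrate_rate(distance, calibration_time_mya):
    """Substitutions per site per million years along one lineage.

    The pairwise distance accumulates along both lineages since the split,
    hence the factor of 2.
    """
    return distance / (2.0 * calibration_time_mya)


def divergence_time(distance, rate):
    """Estimated split time (million years ago) under a strict clock."""
    return distance / (2.0 * rate)


# Hypothetical distances, NOT values from the paper.
CALIBRATION_DISTANCE = 0.26   # assumed distance for the calibration pair
CALIBRATION_TIME_MYA = 65.0   # calibration date stated in the abstract
rate = calibrate_rate(CALIBRATION_DISTANCE, CALIBRATION_TIME_MYA)

# A pair with one fifth of the calibration distance is dated to one fifth
# of the calibration time.
estimate = divergence_time(0.052, rate)
```

A real analysis would also correct distances for multiple substitutions and propagate uncertainty, which is where a model-fitting approach such as the one in the abstract comes in.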
Publication
Journal: Nature
June/20/2005
Abstract
To date, more than 200 microRNAs have been described in humans; however, the precise functions of these regulatory, non-coding RNAs remain largely obscure. One cluster of microRNAs, the mir-17-92 polycistron, is located in a region of DNA that is amplified in human B-cell lymphomas. Here we compared B-cell lymphoma samples and cell lines to normal tissues, and found that the levels of the primary or mature microRNAs derived from the mir-17-92 locus are often substantially increased in these cancers. Enforced expression of the mir-17-92 cluster acted with c-myc expression to accelerate tumour development in a mouse B-cell lymphoma model. Tumours derived from haematopoietic stem cells expressing a subset of the mir-17-92 cluster and c-myc could be distinguished by an absence of apoptosis that was otherwise prevalent in c-myc-induced lymphomas. Together, these studies indicate that non-coding RNAs, specifically microRNAs, can modulate tumour formation, and implicate the mir-17-92 cluster as a potential human oncogene.
Publication
Journal: Nature Reviews Genetics
August/21/2012
Abstract
DNA methylation is frequently described as a 'silencing' epigenetic mark, and indeed this function of 5-methylcytosine was originally proposed in the 1970s. Now, thanks to improved genome-scale mapping of methylation, we can evaluate DNA methylation in different genomic contexts: transcriptional start sites with or without CpG islands, in gene bodies, at regulatory elements and at repeat sequences. The emerging picture is that the function of DNA methylation seems to vary with context, and the relationship between DNA methylation and transcription is more nuanced than we realized at first. Improving our understanding of the functions of DNA methylation is necessary for interpreting changes in this mark that are observed in diseases such as cancer.
Publication
Journal: Proceedings of the National Academy of Sciences of the United States of America
February/29/1976
Abstract
A method has been developed whereby a very large number of colonies of Escherichia coli carrying different hybrid plasmids can be rapidly screened to determine which hybrid plasmids contain a specified DNA sequence or genes. The colonies to be screened are formed on nitrocellulose filters, and, after a reference set of these colonies has been prepared by replica plating, are lysed and their DNA is denatured and fixed to the filter in situ. The resulting DNA-prints of the colonies are then hybridized to a radioactive RNA that defines the sequence or gene of interest, and the result of this hybridization is assayed by autoradiography. Colonies whose DNA-prints exhibit hybridization can then be picked from the reference plate. We have used this method to isolate clones of ColE1 hybrid plasmids that contain Drosophila melanogaster genes for 18S and 28S rRNAs. In principle, the method can be used to isolate any gene whose base sequence is represented in an available RNA.
Publication
Journal: Science
June/28/1994
Abstract
Through the study of transcriptional activation in response to interferon alpha (IFN-alpha) and interferon gamma (IFN-gamma), a previously unrecognized direct signal transduction pathway to the nucleus has been uncovered: IFN-receptor interaction at the cell surface leads to the activation of kinases of the Jak family that then phosphorylate substrate proteins called STATs (signal transducers and activators of transcription). The phosphorylated STAT proteins move to the nucleus, bind specific DNA elements, and direct transcription. Recognition of the molecules involved in the IFN-alpha and IFN-gamma pathway has led to discoveries that a number of STAT family members exist and that other polypeptide ligands also use the Jak-STAT molecules in signal transduction.
Publication
Journal: Nucleic Acids Research
March/15/1999
Abstract
Kyoto Encyclopedia of Genes and Genomes (KEGG) is a knowledge base for systematic analysis of gene functions in terms of the networks of genes and molecules. The major component of KEGG is the PATHWAY database that consists of graphical diagrams of biochemical pathways including most of the known metabolic pathways and some of the known regulatory pathways. The pathway information is also represented by the ortholog group tables summarizing orthologous and paralogous gene groups among different organisms. KEGG maintains the GENES database for the gene catalogs of all organisms with complete genomes and selected organisms with partial genomes, which are continuously re-annotated, as well as the LIGAND database for chemical compounds and enzymes. Each gene catalog is associated with the graphical genome map for chromosomal locations that is represented by a Java applet. In addition to the data collection efforts, KEGG develops and provides various computational tools, such as for reconstructing biochemical pathways from the complete genome sequence and for predicting gene regulatory networks from the gene expression profiles. The KEGG databases are updated daily and made freely available (http://www.genome.ad.jp/kegg/).
Publication
Journal: Nucleic Acids Research
January/19/2004
Abstract
The Gene Ontology (GO) project (http://www.geneontology.org/) provides structured, controlled vocabularies and classifications that cover several domains of molecular and cellular biology and are freely available for community use in the annotation of genes, gene products and sequences. Many model organism databases and genome annotation groups use the GO and contribute their annotation sets to the GO resource. The GO database integrates the vocabularies and contributed annotations and provides full access to this information in several formats. Members of the GO Consortium continually work collectively, involving outside experts as needed, to expand and update the GO vocabularies. The GO Web resource also provides access to extensive documentation about the GO project and links to applications that use GO data for functional analyses.
Publication
Journal: Biotechnology Letters
November/29/2004
Abstract
The stability of standard gene expression is an elementary prerequisite for internal standardisation of target gene expression data, yet many so-called housekeeping genes with assumed stable expression can exhibit either up- or down-regulation under some experimental conditions. The software presented here, called BestKeeper, determines the best suited standards out of ten candidates and combines them into an index. The index can be compared with ten further target genes to decide whether they are differentially expressed under an applied treatment. All data processing is based on crossing points. The BestKeeper software tool was validated on four housekeeping genes and 10 members of the somatotropic axis differentially expressed in bovine corpora lutea total RNA. The BestKeeper application and necessary information about data processing and handling can be downloaded from http://www.wzw.tum.de/gene-quantification/bestkeeper.html.
Publication
Journal: Annual Review of Genetics
May/9/1989
Publication
Journal: Journal of Molecular Biology
June/18/1997
Abstract
We introduce a general probabilistic model of the gene structure of human genomic sequences which incorporates descriptions of the basic transcriptional, translational and splicing signals, as well as length distributions and compositional features of exons, introns and intergenic regions. Distinct sets of model parameters are derived to account for the many substantial differences in gene density and structure observed in distinct C + G compositional regions of the human genome. In addition, new models of the donor and acceptor splice signals are described which capture potentially important dependencies between signal positions. The model is applied to the problem of gene identification in a computer program, GENSCAN, which identifies complete exon/intron structures of genes in genomic DNA. Novel features of the program include the capacity to predict multiple genes in a sequence, to deal with partial as well as complete genes, and to predict consistent sets of genes occurring on either or both DNA strands. GENSCAN is shown to have substantially higher accuracy than existing methods when tested on standardized sets of human and vertebrate genes, with 75 to 80% of exons identified exactly. The program is also capable of indicating fairly accurately the reliability of each predicted exon. Consistently high levels of accuracy are observed for sequences of differing C + G content and for distinct groups of vertebrates.
Publication
Journal: Nucleic Acids Research
May/26/1982
Abstract
Splice junction sequences from a large number of nuclear and viral genes encoding protein have been collected. The sequence (C/A)AG/GT(A/G)AGT was found to be a consensus of 139 exon-intron boundaries (or donor sequences) and (T/C)nN(C/T)AG/G was found to be a consensus of 130 intron-exon boundaries (or acceptor sequences). The possible role of splice junction sequences as signals for processing is discussed.
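The donor consensus can be turned into a simple pattern search. The sketch below assumes the consensus reads (C/A)AG/GT(A/G)AGT, with the bare slash marking the exon-intron boundary and the parenthesized pairs marking alternative bases; that reading is an interpretation of the notation, and the function name is hypothetical. It reports only exact matches, whereas real donor sites frequently deviate from the consensus at one or more positions.

```python
import re

# Exon side: [CA]AG ; intron side: GT[AG]AGT.
# The lookahead keeps matches zero-width so overlapping sites are all found.
DONOR = re.compile(r"(?=[CA]AGGT[AG]AGT)")


def find_donor_sites(seq):
    """Return 0-based indices of the first intronic base (the G of GT)
    for every exact match to the assumed donor consensus."""
    return [m.start() + 3 for m in DONOR.finditer(seq.upper())]


# Toy sequences, invented for illustration.
sites_a = find_donor_sites("TTCAGGTAAGTCC")
sites_b = find_donor_sites("aaggtgagt")
sites_none = find_donor_sites("TTTTTTTTTTTT")
```

A scoring approach such as a position weight matrix would tolerate mismatches; the exact-match pattern is used here only because it maps directly onto the consensus string.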
Publication
Journal: Nucleic Acids Research
February/27/2006
Abstract
Access to unified datasets of protein and genetic interactions is critical for interrogation of gene/protein function and analysis of global network properties. BioGRID is a freely accessible database of physical and genetic interactions available at http://www.thebiogrid.org. BioGRID release version 2.0 includes >116 000 interactions from Saccharomyces cerevisiae, Caenorhabditis elegans, Drosophila melanogaster and Homo sapiens. Over 30 000 interactions have recently been added from 5778 sources through exhaustive curation of the Saccharomyces cerevisiae primary literature. An internally hyper-linked web interface allows for rapid search and retrieval of interaction data. Full or user-defined datasets are freely downloadable as tab-delimited text files and PSI-MI XML. Pre-computed graphical layouts of interactions are available in a variety of file formats. User-customized graphs with embedded protein, gene and interaction attributes can be constructed with a visualization system called Osprey that is dynamically linked to the BioGRID.
Publication
Journal: Nature Reviews Cancer
April/21/2004
Publication
Journal: Nature
November/25/2004
Abstract
The sequence of the human genome encodes the genetic instructions for human physiology, as well as rich information about human evolution. In 2001, the International Human Genome Sequencing Consortium reported a draft sequence of the euchromatic portion of the human genome. Since then, the international collaboration has worked to convert this draft into a genome sequence with high accuracy and nearly complete coverage. Here, we report the result of this finishing process. The current genome sequence (Build 35) contains 2.85 billion nucleotides interrupted by only 341 gaps. It covers approximately 99% of the euchromatic genome and is accurate to an error rate of approximately 1 event per 100,000 bases. Many of the remaining euchromatic gaps are associated with segmental duplications and will require focused work with new methods. The near-complete sequence, the first for a vertebrate, greatly improves the precision of biological analyses of the human genome including studies of gene number, birth and death. Notably, the human genome seems to encode only 20,000-25,000 protein-coding genes. The genome sequence reported here should serve as a firm foundation for biomedical research in the decades ahead.
Publication
Journal: Cell
January/28/1988
Abstract
The protein product of the human Duchenne muscular dystrophy locus (DMD) and its mouse homolog (mDMD) have been identified by using polyclonal antibodies directed against fusion proteins containing two distinct regions of the mDMD cDNA. The DMD protein is shown to be approximately 400 kd and to represent approximately 0.002% of total striated muscle protein. This protein is also detected in smooth muscle (stomach). Muscle tissue isolated from both DMD-affected boys and mdx mice contained no detectable DMD protein, suggesting that these genetic disorders are homologous. Since mdx mice present no obvious clinical abnormalities, the identification of the mdx mouse as an animal model for DMD has important implications with regard to the etiology of the lethal DMD phenotype. We have named the protein dystrophin because of its identification via the isolation of the Duchenne muscular dystrophy locus.
Publication
Journal: Nucleic Acids Research
June/23/2008
Abstract
Functional genomics technologies have been widely adopted in the biological research of both model and non-model species. An efficient functional annotation of DNA or protein sequences is a major requirement for the successful application of these approaches as functional information on gene products is often the key to the interpretation of experimental results. Therefore, there is an increasing need for bioinformatics resources which are able to cope with large amounts of sequence data, produce valuable annotation results and are easily accessible to laboratories where functional genomics projects are being undertaken. We present the Blast2GO suite as an integrated and biologist-oriented solution for the high-throughput and automatic functional annotation of DNA or protein sequences based on the Gene Ontology vocabulary. The most outstanding Blast2GO features are: (i) the combination of various annotation strategies and tools controlling type and intensity of annotation, (ii) the numerous graphical features such as the interactive GO-graph visualization for gene-set function profiling or descriptive charts, (iii) the general sequence management features and (iv) high-throughput capabilities. We used the Blast2GO framework to carry out a detailed analysis of annotation behaviour through homology transfer and its impact in functional genomics research. Our aim is to offer biologists useful information to take into account when addressing the task of functionally characterizing their sequence data.
Publication
Journal: Nucleic Acids Research
April/15/1987
Abstract
A simple, effective measure of synonymous codon usage bias, the Codon Adaptation Index, is detailed. The index uses a reference set of highly expressed genes from a species to assess the relative merits of each codon, and a score for a gene is calculated from the frequency of use of all codons in that gene. The index assesses the extent to which selection has been effective in moulding the pattern of codon usage. In that respect it is useful for predicting the level of expression of a gene, for assessing the adaptation of viral genes to their hosts, and for making comparisons of codon usage in different organisms. The index may also give an approximate indication of the likely success of heterologous gene expression.
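The calculation outlined in this abstract can be sketched as follows: each codon gets a relative adaptiveness value w (its count in the reference set divided by the count of the most used synonymous codon), and a gene's score is the geometric mean of w over its codons. This is a minimal illustration, not the authors' implementation. The function names, the two toy synonymous families and the reference counts are hypothetical, and codons absent from the reference set are simply skipped, which is a simplifying assumption.

```python
import math


def relative_adaptiveness(ref_counts, families):
    """w[codon] = count(codon) / max count within its synonymous family,
    computed from a reference set of highly expressed genes."""
    w = {}
    for codons in families:
        top = max(ref_counts.get(c, 0) for c in codons)
        if top == 0:
            continue  # family never seen in the reference set
        for c in codons:
            n = ref_counts.get(c, 0)
            if n > 0:  # assumption: unseen codons are left unscored
                w[c] = n / top
    return w


def cai(sequence, w):
    """Geometric mean of w over the scorable codons of a coding sequence."""
    usable = len(sequence) - len(sequence) % 3
    codons = [sequence[i:i + 3] for i in range(0, usable, 3)]
    logs = [math.log(w[c]) for c in codons if c in w]
    if not logs:
        raise ValueError("no scorable codons in sequence")
    return math.exp(sum(logs) / len(logs))


# Toy data: two synonymous families with invented reference counts.
families = [("GCT", "GCC"), ("AAA", "AAG")]
ref_counts = {"GCT": 8, "GCC": 2, "AAA": 1, "AAG": 4}
w = relative_adaptiveness(ref_counts, families)

best = cai("GCTAAG", w)    # only the preferred codons
mixed = cai("GCTAAA", w)   # one preferred, one rare codon
```

Working in log space avoids underflow when the product runs over the hundreds of codons of a real gene.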
Publication
Journal: International review of cytology
July/8/1981
Publication
Journal: Nature
May/26/1983
Abstract
In the genome of a germ-line cell, the genetic information for an immunoglobulin polypeptide chain is contained in multiple gene segments scattered along a chromosome. During the development of bone marrow-derived lymphocytes, these gene segments are assembled by recombination which leads to the formation of a complete gene. In addition, mutations are somatically introduced at a high rate into the amino-terminal region. Both somatic recombination and mutation contribute greatly to an increase in the diversity of antibody synthesized by a single organism.
Publication
Journal: Science
December/2/1982
Abstract
Exogenous DNA sequences were introduced into the Drosophila germ line. A rosy transposon (ry1), constructed by inserting a chromosomal DNA fragment containing the wild-type rosy gene into a P transposable element, transformed germ line cells in 20 to 50 percent of the injected rosy mutant embryos. Transformants contained one or two copies of chromosomally integrated, intact ry1 that were stably inherited in subsequent generations. These transformed flies had wild-type eye color indicating that the visible genetic defect in the host strain could be fully and permanently corrected by the transferred gene. To demonstrate the generality of this approach, a DNA segment that does not confer a recognizable phenotype on recipients was also transferred into germ line chromosomes.
Publication
Journal: Nucleic Acids Research
July/12/2016
Abstract
KEGG (http://www.kegg.jp/ or http://www.genome.jp/kegg/) is an integrated database resource for biological interpretation of genome sequences and other high-throughput data. Molecular functions of genes and proteins are associated with ortholog groups and stored in the KEGG Orthology (KO) database. The KEGG pathway maps, BRITE hierarchies and KEGG modules are developed as networks of KO nodes, representing high-level functions of the cell and the organism. Currently, more than 4000 complete genomes are annotated with KOs in the KEGG GENES database, which can be used as a reference data set for KO assignment and subsequent reconstruction of KEGG pathways and other molecular networks. As an annotation resource, the following improvements have been made. First, each KO record is re-examined and associated with protein sequence data used in experiments of functional characterization. Second, the GENES database now includes viruses, plasmids, and the addendum category for functionally characterized proteins that are not represented in complete genomes. Third, new automatic annotation servers, BlastKOALA and GhostKOALA, are made available utilizing the non-redundant pangenome data set generated from the GENES database. As a resource for translational bioinformatics, various data sets are created for antimicrobial resistance and drug interaction networks.
Publication
Journal: Nucleic Acids Research
August/28/2006
Abstract
Unified, structured vocabularies and classifications freely provided by the Gene Ontology (GO) Consortium are widely accepted in most of the large scale gene annotation projects. Consequently, many tools have been created for use with the GO ontologies. WEGO (Web Gene Ontology Annotation Plot) is a simple but useful tool for visualizing, comparing and plotting GO annotation results. Unlike other commercial software for creating charts, WEGO is designed to deal with the directed acyclic graph structure of GO to facilitate histogram creation of GO annotation results. WEGO has been used widely in many important biological research projects, such as the rice genome project and the silkworm genome project. It has become one of the daily tools for downstream gene annotation analysis, especially when performing comparative genomics tasks. WEGO, along with two other tools, namely External to GO Query and GO Archive Query, is freely available for all users at http://wego.genomics.org.cn. There are two available mirror sites at http://wego2.genomics.org.cn and http://wego.genomics.com.cn. Any suggestions are welcome at wego@genomics.org.cn.
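Why the directed acyclic graph structure matters for a histogram can be shown with a small counting sketch. Because a GO term may have several parents, a gene annotated to one term can reach the same ancestor by more than one path and must still be counted once. The code below is an assumed illustration of that idea, not WEGO's actual algorithm; the term identifiers, gene names and function names are all hypothetical.

```python
from collections import defaultdict


def ancestors(term, parents):
    """All terms reachable by following parent links upward in the DAG."""
    seen = set()
    stack = list(parents.get(term, ()))
    while stack:
        t = stack.pop()
        if t not in seen:
            seen.add(t)
            stack.extend(parents.get(t, ()))
    return seen


def term_gene_counts(annotations, parents):
    """Number of distinct genes per term, after propagating each
    annotation to every ancestor. Sets prevent double counting when
    a term is reached along several paths."""
    genes_by_term = defaultdict(set)
    for gene, terms in annotations.items():
        for t in terms:
            for a in ancestors(t, parents) | {t}:
                genes_by_term[a].add(gene)
    return {t: len(genes) for t, genes in genes_by_term.items()}


# Toy DAG: the leaf has two parents that share one root.
parents = {
    "T:leaf": ["T:mid_a", "T:mid_b"],
    "T:mid_a": ["T:root"],
    "T:mid_b": ["T:root"],
}
annotations = {"gene1": ["T:leaf"], "gene2": ["T:mid_a"]}
counts = term_gene_counts(annotations, parents)
```

In this toy graph gene1 reaches the root through both intermediate terms, yet the root count is 2 genes, not 3, which is the behaviour a tree-based tally would get wrong.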