Publications about FII

Publication

Gene Expression Omnibus: NCBI gene expression and hybridization array data repository.

Journal: Nucleic Acids Research

January/20/2002

Abstract

The Gene Expression Omnibus (GEO) project was initiated in response to the growing demand for a public repository for high-throughput gene expression data. GEO provides a flexible and open design that facilitates submission, storage and retrieval of heterogeneous data sets from high-throughput gene expression and genomic hybridization experiments. GEO is not intended to replace in house gene expression databases that benefit from coherent data sets, and which are constructed to facilitate a particular analytic method, but rather complement these by acting as a tertiary, central data distribution hub. The three central data entities of GEO are platforms, samples and series, and were designed with gene expression and genomic hybridization experiments in mind. A platform is, essentially, a list of probes that define what set of molecules may be detected. A sample describes the set of molecules that are being probed and references a single platform used to generate its molecular abundance data. A series organizes samples into the meaningful data sets which make up an experiment. The GEO repository is publicly accessible through the World Wide Web at http://www.ncbi.nlm.nih.gov/geo.

https://www.ncbi.nlm.nih.gov/pmc/articles/PMC99122/bin/gkf06601.jpg

Authors

Ron Edgar; Michael Domrachev; Alex E Lash

Publication

MolProbity: all-atom structure validation for macromolecular crystallography.

Download PDF

Journal: Acta crystallographica. Section D, Biological crystallography

June/23/2010

Abstract

MolProbity is a structure-validation web service that provides broad-spectrum solidly based evaluation of model quality at both the global and local levels for both proteins and nucleic acids. It relies heavily on the power and sensitivity provided by optimized hydrogen placement and all-atom contact analysis, complemented by updated versions of covalent-geometry and torsion-angle criteria. Some of the local corrections can be performed automatically in MolProbity and all of the diagnostics are presented in chart and graphical forms that help guide manual rebuilding. X-ray crystallography provides a wealth of biologically important molecular data in the form of atomic three-dimensional structures of proteins, nucleic acids and increasingly large complexes in multiple forms and states. Advances in automation, in everything from crystallization to data collection to phasing to model building to refinement, have made solving a structure using crystallography easier than ever. However, despite these improvements, local errors that can affect biological interpretation are widespread at low resolution and even high-resolution structures nearly all contain at least a few local errors such as Ramachandran outliers, flipped branched protein side chains and incorrect sugar puckers. It is critical both for the crystallographer and for the end user that there are easy and reliable methods to diagnose and correct these sorts of errors in structures. MolProbity is the authors' contribution to helping solve this problem and this article reviews its general capabilities, reports on recent enhancements and usage, and presents evidence that the resulting improvements are now beneficially affecting the global database.

https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2803126/bin/d-66-00012-fig1.jpg

https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2803126/bin/d-66-00012-fig2.jpg

https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2803126/bin/d-66-00012-fig3.jpg

Authors

Vincent B Chen; W Bryan Arendall; Jeffrey J Headd; Daniel A Keedy; Robert M Immormino; Gary J Kapral+3 authors

Pulse

Views:

2

Posts:

No posts

Rating:

Not rated

Publication

Comparative protein modelling by satisfaction of spatial restraints.

Journal: Journal of Molecular Biology

January/11/1994

Abstract

We describe a comparative protein modelling method designed to find the most probable structure for a sequence given its alignment with related structures. The three-dimensional (3D) model is obtained by optimally satisfying spatial restraints derived from the alignment and expressed as probability density functions (pdfs) for the features restrained. For example, the probabilities for main-chain conformations of a modelled residue may be restrained by its residue type, main-chain conformation of an equivalent residue in a related protein, and the local similarity between the two sequences. Several such pdfs are obtained from the correlations between structural features in 17 families of homologous proteins which have been aligned on the basis of their 3D structures. The pdfs restrain C alpha-C alpha distances, main-chain N-O distances, main-chain and side-chain dihedral angles. A smoothing procedure is used in the derivation of these relationships to minimize the problem of a sparse database. The 3D model of a protein is obtained by optimization of the molecular pdf such that the model violates the input restraints as little as possible. The molecular pdf is derived as a combination of pdfs restraining individual spatial features of the whole molecule. The optimization procedure is a variable target function method that applies the conjugate gradients algorithm to positions of all non-hydrogen atoms. The method is automated and is illustrated by the modelling of trypsin from two other serine proteinases.

Authors

A Sali; T L Blundell

Publication

Cancer statistics, 2008.

Journal: CA - A Cancer Journal for Clinicians

August/28/2008

Abstract

Each year, the American Cancer Society estimates the number of new cancer cases and deaths expected in the United States in the current year and compiles the most recent data on cancer incidence, mortality, and survival based on incidence data from the National Cancer Institute, Centers for Disease Control and Prevention, and the North American Association of Central Cancer Registries and mortality data from the National Center for Health Statistics. Incidence and death rates are age-standardized to the 2000 US standard million population. A total of 1,437,180 new cancer cases and 565,650 deaths from cancer are projected to occur in the United States in 2008. Notable trends in cancer incidence and mortality include stabilization of incidence rates for all cancer sites combined in men from 1995 through 2004 and in women from 1999 through 2004 and a continued decrease in the cancer death rate since 1990 in men and since 1991 in women. Overall cancer death rates in 2004 compared with 1990 in men and 1991 in women decreased by 18.4% and 10.5%, respectively, resulting in the avoidance of over a half million deaths from cancer during this time interval. This report also examines cancer incidence, mortality, and survival by site, sex, race/ethnicity, education, geographic area, and calendar year, as well as the proportionate contribution of selected sites to the overall trends. Although much progress has been made in reducing mortality rates, stabilizing incidence rates, and improving survival, cancer still accounts for more deaths than heart disease in persons under age 85 years. Further progress can be accelerated by supporting new discoveries and by applying existing cancer control knowledge across all segments of the population.

Authors

Ahmedin Jemal; Rebecca Siegel; Elizabeth Ward; Yongping Hao; Jiaquan Xu; Taylor Murray; Michael J Thun

Publication

Obesity: preventing and managing the global epidemic. Report of a WHO consultation.

Journal: Technical Report Series - World Health Organization, Geneva

June/6/2001

Abstract

Overweight and obesity represent a rapidly growing threat to the health of populations in an increasing number of countries. Indeed they are now so common that they are replacing more traditional problems such as undernutrition and infectious diseases as the most significant causes of ill-health. Obesity comorbidities include coronary heart disease, hypertension and stroke, certain types of cancer, non-insulin-dependent diabetes mellitus, gallbladder disease, dyslipidaemia, osteoarthritis and gout, and pulmonary diseases, including sleep apnoea. In addition, the obese suffer from social bias, prejudice and discrimination, on the part not only of the general public but also of health professionals, and this may make them reluctant to seek medical assistance. WHO therefore convened a Consultation on obesity to review current epidemiological information, contributing factors and associated consequences, and this report presents its conclusions and recommendations. In particular, the Consultation considered the system for classifying overweight and obesity based on the body mass index, and concluded that a coherent system is now available and should be adopted internationally. The Consultation also concluded that the fundamental causes of the obesity epidemic are sedentary lifestyles and high-fat energy-dense diets, both resulting from the profound changes taking place in society and the behavioural patterns of communities as a consequence of increased urbanization and industrialization and the disappearance of traditional lifestyles. A reduction in fat intake to around 20-25% of energy is necessary to minimize energy imbalance and weight gain in sedentary individuals. While there is strong evidence that certain genes have an influence on body mass and body fat, most do not qualify as necessary genes, i.e. genes that cause obesity whenever two copies of the defective allele are present; it is likely to be many years before the results of genetic research can be applied to the problem. Methods for the treatment of obesity are described, including dietary management, physical activity and exercise, and antiobesity drugs, with gastrointestinal surgery being reserved for extreme cases.

Pulse

Views:

4

Posts:

No posts

Rating:

Not rated

Publication

Overview of the CCP4 suite and current developments.

Download PDF

Journal: Acta crystallographica. Section D, Biological crystallography

June/26/2011

Abstract

The CCP4 (Collaborative Computational Project, Number 4) software suite is a collection of programs and associated data and software libraries which can be used for macromolecular structure determination by X-ray crystallography. The suite is designed to be flexible, allowing users a number of methods of achieving their aims. The programs are from a wide variety of sources but are connected by a common infrastructure provided by standard file formats, data objects and graphical interfaces. Structure solution by macromolecular crystallography is becoming increasingly automated and the CCP4 suite includes several automation pipelines. After giving a brief description of the evolution of CCP4 over the last 30 years, an overview of the current suite is given. While detailed descriptions are given in the accompanying articles, here it is shown how the individual programs contribute to a complete software package.

https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3069738/bin/d-67-00235-fig1a.jpg

https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3069738/bin/d-67-00235-fig1b.jpg

https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3069738/bin/d-67-00235-fig2.jpg

Authors

Martyn D Winn; Charles C Ballard; Kevin D Cowtan; Eleanor J Dodson; Paul Emsley; Phil R Evans+12 authors

Pulse

Views:

1

Posts:

No posts

Rating:

Not rated

Publication

Minimal criteria for defining multipotent mesenchymal stromal cells. The International Society for Cellular Therapy position statement.

Journal: Cytotherapy

May/3/2007

Abstract

The considerable therapeutic potential of human multipotent mesenchymal stromal cells (MSC) has generated markedly increasing interest in a wide variety of biomedical disciplines. However, investigators report studies of MSC using different methods of isolation and expansion, and different approaches to characterizing the cells. Thus it is increasingly difficult to compare and contrast study outcomes, which hinders progress in the field. To begin to address this issue, the Mesenchymal and Tissue Stem Cell Committee of the International Society for Cellular Therapy proposes minimal criteria to define human MSC. First, MSC must be plastic-adherent when maintained in standard culture conditions. Second, MSC must express CD105, CD73 and CD90, and lack expression of CD45, CD34, CD14 or CD11b, CD79alpha or CD19 and HLA-DR surface molecules. Third, MSC must differentiate to osteoblasts, adipocytes and chondroblasts in vitro. While these criteria will probably require modification as new knowledge unfolds, we believe this minimal set of standard criteria will foster a more uniform characterization of MSC and facilitate the exchange of data among investigators.

Authors

M Dominici; K Le Blanc; I Mueller; I Slaper-Cortenbach; Fc Marini; Ds Krause+4 authors

Publication

Principal components analysis corrects for stratification in genome-wide association studies.

Journal: Nature Genetics

September/10/2006

Abstract

Population stratification--allele frequency differences between cases and controls due to systematic ancestry differences-can cause spurious associations in disease studies. We describe a method that enables explicit detection and correction of population stratification on a genome-wide scale. Our method uses principal components analysis to explicitly model ancestry differences between cases and controls. The resulting correction is specific to a candidate marker's variation in frequency across ancestral populations, minimizing spurious associations while maximizing power to detect true associations. Our simple, efficient approach can easily be applied to disease studies with hundreds of thousands of markers.

Authors

Alkes L Price; Nick J Patterson; Robert M Plenge; Michael E Weinblatt; Nancy A Shadick; David Reich

Publication

Meta-analysis of observational studies in epidemiology: a proposal for reporting. Meta-analysis Of Observational Studies in Epidemiology (MOOSE) group.

Journal: JAMA - Journal of the American Medical Association

May/3/2000

Abstract

OBJECTIVE

Because of the pressure for timely, informed decisions in public health and clinical practice and the explosion of information in the scientific literature, research results must be synthesized. Meta-analyses are increasingly used to address this problem, and they often evaluate observational studies. A workshop was held in Atlanta, Ga, in April 1997, to examine the reporting of meta-analyses of observational studies and to make recommendations to aid authors, reviewers, editors, and readers.

METHODS

Twenty-seven participants were selected by a steering committee, based on expertise in clinical practice, trials, statistics, epidemiology, social sciences, and biomedical editing. Deliberations of the workshop were open to other interested scientists. Funding for this activity was provided by the Centers for Disease Control and Prevention.

METHODS

We conducted a systematic review of the published literature on the conduct and reporting of meta-analyses in observational studies using MEDLINE, Educational Research Information Center (ERIC), PsycLIT, and the Current Index to Statistics. We also examined reference lists of the 32 studies retrieved and contacted experts in the field. Participants were assigned to small-group discussions on the subjects of bias, searching and abstracting, heterogeneity, study categorization, and statistical methods.

METHODS

From the material presented at the workshop, the authors developed a checklist summarizing recommendations for reporting meta-analyses of observational studies. The checklist and supporting evidence were circulated to all conference attendees and additional experts. All suggestions for revisions were addressed.

CONCLUSIONS

The proposed checklist contains specifications for reporting of meta-analyses of observational studies in epidemiology, including background, search strategy, methods, results, discussion, and conclusion. Use of the checklist should improve the usefulness of meta-analyses for authors, reviewers, editors, readers, and decision makers. An evaluation plan is suggested and research areas are explored.

Authors

D F Stroup; J A Berlin; S C Morton; I Olkin; G D Williamson; D Rennie+4 authors

Publication

Genome-wide association study of 14,000 cases of seven common diseases and 3,000 shared controls.

Download PDF

Journal: Nature

June/28/2007

Abstract

There is increasing evidence that genome-wide association (GWA) studies represent a powerful approach to the identification of genes involved in common human diseases. We describe a joint GWA study (using the Affymetrix GeneChip 500K Mapping Array Set) undertaken in the British population, which has examined approximately 2,000 individuals for each of 7 major diseases and a shared set of approximately 3,000 controls. Case-control comparisons identified 24 independent association signals at P < 5 x 10(-7): 1 in bipolar disorder, 1 in coronary artery disease, 9 in Crohn's disease, 3 in rheumatoid arthritis, 7 in type 1 diabetes and 3 in type 2 diabetes. On the basis of prior findings and replication studies thus-far completed, almost all of these signals reflect genuine susceptibility effects. We observed association at many previously identified loci, and found compelling evidence that some loci confer risk for more than one of the diseases studied. Across all diseases, we identified a large number of further signals (including 58 loci with single-point P values between 10(-5) and 5 x 10(-7)) likely to yield additional susceptibility loci. The importance of appropriately large samples was confirmed by the modest effect sizes observed at most loci identified. This study thus represents a thorough validation of the GWA approach. It has also demonstrated that careful use of a shared control group represents a safe and effective approach to GWA analyses of multiple disease phenotypes; has generated a genome-wide genotype database for future studies of common diseases in the British population; and shown that, provided individuals with non-European ancestry are excluded, the extent of population stratification in the British population is generally modest. Our findings offer new avenues for exploring the pathophysiology of these important disorders. We anticipate that our data, results and software, which will be widely available to other investigators, will provide a powerful resource for human genetics research.

https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2719288/bin/ukmss-4894-f0001.jpg

https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2719288/bin/ukmss-4894-f0002.jpg

https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2719288/bin/ukmss-4894-f0007.jpg

Authors

Wellcome Trust Case Control Consortium

Pulse

Views:

3

Posts:

No posts

Rating:

Not rated

Publication

Improved survival with ipilimumab in patients with metastatic melanoma.

Download PDF

Journal: New England Journal of Medicine

September/14/2010

Abstract

BACKGROUND

An improvement in overall survival among patients with metastatic melanoma has been an elusive goal. In this phase 3 study, ipilimumab--which blocks cytotoxic T-lymphocyte-associated antigen 4 to potentiate an antitumor T-cell response--administered with or without a glycoprotein 100 (gp100) peptide vaccine was compared with gp100 alone in patients with previously treated metastatic melanoma.

METHODS

A total of 676 HLA-A*0201-positive patients with unresectable stage III or IV melanoma, whose disease had progressed while they were receiving therapy for metastatic disease, were randomly assigned, in a 3:1:1 ratio, to receive ipilimumab plus gp100 (403 patients), ipilimumab alone (137), or gp100 alone (136). Ipilimumab, at a dose of 3 mg per kilogram of body weight, was administered with or without gp100 every 3 weeks for up to four treatments (induction). Eligible patients could receive reinduction therapy. The primary end point was overall survival.

RESULTS

The median overall survival was 10.0 months among patients receiving ipilimumab plus gp100, as compared with 6.4 months among patients receiving gp100 alone (hazard ratio for death, 0.68; P<0.001). The median overall survival with ipilimumab alone was 10.1 months (hazard ratio for death in the comparison with gp100 alone, 0.66; P=0.003). No difference in overall survival was detected between the ipilimumab groups (hazard ratio with ipilimumab plus gp100, 1.04; P=0.76). Grade 3 or 4 immune-related adverse events occurred in 10 to 15% of patients treated with ipilimumab and in 3% treated with gp100 alone. There were 14 deaths related to the study drugs (2.1%), and 7 were associated with immune-related adverse events.

CONCLUSIONS

Ipilimumab, with or without a gp100 peptide vaccine, as compared with gp100 alone, improved overall survival in patients with previously treated metastatic melanoma. Adverse events can be severe, long-lasting, or both, but most are reversible with appropriate treatment. (Funded by Medarex and Bristol-Myers Squibb; ClinicalTrials.gov number, NCT00094653.)

https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3549297/bin/nihms-431916-f0001.jpg

https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3549297/bin/nihms-431916-f0002.jpg

Authors

F Stephen Hodi; Steven J O'Day; David F McDermott; Robert W Weber; Jeffrey A Sosman; John B Haanen+23 authors

Pulse

Views:

1

Posts:

No posts

Rating:

Not rated

Publication

Naive Bayesian classifier for rapid assignment of rRNA sequences into the new bacterial taxonomy.

Download PDF

Journal: Applied and Environmental Microbiology

October/18/2007

Abstract

The Ribosomal Database Project (RDP) Classifier, a naïve Bayesian classifier, can rapidly and accurately classify bacterial 16S rRNA sequences into the new higher-order taxonomy proposed in Bergey's Taxonomic Outline of the Prokaryotes (2nd ed., release 5.0, Springer-Verlag, New York, NY, 2004). It provides taxonomic assignments from domain to genus, with confidence estimates for each assignment. The majority of classifications (98%) were of high estimated confidence >> or = 95%) and high accuracy (98%). In addition to being tested with the corpus of 5,014 type strain sequences from Bergey's outline, the RDP Classifier was tested with a corpus of 23,095 rRNA sequences as assigned by the NCBI into their alternative higher-order taxonomy. The results from leave-one-out testing on both corpora show that the overall accuracies at all levels of confidence for near-full-length and 400-base segments were 89% or above down to the genus level, and the majority of the classification errors appear to be due to anomalies in the current taxonomies. For shorter rRNA segments, such as those that might be generated by pyrosequencing, the error rate varied greatly over the length of the 16S rRNA gene, with segments around the V2 and V4 variable regions giving the lowest error rates. The RDP Classifier is suitable both for the analysis of single rRNA sequences and for the analysis of libraries of thousands of sequences. Another related tool, RDP Library Compare, was developed to facilitate microbial-community comparison based on 16S rRNA gene sequence libraries. It combines the RDP Classifier with a statistical test to flag taxa differentially represented between samples. The RDP Classifier and RDP Library Compare are available online at http://rdp.cme.msu.edu/.

https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1950982/bin/zam0160780800001.jpg

https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1950982/bin/zam0160780800002.jpg

https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1950982/bin/zam0160780800003.jpg

Authors

Qiong Wang; George M Garrity; James M Tiedje; James R Cole

Pulse

Views:

3

Posts:

No posts

Rating:

Not rated

Publication

Statistical significance for genomewide studies.

Download PDF

Journal: Proceedings of the National Academy of Sciences of the United States of America

October/2/2003

Abstract

With the increase in genomewide experiments and the sequencing of multiple genomes, the analysis of large data sets has become commonplace in biology. It is often the case that thousands of features in a genomewide data set are tested against some null hypothesis, where a number of features are expected to be significant. Here we propose an approach to measuring statistical significance in these genomewide studies based on the concept of the false discovery rate. This approach offers a sensible balance between the number of true and false positives that is automatically calibrated and easily interpreted. In doing so, a measure of statistical significance called the q value is associated with each tested feature. The q value is similar to the well known p value, except it is a measure of significance in terms of the false discovery rate rather than the false positive rate. Our approach avoids a flood of false positive results, while offering a more liberal criterion than what has been used in genome scans for linkage.

https://www.ncbi.nlm.nih.gov/pmc/articles/PMC170937/bin/pq1530509001.jpg

https://www.ncbi.nlm.nih.gov/pmc/articles/PMC170937/bin/pq1530509002.jpg

https://www.ncbi.nlm.nih.gov/pmc/articles/PMC170937/bin/pq1530509003.jpg

Authors

John D Storey; Robert Tibshirani

Publication

Cancer statistics in China, 2015.

Journal: CA - A Cancer Journal for Clinicians

July/19/2016

Abstract

With increasing incidence and mortality, cancer is the leading cause of death in China and is a major public health problem. Because of China's massive population (1.37 billion), previous national incidence and mortality estimates have been limited to small samples of the population using data from the 1990s or based on a specific year. With high-quality data from an additional number of population-based registries now available through the National Central Cancer Registry of China, the authors analyzed data from 72 local, population-based cancer registries (2009-2011), representing 6.5% of the population, to estimate the number of new cases and cancer deaths for 2015. Data from 22 registries were used for trend analyses (2000-2011). The results indicated that an estimated 4292,000 new cancer cases and 2814,000 cancer deaths would occur in China in 2015, with lung cancer being the most common incident cancer and the leading cause of cancer death. Stomach, esophageal, and liver cancers were also commonly diagnosed and were identified as leading causes of cancer death. Residents of rural areas had significantly higher age-standardized (Segi population) incidence and mortality rates for all cancers combined than urban residents (213.6 per 100,000 vs 191.5 per 100,000 for incidence; 149.0 per 100,000 vs 109.5 per 100,000 for mortality, respectively). For all cancers combined, the incidence rates were stable during 2000 through 2011 for males (+0.2% per year; P = .1), whereas they increased significantly (+2.2% per year; P < .05) among females. In contrast, the mortality rates since 2006 have decreased significantly for both males (-1.4% per year; P < .05) and females (-1.1% per year; P < .05). Many of the estimated cancer cases and deaths can be prevented through reducing the prevalence of risk factors, while increasing the effectiveness of clinical care delivery, particularly for those living in rural areas and in disadvantaged populations.

Authors

Wanqing Chen; Rongshou Zheng; Peter D Baade; Siwei Zhang; Hongmei Zeng; Freddie Bray+3 authors

Publication

Operating characteristics of a rank correlation test for publication bias.

Journal: Biometrics

July/23/1995

Abstract

An adjusted rank correlation test is proposed as a technique for identifying publication bias in a meta-analysis, and its operating characteristics are evaluated via simulations. The test statistic is a direct statistical analogue of the popular "funnel-graph." The number of component studies in the meta-analysis, the nature of the selection mechanism, the range of variances of the effect size estimates, and the true underlying effect size are all observed to be influential in determining the power of the test. The test is fairly powerful for large meta-analyses with 75 component studies, but has only moderate power for meta-analyses with 25 component studies. However, in many of the configurations in which there is low power, there is also relatively little bias in the summary effect size estimate. Nonetheless, the test must be interpreted with caution in small meta-analyses. In particular, bias cannot be ruled out if the test is not significant. The proposed technique has potential utility as an exploratory tool for meta-analysts, as a formal procedure to complement the funnel-graph.

Authors

C B Begg; M Mazumdar

Pulse

Views:

2

Posts:

No posts

Rating:

Not rated

Publication

PAML 4: phylogenetic analysis by maximum likelihood.

Journal: Molecular Biology and Evolution

October/28/2007

Abstract

PAML, currently in version 4, is a package of programs for phylogenetic analyses of DNA and protein sequences using maximum likelihood (ML). The programs may be used to compare and test phylogenetic trees, but their main strengths lie in the rich repertoire of evolutionary models implemented, which can be used to estimate parameters in models of sequence evolution and to test interesting biological hypotheses. Uses of the programs include estimation of synonymous and nonsynonymous rates (d(N) and d(S)) between two protein-coding DNA sequences, inference of positive Darwinian selection through phylogenetic comparison of protein-coding genes, reconstruction of ancestral genes and proteins for molecular restoration studies of extinct life forms, combined analysis of heterogeneous data sets from multiple gene loci, and estimation of species divergence times incorporating uncertainties in fossil calibrations. This note discusses some of the major applications of the package, which includes example data sets to demonstrate their use. The package is written in ANSI C, and runs under Windows, Mac OSX, and UNIX systems. It is available at -- (http://abacus.gene.ucl.ac.uk/software/paml.html).

Authors

Ziheng Yang

Publication

Development and validation of brief measures of positive and negative affect: the PANAS scales.

Journal: Journal of Personality and Social Psychology

August/31/1988

Abstract

In recent studies of the structure of affect, positive and negative affect have consistently emerged as two dominant and relatively independent dimensions. A number of mood scales have been created to measure these factors; however, many existing measures are inadequate, showing low reliability or poor convergent or discriminant validity. To fill the need for reliable and valid Positive Affect and Negative Affect scales that are also brief and easy to administer, we developed two 10-item mood scales that comprise the Positive and Negative Affect Schedule (PANAS). The scales are shown to be highly internally consistent, largely uncorrelated, and stable at appropriate levels over a 2-month time period. Normative data and factorial and external evidence of convergent and discriminant validity for the scales are also presented.

Authors

D Watson; L A Clark; A Tellegen

Pulse

Views:

1

Posts:

No posts

Rating:

Not rated

Publication

Embryonic stem cell lines derived from human blastocysts.

Journal: Science

November/24/1998

Abstract

Human blastocyst-derived, pluripotent cell lines are described that have normal karyotypes, express high levels of telomerase activity, and express cell surface markers that characterize primate embryonic stem cells but do not characterize other early lineages. After undifferentiated proliferation in vitro for 4 to 5 months, these cells still maintained the developmental potential to form trophoblast and derivatives of all three embryonic germ layers, including gut epithelium (endoderm); cartilage, bone, smooth muscle, and striated muscle (mesoderm); and neural epithelium, embryonic ganglia, and stratified squamous epithelium (ectoderm). These cell lines should be useful in human developmental biology, drug discovery, and transplantation medicine.

Authors

J A Thomson; J Itskovitz-Eldor; S S Shapiro; M A Waknitz; J J Swiergiel; V S Marshall; J M Jones

Pulse

Views:

2

Posts:

No posts

Rating:

Not rated

Publication

A map of human genome variation from population-scale sequencing.

Download PDF

Journal: Nature

December/13/2010

Abstract

The 1000 Genomes Project aims to provide a deep characterization of human genome sequence variation as a foundation for investigating the relationship between genotype and phenotype. Here we present results of the pilot phase of the project, designed to develop and compare different strategies for genome-wide sequencing with high-throughput platforms. We undertook three projects: low-coverage whole-genome sequencing of 179 individuals from four populations; high-coverage sequencing of two mother-father-child trios; and exon-targeted sequencing of 697 individuals from seven populations. We describe the location, allele frequency and local haplotype structure of approximately 15 million single nucleotide polymorphisms, 1 million short insertions and deletions, and 20,000 structural variants, most of which were previously undescribed. We show that, because we have catalogued the vast majority of common variation, over 95% of the currently accessible variants found in any individual are present in this data set. On average, each person is found to carry approximately 250 to 300 loss-of-function variants in annotated genes and 50 to 100 variants previously implicated in inherited disorders. We demonstrate how these results can be used to inform association and functional studies. From the two trios, we directly estimate the rate of de novo germline base substitution mutations to be approximately 10(-8) per base pair per generation. We explore the data with regard to signatures of natural selection, and identify a marked reduction of genetic variation in the neighbourhood of genes, due to selection at linked sites. These methods and public data will support the next phase of human genetic research.

https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3042601/bin/ukmss-34220-f0001.jpg

https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3042601/bin/ukmss-34220-f0003.jpg

https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3042601/bin/ukmss-34220-f0004.jpg

Authors

1000 Genomes Project Consortium; Gonçalo R Abecasis; David Altshuler; Adam Auton; Lisa D Brooks; Richard M Durbin+3 authors

Pulse

Views:

1

Posts:

No posts

Rating:

Not rated

Publication

Chromatin modifications and their function.

Journal: Cell

March/29/2007

Abstract

The surface of nucleosomes is studded with a multiplicity of modifications. At least eight different classes have been characterized to date and many different sites have been identified for each class. Operationally, modifications function either by disrupting chromatin contacts or by affecting the recruitment of nonhistone proteins to chromatin. Their presence on histones can dictate the higher-order chromatin structure in which DNA is packaged and can orchestrate the ordered recruitment of enzyme complexes to manipulate DNA. In this way, histone modifications have the potential to influence many fundamental biological processes, some of which may be epigenetically inherited.

Authors

Tony Kouzarides

Pulse

Views:

1

Posts:

No posts

Rating:

Not rated

Publication

Prospective identification of tumorigenic breast cancer cells.

Download PDF

Journal: Proceedings of the National Academy of Sciences of the United States of America

May/21/2003

Abstract

Breast cancer is the most common malignancy in United States women, accounting for >40,000 deaths each year. These breast tumors are comprised of phenotypically diverse populations of breast cancer cells. Using a model in which human breast cancer cells were grown in immunocompromised mice, we found that only a minority of breast cancer cells had the ability to form new tumors. We were able to distinguish the tumorigenic (tumor initiating) from the nontumorigenic cancer cells based on cell surface marker expression. We prospectively identified and isolated the tumorigenic cells as CD44(+)CD24(-/low)Lineage(-) in eight of nine patients. As few as 100 cells with this phenotype were able to form tumors in mice, whereas tens of thousands of cells with alternate phenotypes failed to form tumors. The tumorigenic subpopulation could be serially passaged: each time cells within this population generated new tumors containing additional CD44(+)CD24(-/low)Lineage(-) tumorigenic cells as well as the phenotypically diverse mixed populations of nontumorigenic cells present in the initial tumor. The ability to prospectively identify tumorigenic cancer cells will facilitate the elucidation of pathways that regulate their growth and survival. Furthermore, because these cells drive tumor development, strategies designed to target this population may lead to more effective therapies.

https://www.ncbi.nlm.nih.gov/pmc/articles/PMC153034/bin/pq0530291001.jpg

https://www.ncbi.nlm.nih.gov/pmc/articles/PMC153034/bin/pq0530291002.jpg

https://www.ncbi.nlm.nih.gov/pmc/articles/PMC153034/bin/pq0530291003.jpg

Authors

Muhammad Al-Hajj; Max S Wicha; Adalberto Benito-Hernandez; Sean J Morrison; Michael F Clarke

Pulse

Views:

3

Posts:

No posts

Rating:

Not rated

Publication

Comparing the areas under two or more correlated receiver operating characteristic curves: a nonparametric approach.

Journal: Biometrics

February/2/1989

Abstract

Methods of evaluating and comparing the performance of diagnostic tests are of increasing importance as new tests are developed and marketed. When a test is based on an observed variable that lies on a continuous or graded scale, an assessment of the overall value of the test can be made through the use of a receiver operating characteristic (ROC) curve. The curve is constructed by varying the cutpoint used to determine which values of the observed variable will be considered abnormal and then plotting the resulting sensitivities against the corresponding false positive rates. When two or more empirical curves are constructed based on tests performed on the same individuals, statistical analysis on differences between curves must take into account the correlated nature of the data. This paper presents a nonparametric approach to the analysis of areas under correlated ROC curves, by using the theory on generalized U-statistics to generate an estimated covariance matrix.

Authors

E R DeLong; D M DeLong; D L Clarke-Pearson

Publication

Induced pluripotent stem cell lines derived from human somatic cells.

Journal: Science

January/7/2008

Abstract

Somatic cell nuclear transfer allows trans-acting factors present in the mammalian oocyte to reprogram somatic cell nuclei to an undifferentiated state. We show that four factors (OCT4, SOX2, NANOG, and LIN28) are sufficient to reprogram human somatic cells to pluripotent stem cells that exhibit the essential characteristics of embryonic stem (ES) cells. These induced pluripotent human stem cells have normal karyotypes, express telomerase activity, express cell surface markers and genes that characterize human ES cells, and maintain the developmental potential to differentiate into advanced derivatives of all three primary germ layers. Such induced pluripotent human cell lines should be useful in the production of new disease models and in drug development, as well as for applications in transplantation medicine, once technical limitations (for example, mutation through viral integration) are eliminated.

Authors

Junying Yu; Maxim A Vodyanik; Kim Smuga-Otto; Jessica Antosiewicz-Bourget; Jennifer L Frane; Shulan Tian+6 authors

Pulse

Views:

1

Posts:

No posts

Rating:

Not rated

Publication

A comparison of normalization methods for high density oligonucleotide array data based on variance and bias.

Journal: Bioinformatics

August/26/2003

Abstract

BACKGROUND

When running experiments that involve multiple high density oligonucleotide arrays, it is important to remove sources of variation between arrays of non-biological origin. Normalization is a process for reducing this variation. It is common to see non-linear relations between arrays and the standard normalization provided by Affymetrix does not perform well in these situations.

RESULTS

We present three methods of performing normalization at the probe intensity level. These methods are called complete data methods because they make use of data from all arrays in an experiment to form the normalizing relation. These algorithms are compared to two methods that make use of a baseline array: a one number scaling based algorithm and a method that uses a non-linear normalizing relation by comparing the variability and bias of an expression measure. Two publicly available datasets are used to carry out the comparisons. The simplest and quickest complete data method is found to perform favorably.

BACKGROUND

Software implementing all three of the complete data normalization methods is available as part of the R package Affy, which is a part of the Bioconductor project http://www.bioconductor.org.

BACKGROUND

Additional figures may be found at http://www.stat.berkeley.edu/~bolstad/normalize/index.html

Authors

B M Bolstad; R A Irizarry; M Astrand; T P Speed