PMID:15608167

From EcoliWiki
Jump to: navigation, search
Citation

Bairoch, A, Apweiler, R, Wu, CH, Barker, WC, Boeckmann, B, Ferro, S, Gasteiger, E, Huang, H, Lopez, R, Magrane, M, Martin, MJ, Natale, DA, O'Donovan, C, Redaschi, N and Yeh, LS (2005) The Universal Protein Resource (UniProt). Nucleic Acids Res. 33:D154-9

Abstract

The Universal Protein Resource (UniProt) provides the scientific community with a single, centralized, authoritative resource for protein sequences and functional information. Formed by uniting the Swiss-Prot, TrEMBL and PIR protein database activities, the UniProt consortium produces three layers of protein sequence databases: the UniProt Archive (UniParc), the UniProt Knowledgebase (UniProt) and the UniProt Reference (UniRef) databases. The UniProt Knowledgebase is a comprehensive, fully classified, richly and accurately annotated protein sequence knowledgebase with extensive cross-references. This centrepiece consists of two sections: UniProt/Swiss-Prot, with fully, manually curated entries; and UniProt/TrEMBL, enriched with automated classification and annotation. During 2004, tens of thousands of Knowledgebase records got manually annotated or updated; we introduced a new comment line topic: TOXIC DOSE to store information on the acute toxicity of a toxin; the UniProt keyword list got augmented by additional keywords; we improved the documentation of the keywords and are continuously overhauling and standardizing the annotation of post-translational modifications. Furthermore, we introduced a new documentation file of the strains and their synonyms. Many new database cross-references were introduced and we started to make use of Digital Object Identifiers. We also achieved in collaboration with the Macromolecular Structure Database group at EBI an improved integration with structural databases by residue level mapping of sequences from the Protein Data Bank entries onto corresponding UniProt entries. For convenient sequence searches we provide the UniRef non-redundant sequence databases. The comprehensive UniParc database stores the complete body of publicly available protein sequence data. The UniProt databases can be accessed online (http://www.uniprot.org) or downloaded in several formats (ftp://ftp.uniprot.org/pub). New releases are published every two weeks.

Links

PubMed PMC540024 Online version:10.1093/nar/gki070

Keywords

Amino Acid Sequence; Databases, Protein; Proteins/chemistry; Proteins/physiology; Systems Integration; User-Computer Interface

Significance

You can help EcoliWiki by summarizing why this paper is useful

Useful Materials and Methods

You can help Ecoliwiki by describing the useful materials (strains, plasmids, antibodies, etc) described in this paper.

Annotations

<protect><annotationlinks/></protect>

EcoliWiki Links

Add links to pages that link here (e.g. gene, product, method pages)

See also

References

See Help:References for how to manage references in EcoliWiki.