[German]  [Feedback]


The Bioinformatics Educational Resource"  info

This site provides short and concise introductions to basic concepts in molecular and cell biology and bioinformatics. The main emphasis is placed on making it as easy as possible for the user to understand which tools and databases are available from the EBI and from sites belonging to its collaborators.

3D-GENOMICS"  info

The 3D-GENOMICS database provides structural annotations for proteins from sequenced genomes.

5S Ribosomal RNA

5S ribosomal RNA is an integral component of the large subunit of all cytoplasmic and most organellar ribosomes. Its small size and association with ribosomal as well as non-ribosomal proteins made it an ideal model RNA molecule for studies of RNA structure and RNA-protein interactions.
The purpose of this database is to provide information on nucleotide sequences of 5S rRNAs and their genes. The sequences for particular organisms can be retrieved as single files using a taxonomic browser or in multiple sequence structural alignments.

Amino Acid Indices And Similarity Matrices

Database about physicochemical properties of amino acids.

Aminoacyl-tRNA Synthetase Database"  info

Aminoacyl-tRNA synthetases (AARSs) are the key components of the protein biosynthesis machinery. They are responsible for maintaining the fidelity of transfer of genetic information from DNA into protein. The database is a compilation of amino acid sequences of all aminoacyl-tRNA synthetases known to date.

Automatic Domain Decomposition Algorithm"  info

A Automatic Domain Decomposition Algorithm (ADDA) was used to generate a database of protein domain families with complete coverage of all protein sequences. Sequences are split into domains and domains are grouped into protein domain families in a completely automated process.
The current database contains domains for more than 1.5 million sequences in more than 40,000 domain families. In particular, there are 3828 novel domain families that do not overlap with the curated domain databases Pfam, SCOP and InterPro.

Adipocyte Proteome Database"  info

Database of in-depth analysis of the adipocyte proteome by mass spectrometry and bioinformatics.

Automatic Generated Test-Sets Database for Protein-Protein Docking

Automatic Generated Test-Sets Database for Protein-Protein Docking (AGT-SDP) is a database providing test cases for protein-protein docking.


Database about genes, transcription and proteins from human and mouse.

Antimicrobial Peptide Database

Antimicrobial peptides exist widely from bacteria to mammals. They are encoded by the genome and produced through regular processes of gene transcription. They have an effect on bacteria, fungi, viruses, and even cancer cells. They can interact directly with biological membranes without requirement for specific receptors on the membranes.

ArchType DataBase

ArchDB contains an automated classification of protein loop structure.

Gene Regulation by Mammalian micro-RNAs

MicroRNAs (miRNAs) constitute a class of small non-coding RNAs that regulate expression of target genes either by decreasing the stability of the target mRNA or by translational inhibition. They are involved in diverse processes, including cellular differentiation, proliferation and apoptosis. Recent evidence also suggests their importance for cancerogenesis. By far the most important model systems in cancer research are mammalian organisms. Argonaute compiles comprehensive information on mammalian miRNAs, their origin and regulated target genes in an exhaustive database.

ArrayExpress"  info

ArrayExpress is a public database of gene expression experiments. These experiments consist of multiple parts (or objects), in order to simplify querying we have provided a query interface which allows you to retrieve the most useful information relating to each experiment, the array(s) and the protocols(s) used.

Active Sequences Collection

ASC is a collection of active protein sequences, or protein fragments or subsequences, collected in the form of function-oriented databases.

Artificial Selected Proteins / Peptides Database"  info

ASPD is a database that incorporates data on full-length proteins, protein domains and peptides that were obtained through in vitro directed evolution processes (mainly by means of phage display).

Alternative Translational Initiation Database"  info

ATID is a database that provides a comprehensive collection of alternative translational initiation events. The purpose of this alternative translational initiation database (ATID) is to facilitate the systematic study of alternative translational initiation of genes.

Xenopus laevis Database Focusing on Gene Expression"  info

Axeldb is a database storing and integrating gene expression patterns and DNA sequences identified in a large-scale in situ hybridization study in Xenopus laevis embryos. The database was developed at the German Cancer Research Center (Deutsches Krebsforschungszentrum, DKFZ) in Heidelberg.

Benchmark Alignment Database"  info

BAliBASE is a database for the comparison of multiple sequence alignments.

Binding MOAD"  info

Binding MOAD is a subset of the Protein Data Bank (PDB), containing every high-quality example of ligand-protein binding.

BioMagResBank"  info

BMRB is a repository for data from NMR spectroscopy on proteins, peptides, and nucleic acids.

Body Fluid Database

Body fluids are one of the important targets for proteomics. Characterization of the protein composition of bodilyfluids could lead us to better understand human physiology. If a quantitative aspect is added, proteomics could contribute to the discovery of novel biomarkers.

BRAGI"  info

BRAGI is an interactive protein modeling program. It was developed for the special purpose to model unknown proteins from the structure of a known one. BRAGI enables you to view and explore the three-dimensional (3D) structure of any macromolecule. One can explore proteins, DNA, RNA, carbohydrates, and complexes, such as between transcriptional regulatory proteins and DNA, or enzymes and drugs. BRAGI includes also a set of programs and utility functions.

Conformation Angles Database"  info

Conformation Angles DataBase (CADB) provides an online resource to access data on conformation angles (both main-chain and side-chain) of protein structures in two data sets corresponding to 25% and 90% sequence identity between any two proteins, available in the Protein Data Bank.

Protein Structure Classification"  info

CATH is a novel hierarchical classification of protein domain structures, which clusters proteins at four major levels, Class(C), Architecture(A), Topology(T) and Homologous superfamily (H).

Consensus CDS (CCDS)

The Consensus CDS (CCDS) project is a collaborative effort to identify a core set of human protein coding regions that are consistently annotated and of high quality.

Conserved Domain Database (CDD)"  info

Proteins often contain several modules or domains, each with a distinct evolutionary origin and function. The CD-Search service may be used to identify the conserved domains present in a protein sequence.

3D Protein Structure Comparison and Alignment"  info

The CE-Database provides databases and tools for 3D protein structure comparison and alignment using the Combinatorial Extension (CE) Method.

CleanEx"  info

CleanEx is a database which provides access to public gene expression data via unique approved gene symbols and which represents heterogeneous expression data produced by different technologies in a way that facilitates joint analysis and cross-dataset comparisons.

Clusters of Swiss-Prot and TrEMBL proteins

The CluSTr database offers an automatic classification of UniProt Knowledgebase proteins into groups of related proteins. The clustering is based on analysis of all pairwise comparisons between protein sequences.


coliBASE is a database for comparative genome analysis of E. coli.

Bioinorganic Motif Database

COME is an attempt to classify metalloproteins and some other complex proteins using the concept of bioinorganic motif.

Chaperonin Database"  info

Type I chaperonins are molecular chaperones present in virtually all bacteria, some archaea and the plastids and mitochondria of eukaryotes. Sequences of cpn60 genes, encoding 60 kDa chaperonin protein subunits (CPN60, also known as GroEL or HSP60), are useful for phylogenetic studies and as targets for detection and identification of organisms.

Cambridge Structural Database

The world repository of small molecule crystal structures.

Cold Shock Domain Database"  info

This is an interactive database which provides detailed information on proteins containing the so-called cold shock domain (CSD).

DALI Domain Dictionary

Dali stands for distance matrix alignment. The Dali server is an automatic service for the comparison of protein structure in 3D. You send the coordinates of a query structure and receive a multiple structure alignment in return. You can submit your coordinates either by electronic mail or interactively from the World Wide Web.

Differentially Expressed Proteins in Human Cancer Database

dbDEPC (database of Differentially Expressed Proteins in Human Cancer) is designed to store and display differentially expressed proteins (DEPs) in human cancers detected by mass spectrometry (MS) approach.

Database of Protein Subcellular Localization

DBSubLoc is a database of protein subcellular localization. This database contains proteins from primary protein database SWISS-PROT and PIR. By collecting the subcellular localization annotation, these information are classified and categorized by cross referencs to taxonomies and Gene Ontology database.

Database of Copper-Chelating Proteins

The Database of Copper-Chelating Proteins (DCCP) is maintained by Shandong Provincial Research Center for Bioinformatic Engineering and Technique. DCCP currently contains two types of proteins, one with 3D structure, called DCCP_3D, which contains 448 entries of proteins, and the other without 3D structure, called DCCP_1D, which contains 5088 entries.

DExH/D Protein Family Database"  info

DExH/D proteins are essential for all aspects of cellular RNA metabolism and processing, in the replication of many viruses and in DNA replication. DExH/D proteins are subject to current biological, biochemical and biophysical research which provides a continuous wealth of data. The DExH/D protein family database compiles this information and makes it available over the WWW.

Database of Interacting Proteins"  info

The DIPTM database catalogs experimentally determined interactions between proteins. It combines information from a variety of sources to create a single, consistent set of protein-protein interactions. The data stored within the DIP database were curated, both, manually by expert curators and also automatically using computational approaches that utilize the the knowledge about the protein-protein interaction networks extracted from the most reliable, core subset of the DIP data.

Database of Protein Disorder

Database of Protein Disorder (DisProt) is a curated database that provides information about proteins that lack fixed 3D structure in their putatively native states, either in their entirety or in part.

Database of Bacterial Lipoproteins"  info

Bacterial lipoproteins are a set of proteins modified at their N-terminal with N-Acyl Diacyl Glyceryl group (derived from phospholipids), which serves to anchor these proteins to carry out important functions efficiently at the membrane-aqueous interface.

Database Of Ribosomal Crosslinks

In order to interpret the molecular basis of the translational process, it is essential to have a corresponding knowledge of the higher structure of the ribosome. A large amount of biochemical data about structure of the ribosome is available at present. Ribosomal cross-linking is one of the most important methods which allows us to get such data.

Disulphide Database"  info

DSDBASE is a database on disulphide bonds in proteins that provides information on native disulphides and those which are stereochemically possible between pairs of residues in a protein.

Database of Simulated Molecular Motions"  info

The purpose of this database is to provide an easily-searchable source of information about movies showing biomolecular motions that have been generated by computer simulation. All of the movies are available through the internet.
Molecules simulated include proteins, DNA, RNA, sugars and lipids. Simulation techniques include Molecular Dynamics, Brownian Dynamics and automated docking procedures.

Macromolecular Structure Database"  info

The E-MSD macromolecular structure relational database is designed to be a single access point for protein and nucleic acid structures and related information.

Electrostatic-Surface of Functional Site"  info

eFsite (electrostatic-surface of Functional site) is a database for molecular surfaces of proteins' functional sites, displaying the electrostatic potentials and hydrophobic properties together on the Connolly surfaces of the active sites, for analyses of the molecular recognition mechanisms.

Edinburgh Mouse Atlas Project

The Edinburgh Mouse Atlas consists of a digital atlas of mouse embryo development and spatially-mapped gene expression.

Entrez Protein"  info

The protein entries in the Entrez search and retrieval system have been compiled from a variety of sources, including SwissProt, PIR, PRF, PDB, and translations from annotated coding regions in GenBank and RefSeq.

Endocrine Pancreas Consortium Database"  info

Genexpression database of the Endocrine Pancreas Consortium.

Erythropoiesis Database"  info

The EpoDB (Erythropoiesis database) is a database of genes that relate to vertebrate red blood cells. It includes DNA sequence, structural features, protein information, gene expression information and transcription factor binding sites. The database ist provided by the Computational Biology and Informatics Laboratory at the University of Pennsylvania.

ESTerases and alpha / beta Hydrolase Enzymes and Relatives

This server is dedicated to the analysis of protein and nucleic acid sequences belonging to the superfamily of alpha/beta hydrolases homologous to cholinesterases.

Expert Protein Analysis System"  info

EXPASy (Expert Protein Analysis System) provides databases for the research on proteomics:
Swiss-Prot and TrEMBL - Protein knowledgebase
PROSITE - Protein families and domains
SWISS-2DPAGE - Two-dimensional polyacrylamide gel electrophoresis
ENZYME - Enzyme nomenclature
SWISS-3DIMAGE - 3D images of proteins and other biological macromolecules
SWISS-MODEL Repository - Automatically generated protein models
CD40Lbase - CD40 ligand defects
SeqAnalRef - Sequence analysis bibliographic references

ExPASy Proteomics Tools"  info

Tools for the analysis of the human proteom.

"  info

The EyeSite database is an information and modelling resource for families of proteins that function in the eye. Homologues are collected from all species and clustered according to tissue type, function and sequence similarity.
A principal feature of the site is structural annotations,which range from experimentally solved structures to close structural neighbours to distant structure predictions. Many pre-generated homology models are provided.

Fold classification based on Structure-Structure alignment of Proteins (FSSP)

Fold classification - database on Structure-Structure alignment of proteins.

Low-Complexity Peptides of Forming Amyloid Plaques

Database about Peptides, which are capable of forming amyloid plaques.

Functional Shift"  info

FunShift provides Functional shift analysis between the subfamilies of a Protein domain family.

Genomic Distribution of Structural Superfamilies"  info

Genomic Distribution of Structural Superfamilies identifies and classifies evolutionary related proteins at the superfamily level in whole genome databases. The database has been curated in direct correspondence with SCOP and represents 4001 highly resolved domains in 1194 structural superfamilies across protein sequence databases.

Microarray Gene Annotation (GeneAnnot)"  info

Revised and improved annotation of Affymetrix human gene probe sets.

"  info

GeneNote is a database of human genes and their expression profiles in healthy tissues. It is based on Weizmann Institute of Science DNA array experiments, which were performed on the Affymetrix HG-U95 set A-E. It offers:

  • An expression profile (tissue vector) for each gene in the human genome
  • Gene and tissue clustering based on expression profiles
  • A full genome ranking procedure according to the gene's tendency for tissue specificity, from tissue-specific to housekeeping genes.

  • GenePaint"  info

    GenePaint is a digital atlas of gene expression patterns in the mouse.


    Genevestigator is one of the world's largest quality-controlled, curated and annotated databases of mouse Affymetrix arrays, and offers unique querying possibilities for molecular biologists.

    Gene Expression Omnibus"  info

    The Gene Expression Omnibus is a high-throughput gene expression / molecular abundance data repository, as well as a curated, online resource for gene expression data browsing, query and retrieval. GEO became operational in July 2000.

    GermOnline"  info

    GermOnline is a database that provides access to microarray expression data from experiments relevant for the mitotic and meiotic cell cycle as well as gametogenesis in yeast and higher eukaryotes. In addition to that, GermOnline integrates knowledge about genes important for sexual reproduction.

    Gene Ontology Annotation (GOA)"  info

    The Gene Ontology Annotation (GOA) database aims to provide high-quality electronic and manual annotations to the UniProt Knowledgebase (Swiss-Prot, TrEMBL and PIR-PSD) using the standardized vocabulary of the Gene Ontology (GO).

    Endogenous GPCR List

    Information about what G protein coupled receptors are expressed in which cell lines.

    Information System for G Protein-Coupled Receptors (GPCRs)"  info

    Information system about G protein-coupled receptors.

    Database of G-Proteins and their Interaction with GPCRs"  info

    gpDB is a database of G-proteins and their interactions with GPCRs.

    Genomic Threading Database"  info

    The Genomic Threading Database (GTD) contains structural annotations of proteomes, translated from the genomes of key organisms.

    Genomic Threading Database"  info

    The Genomic Threading Database (GTD) contains structural annotations of proteomes, translated from the genomes of key organisms.

    Genomes To Protein Structures And Functions"  info

    GTOP is a database built by the Laboratory of Gene-product Informatics at the National Institute of Genetics consisting of data analyses of proteins identified by various genome projects. This database mainly uses sequence homology analyses and features extensive utilization of information on three-dimensional structures. This research is supported by the Japan Science and Technology Corporation and Grants-in-Aid for Scientific Research (Genomes in category C) from the Ministry of Education, Science, Sports and Culture of Japan.

    Mouse Gene Expression Database

    Gene expression database from the mouse.

    Heart and Calcium Functional Network (HCNet) Database

    The Heart and Calcium functional Network (HCNet) database is a collection of functional gene modules calculated from the microarray data compendium available from the GEO database. It is a specialized database designed to assist experimentalists for cardiac calcium signaling research by providing the pre-calculated gene clusters and their potential correlation network in heart.

    Hembase"  info

    Hembase is an integrated browser and genome portal designed for web-based examination of the human erythroid transcriptome.

    Hetero-Atoms in Protein Structures Database"  info

    Proteins in Protein DataBank contains a lot of hetero atoms (Atoms that compose non-protein molecules). This information is useful to analyze protein-ligand/protein-substrate interactions. In Het-PDB Navi, information for hetero atoms is obtained through PDB ID and hetero atom name.

    Hetero-Compound Information Centre Uppsala"  info

    HIC-Up is a server for hetero-compounds ("small molecules").

    Database of Protein Domains and Motifs"  info

    Hits is a database devoted to protein domains. It is also a collection of tools for the investigation of the relationships between protein sequences and motifs described on them. These motifs are defined by an heterogeneous collection of predictors, which currently include regular expressions, generalized profiles and hidden Markov models.

    HIV-1 / Human Protein Interaction Database

    A database about the interaction of human immunodeficiency virus type 1 (HIV-1) proteins with those of the host cell, created by the National Institute of Allergy & Infectious Diseases.

    Homeobox Page

    Informations about Homeobox proteins, classification and evolution.

    Homeodomain Resource"  info

    The Homeodomain Resource is a searchable, curated collection of information for the homeodomain protein family.


    HomoloGene is a system for automated detection of homologs among eukaryotic gene sets.

    Homologous Structure Alignment Database"  info

    HOMSTRAD (HOMologous STRucture Alignment Database) is a curated database of structure-based alignments for homologous protein families. All known protein structure are clustered into homologous families (i.e., common ancestry), and the sequences of representative members of each family are aligned on the basis of their 3D structures using the programs MNYFIT, STAMP and COMPARER.

    The Human Olfactory Receptor Data Exploratorium"  info

    Olfactory receptors (ORs) constitute the largest multigene family in multicellular organisms. Their evolutionary proliferation has been driven by the need to provide recognition capacity for millions of potential odorants with arbitrary chemical configurations. Homo sapiens will likely be the first vertebrate species in which the complete OR repertoire will be known. You will find here information on the OR proteins, their structure, function and evolution. A set of analysis tools is provided.

    Homeobox Genes DataBase"  info

    HOX-Pro is a database which arranges all available data on structure, function, phylogeny and evolution of Hox genes, Hox clusters and Hox networks.

    Human Proteomics Initiative"  info

    The Human Proteomics Initiative is an initiative to annotate all human sequences.

    Human Plasma Membrane Receptome

    The human plasma membrane receptome is a protein sequences, literature, and expression database.

    Human Protein Reference Database"  info

    The Human Protein Reference Database is a platform to visually depict and integrate information pertaining to domain architecture, post-translational modifications, interaction networks and disease association for each protein in the human proteome. All the information in HPRD has been manually extracted from the literature by expert biologists who read, interpret and analyze the published data.
    HPRD is made by PandeyLab and the Institute of Bioinformatics.

    Human Red Blood Cell Database

    hRBCD is a result of collaboration between the Department for Proteomics and Signal Transduction at the Max-Planck-Institut for Biochemistry and the Biomedical Primate Research Centre in the Netherlands.

    Homology-Derived Structures of Proteins (HSSP)"  info

    HSSP (homology-derived structures of proteins) is a derived database merging structural (2-D and 3-D) and sequence information (1-D). For each protein of known 3D structure from the Protein Data Bank, the database has a file with all sequence homologues, properly aligned to the PDB protein.

    Human Gene Expression Index (HuGE Index)"  info

    Human Gene Expression Index (HuGE Index) aims to provide a comprehensive database to aid in understanding the expression of human genes in normal human tissues.
    The mRNA expression levels of thousands of genes in a collection of normal human organs were obtained using high-density oligonucleotide array technology and deposited in this public database.

    Human Protein Atlas

    The protein atlas has been created to show the expression and localization of proteins in a large variety of normal human tissues and cancer cells. The data is presented as high resolution images representing immunohistochemically stained tissue sections. Available proteins (genes) can be reached through a specific search (by gene/protein name/id or classification, such as kinase or protease) or by browsing the individual chromosomes.

    Inter-Chain Beta-Sheets"  info

    ICBS is a a database of protein-protein interactions mediated by interchain ß-sheet formation.

    Image Library of Biological Macromolecules"  info

    The IMB Jena Image Library of Biological Macromolecules is aimed at a better dissemination of information on three-dimensional biopolymer structures with an emphasis on visualization and analysis.

    IMGT/3Dstructure Database"  info

    IMGT/3Dstructure-DB and IMGT/Structural-Query are a 3D structure database and a tool for immunological proteins. They are part of IMGT, the international ImMunoGenetics information system, a high-quality integrated knowledge resource specializing in immunoglobulins (IG), T cell receptors (TR), major histocompatibility complex (MHC) and related proteins of the immune system (RPI) of human and other vertebrate species, which consists of databases, Web resources and interactive on-line tools.

    The New England Biolabs Intein Database"  info

    Inteins (protein splicing elements) database.

    Inner Ear Protein Database"  info

    Database about the Proteom of the inner ear.

    IntAct Project"  info

    IntAct provides an open source database and toolkit for the storage, presentation and analysis of protein interactions.

    Database of Putative Protein Domain Interactions"  info

    InterDom is a database of putative interacting protein domains derived from multiple sources, ranging from domain fusions (Rosetta Stone), protein interactions (DIP and BIND), protein complexes (PDB), to scientific literature (MEDLINE).

    Interferon Stimulated Gene Database

    Database about genes induced by treatment with interferons.


    The InterPro database is an integrated documentation resource for protein families, domains and functional sites.

    International Protein Index (IPI)"  info

    The IPI (International Protein Index) provides a top level guide to the main databases that describe the human, mouse and rat proteomes: Swiss-Prot, TrEMBL, RefSeq and Ensembl. IPI is created by identifying entries from the constituent databases that represent the same protein; and using these mappings to automatically create a datasets with maximum extent but minimum redundancy. For each protein in IPI, an entry from one of the constituent databases is selected as the master entry, and supplies the IPI entry with its sequence and annotation. Stable identifiers (with incremental versioning) are maintained to allow the tracking of sequences in IPI between IPI releases.

    Integrated Protein Knowledgebase"  info

    The iProClass database provides value-added information reports for UniProtKB and unique UniParc proteins, with links to over 90 biological databases, including databases for protein families, functions and pathways, interactions, structures and structural classifications, genes and genomes, ontologies, literature, and taxonomy. iProClass combines both data warehouse and hypertext navigation methods for integrating data, providing a comprehensive picture of protein properties that may lead to novel prediction and functional inference for previously uncharacterized "hypothetical" proteins and protein groups.

    Database of Chemical Compounds and Reactions in Biological Pathways"  info

    KEGG LIGAND Database. The universe of chemical reactions involving metabolites and other biochemical compounds, as well as xenobiotic compounds.

    Kidney Development Database

    The Kidney Development Database was created to collect in one place the data from a large number of developmental studies that have a bearing on the study of kidney development.

    Lipase Engineering Database"  info

    The Lipase Engineering Database (LED) integrates information on sequence, structure, and function of lipases, esterases, and related proteins.

    Ligand-Gated Ion Channel Database"  info

    Ligand-Gated Ion Channels are transmembrane proteins that can exist under different conformations, at least one forming a pore through the membrane connecting the two neighbour compartments.
    In the LGIC Database one will find the nucleic and proteic sequences of the subunits. Multiple sequence alignments can be generated, and some phylogenetic studies of the superfamilies are provided.

    Database for Localization, Interaction, Functional assays and Expression of Proteins

    The Database for Localization, Interaction, Functional assays and Expression of Proteinsis created and maintained by Detlev Bannasch and Alexander Mehrle, DKFZ Heidelberg, Molecular Genome Analysis.

    LipOXygenases DataBase"  info

    Due to their involvement in several diseases like cancer, inflammation, fever or arthritis, a lot of research is done on lipoxygenases yielding information about sequence, structure and function of these proteins. The LipOXygenases-DataBase (LOX-DB) is an effort to combine these informations and the related literature in one database and make it available via the internet.
    This database ist maintained by the DKFZ (Deutsches Krebsforschungszentrum) in Heidelberg.

    Metalloprotein Database

    MDB contains quantitative information on all the metal-containing sites available from structures in the PDB distribution.

    MEROPS. The Protease Database.

    The MEROPS database is an information resource for peptidases (also termed proteases, proteinases and proteolytic enzymes) and the proteins that inhibit them. The Summary page describing a given peptidase can be reached by use of an index under its Name, MEROPS Identifier or source Organism. The Summary describes the classification and nomenclature of the peptidase and offers links to supplementary pages showing sequence identifiers, the structure if known, literature references and more.

    DNA Methylation Database"  info

    Database of DNA methylation.

    MOLecular INTeraction Database"  info

    Database about molecular protein interactions.

    Major Intrinsic Proteins Database"  info

    The MIPs (major intrinsic proteins) constitute a large family of membrane proteins that facilitate the passive transport of water and small neutral solutes across cell membranes. Since water is the most abundant molecule in all living organisms, the discovery of selective water-transporting channels called AQPs (aquaporins) has led to new knowledge on both the physiological and molecular mechanisms of membrane permeability. The MIPs are identified in Archaea, Bacteria and Eukaryota.

    The MIPS Mammalian Protein-Protein Interaction Database"  info

    The MIPS Mammalian Protein-Protein Interaction Database is produced by the Munich Information Center for Protein Sequences.


    MIRAGE (Molecular Informatics Resource for the Analysis of Gene Expression) is a web site dedicated to methodologies, tools, and technologies relating to gene expression information. MIRAGE is a web resource of the Institute for Transcriptional Informatics (IFTI), Pittsburgh PA 15230-2556 USA.

    Mitochondrial Proteom Database"  info

    Database about the mitochondrial proteom.

    Molecular Modeling Database"  info

    NCBI's Entrez includes a database of experimentally determined three-dimensional biomolecular structures. Most 3D structure data are obtained from X-ray crystallography and NMR spectroscopy; they provide a wealth of information on biological function, on mechanisms linked to the function, and on the evolutionary history of and relationships between macromolecules.

    Comparative Protein Structure Models Database"  info

    MODBASE is a relational database of annotated comparative protein structure models for all available protein sequences matched to at least one known protein structure.

    Database of Macromolecular Movements"  info

    The Database of Macromolecular Movements with Associated Tools for Geometric Analysis describes the motions that occur in proteins and other macromolecules, particularly using movies. Associated with it are a variety of free software tools and servers for structural analysis.

    Membrane Protein Data Bank"  info

    Database about membrane-proteins.

    Nuclear Export Signals Database"  info

    NESbase is a database of proteins in which the presence of Leucine-rich nuclear export signal (NES) has been experimentally verified.

    NetAffx Analysis Center"  info

    NetAffx details and annotates probesets on Affymetrix GeneChip microarrays. These annotations include (i) static information specific to the probeset composition; (ii) sequence annotations extracted from public databases; and (iii) protein sequence-level annotations derived from public domain programs, as well as libraries of hidden Markov models (HMMs) developed at Affymetrix.

    Nuclear Localization Signals (NLS) Database

    NLSdb is a database of nuclear localization signals (NLSs) and of nuclear proteins targeted to the nucleus by NLS motifs, produced by Rajesh Nair & Phil Carter @ Rost Group, Columbia University, New York.

    Nuclear Matrix Associated Proteins - Database"  info

    NMP-db is a database of nuclear matrix associated proteins, produced by Sven Mika @ Rost Group, Columbia University, New York.

    Nucleolar Proteome Database

    The Nucleolar Proteome Database contains all data from ongoing proteomic analysis of human nucleoli, which is carried out as a collaboration between the Lamond and Mann laboratories.

    Nuclear Protein Database"  info

    The NPD contains information on >1000 vertebrate proteins (mainly those from mouse and human) that are thought to, or known to, be localised to the cell nucleus. Where known, the sub-nuclear compartment where the proteins have been found are reported. Also stored is information on the amino acid sequence, predicted protein size and isoelectric point, as well as any repeats, motifs or domains within the protein sequence. Biological and molecular functions of the proteins are described using GO terms. Where appropriate, links to other databases are provided (e.g. Entrez, SWISS-PROT, OMIM, PubMed, PubMed Central).

    Nuclear Receptor Resource

    The Nuclear Receptor Resource (NRR) Project is a collection of individual databases on members of the steroid and thyroid hormone receptor superfamily. Although the databases are located on different servers and are managed individually, they each form a node of the NRR. The NRR itself integrates the separate databases and allows an interactive forum for the dissemination of information about the superfamily.

    Information System for Nuclear Receptors"  info

    Database about nuclear receptors.

    Nuclear Hormone Receptors Database"  info

    NUREBASE is a reference database on Nuclear Hormone Receptors.
    Nuclear hormone receptors are a very important class of ligand-activated transcription factors, found in varying numbers in all animals. The receptors play essential roles in embryonic development, maintenance of differentiated cellular phenotypes, metabolism and cell death. Malfunction or deregulation of the receptors leads to proliferative, reproductive, and metabolic diseases such as cancer, infertility, obesity and diabetes.

    O-GlycBase"  info

    O-GlycBase is a database of glycoproteins with O-linked glycosylation sites.

    Olfactory Bulb Odor Map Database"  info

    OdorMapDB is designed to be a database to support the experimental analysis of the molecular and functional organization of the olfactory bulb and its basis for the perception of smell.

    Cancer Microarray Database - Oncomine

    ONCOMINE collects all publicly available cancer microarray studies.

    Object-Oriented Transcription Factors Database"  info

    ooTFD (object-oriented Transcription Factors Database) is a successor to TFD, the original Transcription Factors Database .

    Orientations of Proteins in Membranes (OPM) Database"  info

    OPM provides hydrophobic thicknesses and orientations of membrane proteins with respect to the hydrocarbon core of the lipid bilayer.

    Olfactory Receptor Database

    Olfactory receptors (ORs) are the largest family in the genome, and the first of a widening range of chemosensory receptors (CRs) in other chemosensory organs. ORDB began as a database of vertebrate OR genes and proteins and continues to support sequencing and analysis of these receptors by providing a comprehensive archive with search tools for this expanding family. Because of the associated growth of interest in other CRs, the database has grown over the years to include a broad range of chemosensory genes and proteins, that includes in addition to ORs the taste papilla receptors (TPRs), vomeronasal organ receptors (VNRs), insect olfaction receptors (IORs), Caenorhabditis elegans chemosensory receptors (CeCRs), fungal pheromone receptors (FPRs) [these are informal abbreviations for tracking in ORDB, pending development of consensus nomenclature by the field].

    Organellar Map Database

    In the Organellar Map Database, a protein and peptide list with organellar map information is provided.

    OrthoDisease - A Database of Disease Gene Orthologs"  info

    OrthoDisease is a database of model organism genes that are orthologous to human disease genes.

    Protein Alignments Organised as Structural Superfamilies DB"  info

    PASS2 is a database of structural motifs of protein superfamilies.

    Protein Data Bank"  info

    The data afford the 3-dimensional illustration of proteins.


    The PDBbind database is developed in Dr. Shaomeng Wang's group at the University of Michigan.
    The PDBbind database is designed to provide a collection of experimentally measured binding affinity data (Kd, Ki, and IC50) exclusively for the protein-ligand complexes available in the Protein Data Bank (PDB). All of the binding affinity data compiled in the PDBbind database were collected from original references.

    Database of the Known 3D Structures of Proteins and Nucleic Acids

    PDBsum was created by Roman Laskowski of the European Bioinformatics Institute in Hinxton, U.K. This database provides synopses of structural and functional information stashed in the Protein Data Bank repository.

    Protein Databank of Transmembrane Proteins

    PDB_TM is a databank about transmembrane proteins.

    Protein-Protein Interaction Database for PDZ-Domains"  info

    PDZBase is a manually curated protein-protein interaction database developed specifically for interactions involving PDZ domains. PDZBase currently contains 339 experimentally determined protein protein interactions.

    Predictions for Entire Proteomes"  info

    PEP is a database of Predictions for Entire Proteomes, produced by the Rost Group, at Columbia University, New York.

    Peptide Conformation Database

    PepConf is a database of peptide conformations.

    Public Expression Profiling Resource (pepr)"  info

    Public expression profiling resource provides expression profiles in a variety of diseases and conditions.

    Peroxisome Database

    Peroxisomes are single-membrane subcellular organelles, present in most eukaryotic cells and organisms. The Peroxisome fulfills essential metabolic functions in lipid metabolism, both catabolic (oxidation of pipecolic, phytanic and very-long chain fatty acids) and anabolic (synthesis of plasmalogens and bile acids). Moreover, the peroxisome plays a key role in free radical detoxification, differentiation, development and morphogenesis from human to yeast. Fatal disorders are related to defective peroxisomal function or biogenesis. The aim of PEROXISOME database (PeroxisomeDB) is to gather, organise and integrate curated information on peroxisomal genes, their encoded proteins, their molecular function and metabolic pathway they belong to, and their related disorders. PeroxisomeDB contains the complete peroxisomal proteome of Homo sapiens (encoded by 85 genes) and Saccaromyces cerevisiae (encoded by 61 genes).

    Protein Families Database of Alignments and HMMs

    Pfam is a collection of protein families and domains. Pfam contains multiple protein alignments and profile-HMMs of these families. Pfam is a semi-automatic protein family database, which aims to be comprehensive as well as accurate.

    Protein Folding Database"  info

    The Protein Folding Database (PFD) is a collection of all biophysical data relating to experimental protein folding studies.
    The database contains structural, methodological, kinetic and thermodynamic data, and contains data for 52 proteins, representing 39 SCOP families, and 18 organisms.

    Database of S/T/Y Phosphorylation Sites

    The Phospho.ELM database contains experimentally verified Serine, Threonine and Tyrosine sites in eukaryotic proteins. The entries, manually annotated and based on scientific literature, provide information about the phosphorylated proteins and the exact position of known phosphorylated instances.

    PhosphoBase - A Database of Phosphorylation Sites"  info

    PhosphoBase contains information about phosphorylated residues in proteins and data about peptide phosphorylation by a variety of protein kinases.

    PIR International Protein Sequence Database

    The PIR, in collaboration with MIPS and JIPID, produces and distributes the PIR-International Protein Sequence Database (PSD), the most comprehensive and expertly annotated protein sequence database in the public domain. The primary objective for its continuing development and enhancement is to achieve the properties of: Comprehensiveness, Timeliness, Non-Redundancy, Quality Annotation, and Full Classification

    PIRSF SuperFamily Classification System"  info

    The PIR superfamily/family concept, the original classification based on sequence similarity, has been used as a guiding principle to provide non-overlapping clusters of protein sequences to reflect their evolutionary relationships.

    Protein Mutant Database

    Compliations of protein mutant data are valuable as a basis for protein engineering. They provide information on what kinds of functional and/or structural influences are brought about by amino acid mutation at a specific position of protein.
    The Protein Mutant Database (PMD) covers natural as well as artificial mutants, including random and site-directed ones, for all proteins except members of the globin and immunoglobulin families. The PMD is based on literature, not on proteins. That is, each entry in the database corresponds to one article which may describe one, several or a number of protein mutants.

    Proteome Profile Database"  info

    PPD is a WWW-based database for comparative analysis of protein lengths in completely sequenced prokaryotic and eukaryotic genomes.

    Plasma Proteome Database

    PPD is a comprehensive list of blood proteins from Johns Hopkins University in Baltimore, Maryland, and the Institute of Bioinformatics in Bangalore, India. There is information about data from the literature on the more than 7500 protein variants that enter the plasma at some time. For each version, or isoform, one can find standard genomic information such as gene and amino acid sequences.

    HUPO - Plasma Proteom Project"  info

    HUPO initiated the Plasma Proteome Project (PPP) in 2002. Its pilot phase has (1) evaluated advantages and limitations of many depletion, fractionation, and MS technology platforms; (2) compared PPP reference specimens of human serum and EDTA, heparin, and citrate-anti-coagulated plasma; and (3) created a publicly-available knowledge base (;

    Predicted and Consensus Interaction Sites in Enzymes"  info

    PRECISE (Predicted and Consensus Interaction Sites in Enzymes) is a database of interactions between the amino acid residues of an enzyme and its various ligands, i.e., substrate and transition state analogues, cofactors, inhibitors, and products.

    PredictProtein"  info

    PredictProtein is a service for sequence analysis, structure and function prediction. When submitting any protein sequence PredictProtein retrieves similar sequences in the database and predicts aspects of protein structure and function.

    Protein Research Foundation - Literature Database"  info

    The Peptide Institute, Protein Research Foundation, is collecting the information related to amino acids, peptides and proteins. All articles dealing with peptides from scientific journals accessible in Japan are collected.

    Amino Acids Sequence Database

    Japanese database about amino acids.

    PRoteomics IDEntifications Database"  info

    The PRIDE PRoteomics IDEntifications database has been developed to provide the proteomics community with a public repository for protein and peptide identifications together with the evidence supporting these identifications.

    The PRINTS Fingerprint Database"  info

    PRINTS is a compendium of protein fingerprints. A fingerprint is a group of conserved motifs used to characterise a protein family; its diagnostic power is refined by iterative scanning of a SWISS-PROT/TrEMBL composite.

    ProDom"  info

    ProDom is a comprehensive set of protein domain families automatically generated from the SWISS-PROT and TrEMBL sequence databases.

    Prokaryotic Lysis Enzymes Site

    ProLysES is a database of bacterial protease systems.

    The Prosthetic Groups and Metal Ions in Protein Active Sites Database

    Database about prosthetic groups and metal ions in protein active sites.

    Thermodynamic Database for Protein - Nucleic Acid Interactions"  info

    ProNIT contains information about sequence, structure, bibliographic information and several thermodynamic parameters such as dissociation constant, association constant, changes in Gibbs free energy, enthalpy and heat capacity, activity (Km and kcat) etc., along with experimental method and conditions.

    PROSITE Database of Protein Families and Domains

    PROSITE is a database of protein families and domains. It is based on the observation that, while there is a huge number of different proteins, most of them can be grouped, on the basis of similarities in their sequences, into a limited number of families. Proteins or protein domains belonging to a particular family generally share functional attributes and are derived from a common ancestor.

    Entrez Protein Database

    The protein entries in the Entrez search and retrieval system have been compiled from a variety of sources, including SwissProt, PIR, PRF, PDB, and translations from annotated coding regions in GenBank and RefSeq.

    Proteome Analysis Database

    The Proteome Analysis database has been set up to provide comprehensive statistical and comparative analyses of the predicted proteomes of fully sequenced organisms. The analysis is compiled using InterPro, CluSTr and GO Slim, and is performed on the non-redundant complete proteome sets of SWISS-PROT and TrEMBL entries.

    PROtein TErminUS

    ProTeus (PROtein TErminUS) is a tool for identification of short linear signatures in protein termini.

    Thermodynamic Database for Proteins and Mutants

    ProTherm is a collection of numerical data of thermodynamic parameters such as Gibbs free energy change, enthalpy change, heat capacity change, transition temperature etc. for wild type and mutant proteins, that are important for understanding the structure and stability of proteins. It also contains information about secondary structure and accessibility of wild type residues, experimental conditions (pH, temperature, buffer, ion and protein concentration), measurements and methods used for each data, and activity information (Km and Kcat ).

    ProtoMap"  info

    This site offers an exhaustive classification of all the proteins in the SWISSPROT and TrEMBL databases, into groups of related proteins.

    ProtoNet"  info

    ProtoNet provides automatic hierarchical classification of protein sequences.

    Protein Reviews On The Web

    PROW Guides are authoritative short, structured reviews on proteins and protein families.
    The Guides provide approximately 20 standardized categories of information (abstract, biochemical function, ligands, references, etc.) for each protein.

    Ribosomal Database Project

    The Ribosomal Database Project (RDP) provides ribosome related data services to the scientific community, including online data analysis, rRNA derived phylogenetic trees, and aligned and annotated rRNA sequences.

    The Restriction Enzyme Database"  info

    A collection of information about restriction enzymes and related proteins. It contains published and unpublished references, recognition and cleavage sites, isoschizomers, commercial availability, methylation sensitivity, crystal and sequence data. DNA methyltransferases, homing endonucleases, nicking enzymes, specificity subunits and control proteins are also included. Putative DNA methyltransferases and restriction enzymes, as predicted from analysis of genomic sequences, are also listed. REBASE is updated daily and is constantly expanding. REBASE was made by Nobel laureate Dr. Richard J. Roberts and Dana Macelis.

    Database of the Translational Recoding Events (recode)"  info

    The RECODE database is a compilation of translational recoding events (programmed ribosomal frameshifting, codon redefinition and translational bypass). The database provides information about the genes utilizing these events for their expression, recoding sites, stimulatory sequences and other relevant information.

    REFerence database for Gene EXpression Analysis"  info

    Database for gene expression analysis.

    Database for Protein Renaturation"  info

    Database of protocols describing the refolding and purification of recombinant proteins.

    Database for Comprehensive Analysis of Protein–Ligand Interactions

    The database behind Relibase is the Protein Data Bank (PDB). It covers all entries in the PDB which were determined experimentally by means of X-ray diffraction or NMR spectroscopy, but not theoretical structures.

    Remaining TrEMBL

    TrEMBL is a computer-annotated protein sequence database supplementing the Swiss-Prot Protein Sequence Data Bank. TrEMBL contains the translations of all coding sequences (CDS) present in the EMBL Nucleotide Sequence Database not yet integrated in Swiss-Prot. TrEMBL can be considered as a preliminary section of Swiss-Prot. For all TrEMBL entries which should finally be upgraded to the standard Swiss-Prot quality, Swiss-Prot accession numbers have been assigned. TrEMBL is split in two main sections: SpTrEMBL and RemTREMBL
    RemTrEMBL (Remaining TrEMBL) contains the entries that we do not want to include in Swiss-Prot.

  • Immunoglobulins and T-cell receptors
  • Synthetic sequences
  • Patent application sequences
  • Small fragments
  • CDS not coding for real proteins

  • RESID"  info

    The RESID Database is a comprehensive collection of annotations and structures for protein modifications including amino-terminal, carboxyl-terminal and peptide chain cross-link, pre-, co- and post-translational modifications.

    Ribosomal RNA Database

    This database compiles all complete or nearly complete SSU (small subunit) and LSU (large subunit) ribosomal RNA sequences. Sequences are provided in aligned format. The alignment takes into account the secondary structure information derived by comparative sequence analysis of thousands of sequences. Additional information such as literature references, taxonomy, secondary structure modles and nucleotide variability maps, is also available.

    Ribosomal Differentiation of Medical Microorganisms"  info

    Database about the ribosomal differentiation of medical relevant microorganisms.

    Ribonuclease P Database"  info

    The Ribonuclease P Database is a compilation of RNase P sequences, sequence alignments, secondary structures, three-dimensional models and accessory information available electronically via the World Wide Web

    Ribosomal Protein Gene Database"  info

    RPG is a database that provides detailed information about ribosomal protein (RP) genes. It contains data from humans and other organisms, including Drosophila melanogaster, Caenorhabditis elegans, Saccharo myces cerevisiae, Methanococcus jannaschii and Escherichia coli.

    Database of Receptor Tyrosine Kinase"  info

    In this database, protein sequences from eight model metazoan species are organized under the format previously used for the HOVERGEN, HOBACGEN and NUREBASE systems.

    S/MAR Transaction DataBase"  info

    S/MARt DB, the S/MAR transaction database, is a relational database covering scaffold/matrix attached regions (S/MARs) and nuclear matrix proteins that are involved in the chromosomal attachment to the nuclear scaffold.

    Sarcomere Protein Gene Mutation Database

    Mutations in sarcomere protein genes are known to cause hypertrophic cardiomyopathy, dilated cardiomyopathy, and other diseases.

    Sbase"  info

    SBASE is a collection of protein domain sequences collected from the literature, from protein sequence databases and from genomic databases (Vlahovicek et al, 2002). The protein domains are defined by their sequence boundaries given by the publishing authors or in one of the primary sequence databases (Swiss-Prot, PIR, TREMBL etc.). Domain groups are included if they have well defined sequence boundaries, and if they can be distinguished from other sequences using a similarity search technique.

    Structural Classification of Proteins"  info

    The Structural Classification of Proteins (SCOP) database is a comprehensive ordering of all proteins of known structure, according to their evolutionary and structural relationships.

    SCOPe: Structural Classification of Proteins — extended"  info

    SCOPe is a database developed at the Berkeley Lab and UC Berkeley that extends SCOP (version 1). SCOPe classifies many structures released since SCOP 1.75 through a combination of automation and manual curation, and corrects some errors, aiming to have the same accuracy as the fully hand-curated SCOP releases. SCOPe also incorporates and updates the Astral database.

    Database of Protein Catalytic Domains"  info

    Database of protein catalytic domains.

    Structural Characterization Of Water, Ligands and Proteins"  info

    SCOWLP (Structural Characterization Of Water, Ligands and Proteins) is a database for detailed characterization and visualization of the PDB protein interfaces. The SCOWLP database includes proteins, peptidic-ligands and interface water molecules as descriptors of protein interfaces.

    Structural Database of Allergenic Proteins"  info

    SDAP is a Web server that integrates a database of allergenic proteins with various bioinformatics tools for performing structural studies related to allergens and characterization of their epitopes.

    7 Transmembrane Helix Receptor Database (7-TMR)

    SEVENS is a database including 7-TMR (7 transmembrane helix receptor) candidates predicted from whole human genome sequences, using sequence search, motif and domain assignment, transmembrane helix prediction and the gene quality refinement.

    Simple Modular Architecture Research Tool"  info

    SMART (Simple Modular Architecture Research Tool) is a web tool for the identification and annotation of protein domains, and provides a platform for the comparative study of complex domain architectures in genes and proteins.

    Stanford Microarray Database"  info

    SMD stores raw and normalized data from microarray experiments, as well as their corresponding image files. In addition, SMD provides interfaces for data retrieval, analysis and visualization.

    Stanley Medical Research Institute Online Genomics Database"  info

    The Stanley Medical Research Institute online genomics database (SMRIDB) is a web-based system for understanding the genetic effects of human brain disease (i.e. bipolar, schizophrenia, and depression). This database contains fully annotated clinical metadata and gene expression patterns.

    Secreted Protein Database

    Database about proteins which are secreted outside of the cell membrane.

    Signal Peptide Database"  info

    SPdb is a signal peptide database containing signal/leader sequences of archaea, prokaryotes and eukaryotes.

    Signal Recognition Particle Database"  info

    SRPDB (Signal Recognition Particle Database) provides aligned, annotated and phylogenetically Ordered sequences related to structure and function of SRP.

    Structure Superposition Database (SSD)"  info

    The SSD has been developed to address the need for resources and tools for understanding large sets of superpositions in order to understand evolutionary relationships and to make predictions of function.

    Super LIgands"  info

    Super Ligands contains small molecule structures occurring as ligands in the Protein Data Bank.

    HMM Library and Genome Assignments Server"  info

    The SUPERFAMILY database provides structural assignments to protein sequences and a framework for analysis of the results.

    SUrface Residues and Functions Annotated, Compared and Evaluated"  info

    The SURFACE database is a repository of annotated and compared protein surface regions.


    SPTR (accessible as SWall on the EBI SRS server) is a comprehensive protein sequence database that combines the high quality of annotation in Swiss-Prot with the completeness of the weekly updated translation of all protein coding sequences from the EMBL nucleotide sequence database.

    Sweden Peptide Database"  info

    SwePep is an endogenous peptide database developed by the Laboratory for Biological and Medical Mass Spectrometry at Uppsala University to aid mass spectrometric identifications.

    Swiss Prot Protein Knowledgebase

    Swiss-Prot is a curated protein sequence database which strives to provide a high level of annotation (such as the description of the function of a protein, its domains structure, post-translational modifications, variants, etc.), a minimal level of redundancy and high level of integration with other databases.

    Two-dimensional Polyacrylamide Gel Electrophoresis Database

    SWISS-2DPAGE contains data on proteins identified on various 2-D PAGE and SDS-PAGE reference maps.

    SWISS-MODEL Repository"  info

    The SWISS-MODEL Repository is a database of annotated three-dimensional comparative protein structure models generated by the fully automated homology-modelling pipeline SWISS-MODEL. The repository is developed at the Biozentrum Basel within the Swiss Institute of Bioinformatics.

    SYFPEITHI"  info

    This Database contains information on:

  • Peptide sequences
  • anchor positions
  • MHC specificity
  • source proteins, source organisms
  • publication references

  • SYSTEmatic Re-Searching (systers)"  info

    SYSTERS is a collection of graph-based algorithms to hierarchically partition a large set of protein sequences into homologous families and superfamilies. The methods unified now under the name SYSTERS (short for SYSTEmatic Re-Searching) are based on an all-against-all database search (using Smith-Waterman comparisons on a GeneMatcher machine).

    Target Search for Structural Genomics"  info

    TargetDB is a target registration database funded by the NIH that was developed to provide registration and tracking information for the NIH P50 structural genomics centers.
    The scope of TargetDB is to provide timely status and tracking information on the progress of the production and solution of structures.

    Transcription Element Listening System (TELiS)"  info

    TELiS combines sequence-based analysis of gene regulatory regions with statistical prevalence analyses to identify transcription-factor binding motifs (TFBMs) that are over-represented among the promoters of up- or down-regulated genes. TELiS thus provides a simple, rapid and sensitive tool for identifying transcription control pathways mediating observed gene expression dynamics.

    TIGRFAMs"  info

    TIGRFAMs are protein families based on Hidden Markov Models or HMMs. Use this page to see the curated seed alignmet for each TIGRFAM, the full alignment of all family members and the cutoff scores for inclusion in each of the TIGRFAMs.

    Tissue-Specific Promoter Database

    TiProD is a database of human promoter sequences for which some functional features are known. It allows a user to query individual promoters and the expression pattern they mediate, gene expression signatures of individual tissues, and to retrieve sets of promoters according to their tissue-specific activity or according to individual Gene Ontology terms the corresponding genes are assigned to.

    Gene Expression in Tooth

    Database about the geneexpression in tooth.

    Topology of Protein Structures Database"  info

    The TOPS database holds topological descriptions of protein structures.

    UniProt"  info

    UniProt (Universal Protein Resource) is the world's most comprehensive catalog of information on proteins. It is a central repository of protein sequence and function created by joining the information contained in Swiss-Prot, TrEMBL, and PIR.
    UniProt is comprised of three components, each optimized for different uses. The UniProt Knowledgebase (UniProt) is the central access point for extensive curated protein information, including function, classification, and cross-reference. The UniProt Non-redundant Reference (UniRef) databases combine closely related sequences into a single record to speed searches. The UniProt Archive (UniParc) is a comprehensive repository, reflecting the history of all protein sequences.

    Virus Database at University College London"  info

    VIDA is a virus database that organizes open reading frames (ORFs) from partial and complete genomic sequences from animal viruses.

    Virus Particle Explorer (viper)"  info

    VIPER is a Website for virus capsid structures and their computational analyses.

    Voltage-gated Potassium Channel Database"  info

    Voltage-gated potassium channel database (VKCDB) is designed to serve as a resource for research on voltage-gated potassium channels. Protein sequences, references, functional data and many other relevant data are included in this database.

    WebQTL Project"  info

    WebQTL stors data about gene expression, neuropharmacological,anatomical, and behavioral data for over 30 strains of mice.

    The Wnt gene Homepage

    Wnt proteins form a family of highly conserved secreted signaling molecules that regulate cell-to-cell interactions during embryogenesis. Wnt genes and Wnt signaling are also implicated in cancer. Insights into the mechanisms of Wnt action have emerged from several systems: genetics in Drosophila and Caenorhabditis elegans; biochemistry in cell culture and ectopic gene expression in Xenopus embryos. Many Wnt genes in the mouse have been mutated, leading to very specific developmental defects.

    YEAST Protein Complex Database"  info

    This website contains the information on protein-protein interactions and protein complex worked out by a group of scientists at Cellzome AG and the European Molecular Biology Laboratory in Heidelberg, Germany.