LINKS

UTA logo

Michael Y. Galperin
The Molecular Biology Database Collection: 2005 update
Nucl. Acids Res. 2005 33: D5-D24; doi:10.1093/nar/gki139
1. Nucleotide Sequence Databases
1.1. International Nucleotide Sequence Database Collaboration
  DDBJ—DNA Data Bank of Japan All known nucleotide and protein sequences http://www.ddbj.nig.ac.jp
  EMBL Nucleotide Sequence Database All known nucleotide and protein sequences http://www.ebi.ac.uk/embl.html
  GenBank® All known nucleotide and protein sequences http://www.ncbi.nlm.nih.gov/Entrez
1.2. DNA sequences: genes, motifs and regulatory sites
1.2.1. Coding and coding DNA
  ACLAME A classification of genetic mobile elements http://aclame.ulb.ac.be/
  CUTG Codon usage tabulated from GenBank http://www.kazusa.or.jp/codon/
  Genetic Codes Genetic codes in various organisms and organelles http://www.ncbi.nlm.nih.gov/Taxonomy/Utils/wprintgc.cgi
  Entrez Gene Gene-centered information at NCBI http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=gene
  HERVd Human endogenous retrovirus database http://herv.img.cas.cz
  Hoppsigen Human and mouse homologous processed pseudogenes http://pbil.univ-lyon1.fr/databases/hoppsigen.html
  Imprinted Gene Catalogue Imprinted genes and parent-of-origin effects in animals http://www.otago.ac.nz/IGC
  Islander Pathogenicity islands and prophages in bacterial genomes http://www.indiana.edu/~islander
  MICdb Prokaryotic microsatellites http://www.cdfd.org.in/micas
  NPRD Nucleosome positioning region database http://srs6.bionet.nsc.ru/srs6/
  STRBase Short tandem DNA repeats database http://www.cstl.nist.gov/div831/strbase/
  TIGR Gene Indices Organism-specific databases of EST and gene sequences http://www.tigr.org/tdb/tgi.shtml
  Transterm Codon usage, start and stop signals http://uther.otago.ac.nz/Transterm.html
  UniGene Non-redundant set of eukaryotic gene-oriented clusters http://www.ncbi.nlm.nih.gov/UniGene/
  UniVec Vector sequences, adapters, linkers and primers used in DNA cloning, can be used to check for vector contamination http://www.ncbi.nlm.nih.gov/VecScreen/UniVec.html
  VectorDB Characterization and classification of nucleic acid vectors http://genome-www2.stanford.edu/vectordb/
  Xpro Eukaryotic protein-encoding DNA sequences, both intron-containing and intron-less genes http://origin.bic.nus.edu.sg/xpro/
1.2.2. Gene structure, introns and exons, splice sites
  ASAP Alternative spliced isoforms http://www.bioinformatics.ucla.edu/ASAP
  ASD Alternative splicing database at EBI, includes three databases AltSplice, AltExtron and AEdb http://www.ebi.ac.uk/asd
  ASDB Alternative splicing database: protein products and expression patterns of alternatively spliced genes http://hazelton.lbl.gov/~teplitski/alt
  ASHESdb Alternatively spliced human genes by exon skipping database http://sege.ntu.edu.sg/wester/ashes/
  EASED Extended alternatively spliced EST database http://eased.bioinf.mdc-berlin.de/
  ECgene Genome annotation for alternative splicing http://genome.ewha.ac.kr/ECgene/
  EDAS EST-derived alternative splicing database http://www.ig-msk.ru:8005/EDAS/
  ExInt Exon–intron structure of eukaryotic genes http://sege.ntu.edu.sg/wester/exint/
  HS3D Homo sapiens splice sites dataset http://www.sci.unisannio.it/docenti/rampone/
  Intronerator Alternative splicing in C.elegans and C.briggsae http://www.cse.ucsc.edu/~kent/intronerator/
  SpliceDB Canonical and non-canonical mammalian splice sites http://www.softberry.com/berry.phtml?topic=splicedb&group=data&subgroup=spldb
  SpliceInfo Modes of alternative splicing in human genome http://140.115.50.96/SpliceInfo/
  SpliceNest A tool for visualizing splicing of genes from EST data http://splicenest.molgen.mpg.de/
1.2.3. Transcriptional regulator sites and transcription factors
  ACTIVITY Functional DNA/RNA site activity http://wwwmgs.bionet.nsc.ru/mgs/systems/activity/
  DBTBS Bacillus subtilis promoters and transcription factors http://dbtbs.hgc.jp/
  DoOP Database of orthologous promoters: chordates and plants http://doop.abc.hu/
  DPInteract Binding sites for E.coli DNA-binding proteins http://arep.med.harvard.edu/dpinteract
  EPD Eukaryotic promoter database http://www.epd.isb-sib.ch
  HemoPDB Hematopoietic promoter database: transcriptional regulation in hematopoiesis http://bioinformatics.med.ohio-state.edu/HemoPDB
  JASPAR PSSMs for transcription factor DNA-binding sites http://jaspar.cgb.ki.se
  MAPPER Putative transcription factor binding sites in various genomes http://bio.chip.org/mapper
  PLACE Plant cis-acting regulatory DNA elements http://www.dna.affrc.go.jp/htdocs/PLACE
  PlantCARE Plant promoters and cis-acting regulatory elements http://intra.psb.ugent.be:8080/PlantCARE/
  PlantProm Plant promoter sequences for RNA polymerase II http://mendel.cs.rhul.ac.uk/
  PRODORIC Prokaryotic database of gene regulation networks http://prodoric.tu-bs.de/
  PromEC E.coli promoters with experimentally identified transcriptional start sites http://bioinfo.md.huji.ac.il/marg/promec
  SELEX_DB DNA and RNA binding sites for various proteins, found by systematic evolution of ligands by exponential enrichment http://wwwmgs.bionet.nsc.ru/mgs/systems/selex/
  TESS Transcription element search system http://www.cbil.upenn.edu/tess
  TRACTOR db Transcription factors in gamma-proteobacteria database http://www.tractor.lncc.br/
  TRANSCompel Composite regulatory elements affecting gene transcription in eukaryotes http://www.gene-regulation.com/pub/databases.html#transcompel
  TRANSFAC Transcription factors and binding sites http://transfac.gbf.de/TRANSFAC/index.html
  TRED Transcriptional regulatory element database http://rulai.cshl.edu/tred
  TRRD Transcription regulatory regions of eukaryotic genes http://www.bionet.nsc.ru/trrd/
2. RNA sequence databases
  16S and 23S rRNA Mutation Database 16S and 23S ribosomal RNA mutations http://www.fandm.edu/Departments/Biology/Databases/RNA.html
  5S rRNA Database 5S rRNA sequences http://biobases.ibch.poznan.pl/5SData/
  Aptamer database Small RNA/DNA molecules binding nucleic acids, proteins http://aptamer.icmb.utexas.edu/
  ARED AU-rich element-containing mRNA database http://rc.kfshrc.edu.sa/ared
  Mobile group II introns A database of group II introns, self-splicing catalytic RNAs http://www.fp.ucalgary.ca/group2introns/
  European rRNA database All complete or nearly complete rRNA sequences http://www.psb.ugent.be/rRNA/
  GtRDB Genomic tRNA database http://rna.wustl.edu/GtRDB
  Guide RNA Database RNA editing in various kinetoplastid species http://biosun.bio.tu-darmstadt.de/goringer/gRNA/gRNA.html
  HIV Sequence Database HIV RNA sequences http://hiv-web.lanl.gov/
  HuSiDa Human siRNA database http://itb1.biologie.hu-berlin.de/~nebulus/sirna/
  HyPaLib Hybrid pattern library: structural elements in classes of RNA http://bibiserv.techfak.uni-bielefeld.de/HyPa/
  IRESdb Internal ribosome entry site database http://ifr31w3.toulouse.inserm.fr/IRESdatabase/
  microRNA Registry Database of microRNAs (small non-coding RNAs) http://www.sanger.ac.uk/Software/Rfam/mirna/
  NCIR Non-canonical interactions in RNA structures http://prion.bchs.uh.edu/bp_type/
  ncRNAs Database Non-coding RNAs with regulatory functions http://biobases.ibch.poznan.pl/ncRNA/
  NONCODE A database of non-coding RNAs http://www.bioinfo.org.cn/NONCODE/index.htm
  PLANTncRNAs Plant non-coding RNAs http://www.prl.msu.edu/PLANTncRNAs
  Plant snoRNA DB snoRNA genes in plant species http://bioinf.scri.sari.ac.uk/cgi-bin/plant_snorna/home
  PolyA_DB A database of mammalian mRNA polyadenylation http://polya.umdnj.edu/polyadb/
  PseudoBase Database of RNA pseudoknots http://wwwbio.leidenuniv.nl/~Batenburg/PKB.html
  Rfam Non-coding RNA families http://www.sanger.ac.uk/Software/Rfam/
  RISSC Ribosomal internal spacer sequence collection http://ulises.umh.es/RISSC
  RNAdb Mammalian non-coding RNA database http://ncrna.bioinformatics.com.au/
  RNA Modification Database Naturally modified nucleosides in RNA http://medlib.med.utah.edu/RNAmods/
  RRNDB rRNA operon numbers in various prokaryotes http://rrndb.cme.msu.edu/
  siRNAdb siRNA database and search engine http://sirna.cgb.ki.se/
  Small RNA Database Small RNAs from prokaryotes and eukaryotes http://mbcr.bcm.tmc.edu/smallRNA
  SRPDB Signal recognition particle database http://psyche.uthct.edu/dbs/SRPDB/SRPDB.html
  SSU rRNA Modification Database Modified nucleosides in small subunit rRNA http://medstat.med.utah.edu/SSUmods/
  Subviral RNA Database Viroids and viroid-like RNAs http://subviral.med.uottawa.ca/
  tmRNA Website tmRNA sequences and alignments http://www.indiana.edu/~tmrna
  tmRDB tmRNA database http://psyche.uthct.edu/dbs/tmRDB/tmRDB.html
  tRNA sequences tRNA viewer and sequence editor http://www.uni-bayreuth.de/departments/biochemie/trna/
  UTRdb/UTRsite 5'- and 3'-UTRs of eukaryotic mRNAs http://bighost.area.ba.cnr.it/srs6/
3. Protein sequence databases
3.1. General sequence databases
  EXProt Sequences of proteins with experimentally verified function http://www.cmbi.kun.nl/EXProt/
  NCBI Protein database All protein sequences: translated from GenBank and imported from other protein databases http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein
  PA-GOSUB Protein sequences from model organisms, GO assignment and subcellular localization http://www.cs.ualberta.ca/~bioinfo/PA/ProteomeIndex.html
  PIR-PSD Protein information resource protein sequence database, has been merged into the UniProt knowledgebase http://pir.georgetown.edu/
  PIR-NREF PIR's non-redundant reference protein database http://pir.georgetown.edu/pirwww/pirnref.shtml
  PRF Protein research foundation database of peptides: sequences, literature and unnatural amino acids http://www.prf.or.jp/en
  Swiss-Prot Now UniProt/Swiss-Prot: expertly curated protein sequence database, section of the UniProt knowledgebase http://www.expasy.org/sprot
  TrEMBL Now UniProt/TrEMBL: computer-annotated translations of EMBL nucleotide sequence entries: section of the UniProt knowledgebase http://www.expasy.org/sprot
  UniParc UniProt archive: a repository of all protein sequences, consisting only of unique identifiers and sequence http://www.uniprot.org/database/archive.shtml
  UniProt Universal protein knowledgebase: merged data from Swiss-Prot, TrEMBL and PIR protein sequence databases http://www.uniprot.org/
  UniRef UniProt non-redundant reference database: clustered sets of related sequences (including splice variants and isoforms) http://www.uniprot.org/database/nref.shtml
3.2. Protein properties
  AAindex Physicochemical properties of amino acids http://www.genome.ad.jp/aaindex/
  ProNIT Thermodynamic data on protein–nucleic acid interactions http://gibk26.bse.kyutech.ac.jp/jouhou/pronit/pronit.html
  ProTherm Thermodynamic data for wild-type and mutant proteins http://gibk26.bse.kyutech.ac.jp/jouhou/protherm/protherm.html
  TECRdb Thermodynamics of enzyme-catalyzed reactions http://xpdb.nist.gov/enzyme_thermodynamics/
3.3. Protein localization and targeting
  DBSubLoc Database of protein subcellular localization http://www.bioinfo.tsinghua.edu.cn/dbsubloc.html
  NESbase Nuclear export signals database http://www.cbs.dtu.dk/databases/NESbase
  NLSdb Nuclear localization signals http://cubic.bioc.columbia.edu/db/NLSdb/
  NMPdb Nuclear matrix associated proteins database http://www.rostlab.org/db/NMPdb/
  NOPdb Nucleolar proteome database http://www.lamondlab.com/NOPdb/
  PSORTdb Protein subcellular localization in bacteria http://db.psort.org/
  SPD Secreted protein database http://spd.cbi.pku.edu.cn
  THGS Transmembrane helices in genome sequences http://pranag.physics.iisc.ernet.in/thgs/
  TMPDB Experimentally characterized transmembrane topologies http://bioinfo.si.hirosaki-u.ac.jp/~TMPDB/
3.4. Protein sequence motifs and active sites
  ASC Active sequence collection: biologically active peptides http://bioinformatica.isa.cnr.it/ASC/
  Blocks Alignments of conserved regions in protein families http://blocks.fhcrc.org/
  CSA Catalytic site atlas: active sites and catalytic residues in enzymes of known 3D structure http://www.ebi.ac.uk/thornton-srv/databases/CSA/
  COMe Co-ordination of metals etc.: classification of bioinorganic proteins (metalloproteins and some other complex proteins) http://www.ebi.ac.uk/come
  CopS Comprehensive peptide signature database http://203.195.151.46/copsv2/index.html
  eBLOCKS Highly conserved protein sequence blocks http://fold.stanford.edu/eblocks/acsearch.html
  eMOTIF Protein sequence motif determination and searches http://motif.stanford.edu/emotif
  Metalloprotein Site Database Metal-binding sites in metalloproteins http://metallo.scripps.edu/
  O-GlycBase O- and C-linked glycosylation sites in proteins http://www.cbs.dtu.dk/databases/OGLYCBASE/
  PDBSite 3D structure of protein functional sites http://srs6.bionet.nsc.ru/srs6/
  Phospho.ELM S/T/Y protein phosphorylation sites (formerly PhosphoBase) http://phospho.elm.eu.org/
  PROMISE Prosthetic centers and metal ions in protein active sites http://metallo.scripps.edu/PROMISE
  PROSITE Biologically significant protein patterns and profiles http://www.expasy.org/prosite
  ProTeus Signature sequences at the protein N- and C-termini http://www.proteus.cs.huji.ac.il/
3.5. Protein domain databases; protein classification
  ADDA A database of protein domain classification http://ekhidna.biocenter.helsinki.fi:8080/examples/servlets/adda/
  CDD Conserved domain database, includes protein domains from Pfam, SMART, COG and KOG databases http://www.ncbi.nlm.nih.gov/Structure/cdd/cdd.shtml
  CluSTr Clusters of Swiss-Prot+TrEMBL proteins http://www.ebi.ac.uk/clustr
  FunShift Functional divergence between the subfamilies of a protein domain family http://funshift.cgb.ki.se/
  Hits A database of protein domains and motifs http://hits.isb-sib.ch/
  InterPro Integrated resource of protein families, domains and functional sites http://www.ebi.ac.uk/interpro
  iProClass Integrated protein classification database http://pir.georgetown.edu/iproclass/
  PIRSF Family/superfamily classification of whole proteins http://pir.georgetown.edu/pirsf/
  PRINTS Hierarchical gene family fingerprints http://www.bioinf.man.ac.uk/dbbrowser/PRINTS/
  Pfam Protein families: multiple sequence alignments and profile hidden Markov models of protein domains http://www.sanger.ac.uk/Software/Pfam/
  PRECISE Predicted and consensus interaction sites in enzymes http://precise.bu.edu/precisedb/
  ProDom Protein domain families http://www.toulouse.inra.fr/prodom.html
  ProtoMap Hierarchical classification of Swiss-Prot proteins http://protomap.cornell.edu/
  ProtoNet Hierarchical clustering of Swiss-Prot proteins http://www.protonet.cs.huji.ac.il/
  S4 Structure-based sequence alignments of SCOP superfamilies http://compbio.mds.qmw.ac.uk/~james/S4.shtml
  SBASE Protein domain sequences and tools http://www.icgeb.org/sbase
  SMART Simple modular architecture research tool: signalling, extracellular and chromatin-associated protein domains http://smart.embl-heidelberg.de/
  SUPFAM Grouping of sequence families into superfamilies http://pauling.mbu.iisc.ernet.in/~supfam
  SYSTERS Systematic re-searching and clustering of proteins http://systers.molgen.mpg.de/
  TIGRFAMs TIGR protein families adapted for functional annotation http://www.tigr.org/TIGRFAMs
3.6. Databases of individual protein families
  AARSDB Aminoacyl-tRNA synthetase database http://rose.man.poznan.pl/aars/index.html
  ASPD Artificial selected proteins/peptides database http://wwwmgs.bionet.nsc.ru/mgs/gnw/aspd/
  BacTregulators Transcriptional regulators of AraC and TetR families http://www.bactregulators.org/
  CSDBase Cold shock domain-containing proteins http://www.chemie.uni-marburg.de/~csdbase/
  CuticleDB Structural proteins of Arthropod cuticle http://bioinformatics.biol.uoa.gr/cuticleDB
  DCCP Database of copper-chelating proteins http://sdbi.sdut.edu.cn/DCCP/en/index.php
  DExH/D Family Database DEAD-box, DEAH-box and DExH-box proteins http://www.helicase.net/dexhd/dbhome.htm
  Endogenous GPCR List G protein-coupled receptors; expression in cell lines http://www.tumor-gene.org/GPCR/gpcr.html
  ESTHER Esterases and other alpha/beta hydrolase enzymes http://www.ensam.inra.fr/esther
  EyeSite Families of proteins functioning in the eye http://eyesite.cryst.bbk.ac.uk/
  GPCRDB G protein-coupled receptors database http://www.gpcr.org/7tm/
  gpDB G-proteins and their interaction with GPCRs http://bioinformatics.biol.uoa.gr/gpDB
  Histone Database Histone fold sequences and structures http://research.nhgri.nih.gov/histones/
  Homeobox Page Homeobox proteins, classification and evolution http://www.biosci.ki.se/groups/tbu/homeo.html
  Hox-Pro Homeobox genes database http://www.iephb.nw.ru/labs/lab38/spirov/hox_pro/hox-pro00.html
  Homeodomain Resource Homeodomain sequences, structures and related genetic and genomic information http://research.nhgri.nih.gov/homeodomain/
  HORDE Human olfactory receptor data exploratorium http://bioinfo.weizmann.ac.il/HORDE/
  InBase Inteins (protein splicing elements) database: properties, sequences, bibliography http://www.neb.com/neb/inteins.html
  KinG—Kinases in Genomes S/T/Y-specific protein kinases encoded in complete genomes http://hodgkin.mbu.iisc.ernet.in/~king
  Knottins Database of knottins—small proteins with an unusual ‘disulfide through disulfide’ knot http://knottin.cbs.cnrs.fr
  LGICdb Ligand-gated ion channel subunit sequences database http://www.pasteur.fr/recherche/banques/LGIC/LGIC.html
  Lipase Engineering Database Sequence, structure and function of lipases and esterases http://www.led.uni-stuttgart.de/
  LOX-DB Mammalian, invertebrate, plant and fungal lipoxygenases http://www.dkfz-heidelberg.de/spec/lox-db/
  MEROPS Database of proteolytic enzymes (peptidases) http://merops.sanger.ac.uk/
  NPD Nuclear protein database http://npd.hgu.mrc.ac.uk/
  NucleaRDB Nuclear receptor superfamily http://www.receptors.org/NR/
  Nuclear Receptor Resource Nuclear receptor superfamily http://nrr.georgetown.edu/NRR/nrrhome.htm
  NUREBASE Nuclear hormone receptors database http://www.ens-lyon.fr/LBMC/laudet/nurebase.html
  Olfactory Receptor Database Sequences for olfactory receptor-like molecules http://senselab.med.yale.edu/senselab/ordb/
  ooTFD Object-oriented transcription factors database http://www.ifti.org/ootfd
  PKR Protein kinase resource: sequences, enzymology, genetics and molecular and structural properties http://pkr.sdsc.edu/html/index.shtml
  PLPMDB Pyridoxal-5'-phosphate dependent enzymes mutations http://www.studiofmp.com/plpmdb/
  ProLysED A database of bacterial protease systems http://genome.ukm.my/prolyses/
  Prolysis Proteases and natural and synthetic protease inhibitors http://delphi.phys.univ-tours.fr/Prolysis/
  REBASE Restriction enzymes and associated methylases http://rebase.neb.com/rebase/rebase.html
  Ribonuclease P Database RNase P sequences, alignments and structures http://www.mbio.ncsu.edu/RNaseP/home.html
  RPG Ribosomal protein gene database http://ribosome.miyazaki-med.ac.jp/
  RTKdb Receptor tyrosine kinase sequences http://pbil.univ-lyon1.fr/RTKdb/
  S/MARt dB Nuclear scaffold/matrix attached regions http://smartdb.bioinf.med.uni-goettingen.de/
  Scorpion Database of scorpion toxins http://research.i2r.a-star.edu.sg:8080/scorpion/
  SDAP Structural database of allergenic proteins and food allergens http://fermi.utmb.edu/SDAP
  SENTRA Sensory signal transduction proteins http://www-wit.mcs.anl.gov/sentra/
  SEVENS 7-transmembrane helix receptors (G-protein-coupled) http://sevens.cbrc.jp/
  SRPDB Proteins of the signal recognition particles http://bio.lundberg.gu.se/dbs/SRPDB/SRPDB.html
  TrSDB Transcription factor database http://ibb.uab.es/trsdb
  VKCDB Voltage-gated potassium channel database http://vkcdb.biology.ualberta.ca/
  Wnt Database Wnt proteins and phenotypes http://www.stanford.edu/~rnusse/wntwindow.html
4. Structure Databases
4.1. Small molecules
  ChEBI Chemical entities of biological interest http://www.ebi.ac.uk/chebi/
  CSD Cambridge structural database: crystal structure information for organic and metal-organic compounds http://www.ccdc.cam.ac.uk/prods/csd/csd.html
  HIC-Up Hetero-compound Information Centre—Uppsala http://xray.bmc.uu.se/hicup
  AANT Amino acid–nucleotide interaction database http://aant.icmb.utexas.edu/
  Klotho Collection and categorization of biological compounds http://www.biocheminfo.org/klotho
  LIGAND Chemical compounds and reactions in biological pathways http://www.genome.ad.jp/ligand/
  PDB-Ligand 3D structures of small molecules bound to proteins and nucleic acids http://www.idrtech.com/PDB-Ligand/
  PubChem Structures and biological activities of small organic molecules http://pubchem.ncbi.nlm.nih.gov/
4.2. Carbohydrates
  CCSD Complex carbohydrate structure database (CarbBank) http://bssv01.lancs.ac.uk/gig/pages/gag/carbbank.htm
  CSS Carbohydrate structure suite: carbohydrate 3D structures derived from the PDB http://www.dkfz.de/spec/css/
  Glycan Carbohydrate database, part of the KEGG system http://glycan.genome.ad.jp/
  GlycoSuiteDB N- and O-linked glycan structures and biological sources http://www.glycosuite.com/
  Monosaccharide Browser Space-filling Fischer projections of monosaccharides http://www.jonmaber.demon.co.uk/monosaccharide
  SWEET-DB Annotated carbohydrate structure and substance information http://www.dkfz-heidelberg.de/spec2/sweetdb/
4.3. Nucleic acid structure
  NDB Nucleic acid-containing structures http://ndbserver.rutgers.edu/
  NTDB Thermodynamic data for nucleic acids http://ntdb.chem.cuhk.edu.hk/
  RNABase RNA-containing structures from PDB and NDB http://www.rnabase.org/
  SCOR Structural classification of RNA: RNA motifs by structure, function and tertiary interactions http://scor.lbl.gov/
4.4. Protein structure
  ArchDB Automated classification of protein loop structures http://gurion.imim.es/archdb
  ASTRAL Sequences of domains of known structure, selected subsets and sequence–structure correspondences http://astral.stanford.edu/
  BAliBASE A database for comparison of multiple sequence alignments http://www-igbmc.u-strasbg.fr/BioInfo/BAliBASE2/index.html
  BioMagResBank NMR spectroscopic data for proteins and nucleic acids http://www.bmrb.wisc.edu/
  CADB Conformational angles in proteins database http://cluster.physics.iisc.ernet.in/cadb/
  CATH Protein domain structures database http://www.biochem.ucl.ac.uk/bsm/cath_new
  CE 3D protein structure alignments http://cl.sdsc.edu/ce.html
  CKAAPs DB Structurally similar proteins with dissimilar sequences http://ckaap.sdsc.edu/
  Dali Protein fold classification using the Dali search engine http://www.bioinfo.biocenter.helsinki.fi:8080/dali/
  Decoys ‘R’ Us Computer-generated protein conformations http://dd.stanford.edu/
  DisProt Database of Protein Disorder: proteins that lack fixed 3D structure in their native states http://divac.ist.temple.edu/disprot
  DomIns Domain insertions in known protein structures http://stash.mrc-lmb.cam.ac.uk/DomIns
  DSDBASE Native and modeled disulfide bonds in proteins http://www.ncbs.res.in/~faculty/mini/dsdbase/dsdbase.html
  DSMM Database of simulated molecular motions http://projects.villa-bosch.de/dbase/dsmm/
  eF-site Electrostatic surface of Functional site: electrostatic potentials and hydrophobic properties of the active sites http://ef-site.protein.osaka-u.ac.jp/eF-site
  GenDiS Genomic distribution of protein structural superfamilies http://caps.ncbs.res.in/gendis/home.html
  Gene3D Precalculated structural assignments for whole genomes http://www.biochem.ucl.ac.uk/bsm/cath_new/Gene3D/
  GTD Genomic threading database: structural annotations of complete proteomes http://bioinf.cs.ucl.ac.uk/GTD
  GTOP Protein fold predictions from genome sequences http://spock.genes.nig.ac.jp/~genome/
  Het-PDB Navi Hetero-atoms in protein structures http://daisy.nagahama-i-bio.ac.jp/golab/hetpdbnavi.html
  HOMSTRAD Homologous structure alignment database: curated structure-based alignments for protein families http://www-cryst.bioc.cam.ac.uk/homstrad
  IMB Jena Image Library Visualization and analysis of 3D biopolymer structures http://www.imb-jena.de/IMAGE.html
  IMGT/3Dstructure-DB Sequences and 3D structures of vertebrate immunoglobulins, T cell receptors and MHC proteins http://imgt3d.igh.cnrs.fr/
  ISSD Integrated sequence–structure database http://www.protein.bio.msu.su/issd
  LPFC Library of protein family core structures http://www-smi.stanford.edu/projects/helix/LPFC
  MMDB NCBI's database of 3D structures, part of NCBI Entrez http://www.ncbi.nlm.nih.gov/Structure
  E-MSD EBI's macromolecular structure database http://www.ebi.ac.uk/msd
  ModBase Annotated comparative protein structure models http://alto.compbio.ucsf.edu/modbase-cgi/index.cgi
  MolMovDB Database of macromolecular movements: descriptions of protein and macromolecular motions, including movies http://bioinfo.mbb.yale.edu/MolMovDB/
  PALI Phylogeny and alignment of homologous protein structures http://pauling.mbu.iisc.ernet.in/~pali
  PASS2 Structural motifs of protein superfamilies http://ncbs.res.in/~faculty/mini/campass/pass.html
  PepConfDB A database of peptide conformations http://www.peptidome.org/products/list.htm
  PDB Protein structure databank: all publicly available 3D structures of proteins and nucleic acids http://www.rcsb.org/pdb
  PDB-REPRDB Representative protein chains, based on PDB entries http://mbs.cbrc.jp/pdbreprdb-cgi/reprdb_menu.pl
  PDBsum Summaries and analyses of PDB structures http://www.biochem.ucl.ac.uk/bsm/pdbsum
  PDB_TM Transmembrane proteins with known 3D structure http://www.enzim.hu/PDB_TM/
  Protein Folding Database Experimental data on protein folding http://pfd.med.monash.edu.au
  SCOP Structural classification of proteins http://scop.mrc-lmb.cam.ac.uk/scop
  Sloop Classification of protein loops http://www-cryst.bioc.cam.ac.uk/~sloop/
  Structure Superposition Database Pairwise superposition of TIM-barrel structures http://ssd.rbvi.ucsf.edu/
  SWISS-MODEL Repository Database of annotated 3D protein structure models http://swissmodel.expasy.org/repository
  SUPERFAMILY Assignments of proteins to structural superfamilies http://supfam.org/
  SURFACE Surface residues and functions annotated, compared and evaluated: a database of protein surface patches http://cbm.bio.uniroma2.it/surface
  TargetDB Target data from worldwide structural genomics projects http://targetdb.pdb.org/
  3D-GENOMICS Structural annotations for complete proteomes http://www.sbg.bio.ic.ac.uk/3dgenomics
  TOPS Topology of protein structures database http://www.tops.leeds.ac.uk
5. Genomics Databases (non-human)
5.1. Genome annotation terms, ontologies and nomenclature
  Genew Human gene nomenclature: approved gene symbols http://www.gene.ucl.ac.uk/nomenclature
  GO Gene ontology consortium database http://www.geneontology.org/
  GOA EBI's gene ontology annotation project http://www.ebi.ac.uk/GOA
  IUBMB Nomenclature database Nomenclature of enzymes, membrane transporters, electron transport proteins and other proteins http://www.chem.qmul.ac.uk/iubmb
  IUPAC Nomenclature database Nomenclature of biochemical and organic compounds approved by the IUBMB-IUPAC Joint Commission http://www.chem.qmul.ac.uk/iupac
  IUPHAR-RD The International Union of Pharmacology recommendations on receptor nomenclature and drug classification http://www.iuphar-db.org/iuphar-rd/
  PANTHER Gene products organized by biological function http://panther.celera.com/
  UMLS Unified medical language system http://umlsks.nlm.nih.gov/
5.1.1. Taxonomy and Identification
  ICB gyrB database for identification and classification of bacteria http://seasquirt.mbio.co.jp/icb/index.php
  NCBI Taxonomy Names of all organisms represented in GenBank http://www.ncbi.nlm.nih.gov/Taxonomy/
  PANDIT Protein and associated nucleotide domains with inferred trees http://www.ebi.ac.uk/goldman-srv/pandit/
  RIDOM rRNA-based differentiation of medical microorganisms http://www.ridom-rdna.de/
  RDP-II Ribosomal database project http://rdp.cme.msu.edu
  Tree of Life Information on phylogeny and biodiversity http://phylogeny.arizona.edu/tree/phylogeny.html
5.2. General genomics databases
  COG Clusters of orthologous groups of proteins http://www.ncbi.nlm.nih.gov/COG
  COGENT Complete genome tracking: predicted peptides from fully sequenced genomes http://maine.ebi.ac.uk:8000/services/cogent/
  CORG Comparative regulatory genomics: conserved non-coding sequence blocks http://corg.molgen.mpg.de/
  DEG Database of essential genes from bacteria and yeast http://tubic.tju.edu.cn/deg
  EBI Genomes EBI's collection of databases for the analysis of complete and unfinished viral, pro- and eukaryotic genomes http://www.ebi.ac.uk/genomes
  EGO Eukaryotic gene orthologs: orthologous DNA sequences in the TIGR gene indices http://www.tigr.org/tdb/tgi/ego/
  EMGlib Enhanced microbial genomes library: completely sequenced genomes of unicellular organisms http://pbil.univ-lyon1.fr/emglib/emglib.html
  Entrez Genomes NCBI's collection of databases for the analysis of complete and unfinished viral, pro- and eukaryotic genomes http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Genome
  ERGOLight Integrated biochemical data on nine bacterial genomes: publicly available portion of the ERGO database http://www.ergo-light.com/ERGO
  FusionDB Database of bacterial and archaeal gene fusion events http://igs-server.cnrs-mrs.fr/FusionDB
  Genome Atlas DNA structural properties of sequenced genomes http://www.cbs.dtu.dk/services/GenomeAtlas/
  Genome Information Broker DDBJ's collection of databases for the analysis of complete and unfinished viral, pro- and eukaryotic genomes http://gib.genes.nig.ac.jp
  Genome Reviews Integrated view of complete genomes http://www.ebi.ac.uk/GenomeReviews/
  GOLD Genomes online database: a listing of completed and ongoing genome projects http://www.genomesonline.org/
  HGT-DB Putative horizontally transferred genes in prokaryotic genomes http://www.fut.es/~debb/HGT/
  Integr8 Functional classification of proteins in whole genomes http://www.ebi.ac.uk/integr8/
  KEGG Kyoto encyclopedia of genes and genomes: integrated suite of databases on genes, proteins and metabolic pathways http://www.genome.jp/kegg
  MBGD Microbial genome database for comparative analysis http://mbgd.genome.ad.jp/
  ORFanage Database of orphan ORFs (ORFs with no homologs) in complete microbial genomes http://www.cs.bgu.ac.il/~nomsiew/ORFans
  PACRAT Archaeal and bacterial intergenic sequence features http://www.biosci.ohio-tate.edu/~pacrat
  PartiGeneDB Assembled partial genomes for 250 eukaryotic organisms http://www.partigenedb.org/
  PEDANT Results of an automated analysis of genomic sequences http://pedant.gsf.de/
  TIGR Microbial Database Lists of completed and ongoing genome projects with links to complete genome sequences http://www.tigr.org/tdb/mdb/mdbcomplete.html
  TIGR Comprehensive Microbial Resource Various data on complete microbial genomes: uniform annotation, properties of DNA and predicted proteins http://www.tigr.org/CMR
  TransportDB Predicted membrane transporters in complete genomes, classified according to the TC classification system http://www.membranetransport.org/
  WIT3 What is there? Metabolic reconstruction for completely sequenced microbial genomes http://www-wit.mcs.anl.gov/wit3/
5.3. Organism-specific databases
5.3.1. Viruses
  HCVDB The hepatitis C virus database http://hepatitis.ibcp.fr/
  HIV Drug Resistance Database HIV mutations that confer resistance to anti-HIV drugs http://resdb.lanl.gov/Resist_DB/default.htm
  HIV Molecular Immunology Database HIV epitopes http://hiv-web.lanl.gov/immunology/
  HIV RT and Protease Sequence Database HIV reverse transcriptase and protease sequences http://hivdb.stanford.edu/
  NCBI Viral Genomes Viral genome resource at NCBI http://www.ncbi.nlm.nih.gov/genomes/VIRUSES/viruses.html
  Poxvirus.org Poxvirus genomic sequences and gene annotation http://www.poxvirus.org/
  T4-like genome database Sequences of T4-like bacteriophages from various sources http://phage.bioc.tulane.edu/
  VIDA Homologous viral protein families database http://www.biochem.ucl.ac.uk/bsm/virus_database/VIDA.html
  VIPER Virus particle explorer: virus capsid structures http://mmtsb.scripps.edu/viper
  VirOligo Virus-specific oligonucleotides for PCR and hybridization http://viroligo.okstate.edu/
5.3.2. Prokaryotes
  BacMap Picture atlas of annotated bacterial genomes http://wishart.biology.ualberta.ca/BacMap
  MetaGrowth Growth requirements of bacterial pathogens http://igs-server.cnrs-mrs.fr/axenic/
  PGTdb Prokaryotic growth temperature database http://pgtdb.csie.ncu.edu.tw/
5.3.2.1. Escherichia coli
  ASAP A systematic annotation package for community analysis of E.coli and related genomes https://asap.ahabs.wisc.edu/annotation/php/ASAP1.htm
  CyberCell database A collection of data on E.coli K12 intended for mathematical modeling to simulate the bacterial cell http://redpoll.pharmacy.ualberta.ca/CCDB
  coliBase A database for E.coli, Salmonella and Shigella http://colibase.bham.ac.uk/
  Colibri E.coli genome database at Institut Pasteur http://genolist.pasteur.fr/Colibri/
  EchoBASE Post-genomic studies of Escherichia coli http://www.ecoli-york.org/
  Essential genes in E.coli First results of an E.coli gene deletion project http://www.genome.wisc.edu/resources/essential.htm
  GenoBase E.coli genome database at Nara Institute http://ecoli.aist-nara.ac.jp/
  GenProtEC E.coli K12 genome and proteome database http://genprotec.mbl.edu
  PEC Profiling of E.coli chromosome http://shigen.lab.nig.ac.jp/ecoli/pec
  EcoCyc E.coli K12 genes, metabolic pathways, transporters and gene regulation http://ecocyc.org/
  EcoGene Sequence and literature data on E.coli genes and proteins http://bmb.med.miami.edu/EcoGene/EcoWeb/
  RegulonDB Transcriptional regulation and operon organization in E.coli http://www.cifn.unam.mx/Computational_Genomics/regulondb/
5.3.2.2. Bacillus subtilis
  BSORF Bacillus subtilis genome database at Kyoto U. http://bacillus.genome.ad.jp/
  NRSub Non-redundant Bacillus subtilis database at U. Lyon http://pbil.univ-lyon1.fr/nrsub/nrsub.html
  SubtiList Bacillus subtilis genome database at Institut Pasteur http://genolist.pasteur.fr/SubtiList/
5.3.2.3. Other bacteria
  BioCyc Pathway/genome databases for many bacteria http://biocyc.org/
  CampyDB Database for Campylobacter genome analysis http://campy.bham.ac.uk/
  ClostriDB Finished and unfinished genomes of Clostridium spp. http://clostri.bham.ac.uk/
  CIDB Chlamydia Interactive Database: gene expression data http://www.it.deakin.edu.au/CIDB
  CyanoBase Cyanobacterial genomes http://www.kazusa.or.jp/cyano
  LeptoList Leptospira interrogans genome http://bioinfo.hku.hk/LeptoList
  MolliGen Genomic data on mollicutes http://cbi.labri.fr/outils/molligen/
  PseudoCAP Pseudomonas aeruginosa genome database and community annotation project http://www.pseudomonas.com/
  RsGDB Rhodobacter sphaeroides genome http://www.hgsc.bcm.tmc.edu/projects/microbial/Rsphaeroides/
  VirFact Bacterial virulence factors and pathogenicity islands http://virfact.burnham.org/
  Virulence Factors Reference database for microbial virulence factors http://zdsys.chgb.org.cn/VFs/main.htm
5.3.3. Unicellular eukaryotes
  ApiEST-DB EST sequences from various Apicomplexan parasites http://www.cbil.upenn.edu/paradbs-servlet
  CryptoDB Cryptosporidium parvum genome database http://cryptodb.org/
  Diatom EST Database ESTs from two diatom algae, Thalassiosira pseudonana and Phaeodactylum tricornutum http://avesthagen.sznbowler.com/
  DictyBase Universal resource for Dictyostelium discoideum http://dictybase.org/
  Full-Malaria Full-length cDNA library from erythrocytic-stage Plasmodium falciparum http://fullmal.ims.u-tokyo.ac.jp/
  GeneDB Curated database for various Sanger-sequenced genomes http://www.genedb.org/
  LumbriBASE ESTs of the earthworm Lumbricus rubellus http://www.earthworms.org/
  PlasmoDB Plasmodium genome database http://plasmodb.org/
  TcruziDB Trypanosoma cruzi genome database http://tcruzidb.org/
  ToxoDB Toxoplasma gondii genome database http://toxodb.org/
5.3.4. Fungi
5.3.4.1. Yeasts
  AGD Ashbya gossypii genome database http://agd.unibas.ch
  CandidaDB Candida albicans genome database http://genolist.pasteur.fr/CandidaDB
  Candida Genome Candida albicans genome database http://www.candidagenome.org/
  CYGD MIPS Comprehensive yeast genome database http://mips.gsf.de/proj/yeast
  Génolevures A comparison of S.cerevisiae and 14 other yeast species http://cbi.labri.fr/Genolevures
  PROPHECY Profiling of phenotypic characteristics in yeast http://prophecy.lundberg.gu.se/
  SCMD Saccharomyces cerevisiae morphological database: micrographs of budding yeast mutants http://yeast.gi.k.u-tokyo.ac.jp/
  SCPD Saccharomyces cerevisiae promoter database http://cgsigma.cshl.org/jian
  SGD Saccharomyces genome database http://www.yeastgenome.org/
  TRIPLES Transposon-insertion phenotypes, localization and expression in Saccharomyces http://ygac.med.yale.edu/triples/
  YDPM Yeast deletion project and mitochondria database http://www-deletion.stanford.edu/YDPM/YDPM_index.html
  Yeast Intron Database Ares lab database of splicesomal introns in S.cerevisiae http://www.cse.ucsc.edu/research/compbio/yeast_introns.html
  Yeast snoRNA Database Yeast small nucleolar RNAs http://www.bio.umass.edu/biochem/rna-sequence/Yeast_snoRNA_Database/snoRNA_DataBase.html
  yMGV Yeast microarray global viewer http://www.transcriptome.ens.fr/ymgv/
  YRC PDR Yeast resource center public data repository http://www.yeastrc.org/pdr/
5.3.4.2. Other fungi
  CADRE Central Aspergillus data repository http://www.cadre.man.ac.uk/
  COGEME Phytopathogenic fungi and oomycete EST database http://cogeme.ex.ac.uk
  MNCDB MIPS Neurospora crassa database http://mips.gsf.de/proj/neurospora/
  OGD Oomycete Genomics Database: ESTs and annotation http://www.oomycete.net/
  Phytophthora Functional Genomics Database ESTs and expression data from P.infestans and P.sojae http://www.pfgd.org/pfgd/
5.3.5. Invertebrates
5.3.5.1. Caenorhabditis elegans
  C.elegans Project Genome sequencing data at the Sanger Institute http://www.sanger.ac.uk/Projects/C_elegans
  Intronerator Introns and splicing in C.elegans and C.briggsae http://www.cse.ucsc.edu/~kent/intronerator/
  RNAiDB RNAi phenotypic analysis of C.elegans genes http://www.rnai.org/
  WILMA C.elegans annotation database http://www.came.sbg.ac.at/wilma/
  WorfDB C.elegans ORFeome http://worfdb.dfci.harvard.edu/
  WormBase Data repository for C.elegans and C.briggsae: curated genome annotation, genetic and physical maps, pathways http://www.wormbase.org/
5.3.5.2. Drosophila melanogaster
  FlyBase Drosophila sequences and genomic information http://flybase.bio.indiana.edu/
  FlyBrain Database of the Drosophila nervous system http://flybrain.neurobio.arizona.edu
  FlyMine Integration of insect genomic and proteomic data http://www.flymine.org/
  FlyTrap Drosophila mutants created using GFP protein trap strategy http://flytrap.med.yale.edu/
  GadFly Genome annotation database of Drosophila http://www.fruitfly.org
  GeniSys Enhancer- and promoter-inserted mutants of Drosophila http://genisys.kaist.ac.kr:8080/
  DPDB Drosophila polymorphism database http://dpdb.uab.es/
  Drosophila microarray project Data and tools for Drosophila gene expression studies http://www.flyarrays.com/fruitfly
  InterActive Fly Drosophila genes and thei