Reset all


Content Types


AID systems


Data access

Data access restrictions

Database access

Database access restrictions

Database licenses

Data licenses

Data upload

Data upload restrictions

Enhanced publication

Institution responsibility type

Institution type


Metadata standards

PID systems

Provider types

Quality management

Repository languages



Repository types


  • * at the end of a keyword allows wildcard searches
  • " quotes can be used for searching phrases
  • + represents an AND search (default)
  • | represents an OR search
  • - represents a NOT operation
  • ( and ) implies priority
  • ~N after a word specifies the desired edit distance (fuzziness)
  • ~N after a phrase specifies the desired slop amount
Found 42 result(s)
STRING is a database of known and predicted protein interactions. The interactions include direct (physical) and indirect (functional) associations; they are derived from four sources: - Genomic Context - High-throughput Experiments - (Conserved) Coexpression - Previous Knowledge STRING quantitatively integrates interaction data from these sources for a large number of organisms, and transfers information between these organisms where applicable.
Chempound is a new generation repository architecture based on RDF, semantic dictionaries and linked data. It has been developed to hold any type of chemical object expressible in CML and is exemplified by crystallographic experiments and computational chemistry calculations. In both examples, the repository can hold >50k entries which can be searched by SPARQL endpoints and pre-indexing of key fields. The Chempound architecture is general and adaptable to other fields of data-rich science.
!!!!! This database doesn't exist anymore. 2017-09-05 !!!!!BeetleBase is a comprehensive sequence database and important community resource for Tribolium genetics, genomics and developmental biology. It provides genetic data on the Tribolium Castaneum, Red Flour Beetle, as gene maps, official gene set, reference sequences, predicted models, and whole-genome tiling array representing developmental stages.
The Organelle Genome Megasequencing Program (OGMP) provides mitochondrial, chloroplast, and mitochondrial plasmid genome data. OGMP tools allow direct comparison of OGMP and NCBI validated records. Includes GOBASE, a taxonomically broad organelle genome database that organizes and integrates diverse data related to mitochondria and chloroplasts.
>>>!!!<<< SMD has been retired. After approximately fifteen years of microarray-centric research service, the Stanford Microarray Database has been retired. We apologize for any inconvenience; please read below for possible resolutions to your queries. If you are looking for any raw data that was directly linked to SMD from a manuscript, please search one of the public repositories. NCBI Gene Expression Omnibus EBI ArrayExpress All published data were previously communicated to one (or both) of the public repositories. Alternatively, data for publications between 1997 and 2004 were likely migrated to the Princeton University MicroArray Database, and are accessible there. If you are looking for a manuscript supplement (i.e. from a domain other than, perhaps try searching the Internet Archive: Wayback Machine . >>>!!!<<< The Stanford Microarray Database (SMD) is a DNA microarray research database that provides a large amount of data for public use.
Presented here are excitation cross sections measured for a select number of transitions using the Merged Electron-Ion Beams Energy Loss (MEIBEL) experiment. This is a collaboration of JILA and the Multicharged Ion Research Facility (MIRF) at Oak Ridge National Laboratory (ORNL), where the apparatus is located. Since there exist a nearly infinite number of transitions in multicharged ions we have chosen a few that serve as benchmarks for theoretical efforts. Of particular interest are forbidden transitions which are often dominated by dielectronic resonances whose positions and magnitudes are difficult to predict theoretically.
Online materials database (known as PAULING FILE project) with nearly 2 million entries: physical properties, crystal structures, phase diagrams, available via API, ready for modern data-intensive applications. The source of these entries are about 300,000 peer-reviewed publications in materials science, processed during the last 16 years by an international team of PhD editors. The results are presented online with a quick search interface. The basic access is provided for free.
The Allen Brain Atlas provides a unique online public resource integrating extensive gene expression data, connectivity data and neuroanatomical information with powerful search and viewing tools for the adult and developing brain in mouse, human and non-human primate
This interactive database provides complete access to statistics on seasonal cotton supply and use for each country and each region in the world, from 1920/21 to date. This project is part of ICAC’s efforts to improve the transparency of world cotton statistics.
This Open GIS Data Portal provides residents, visitors, business owners and investors a means to easily find, access and view information through the use of maps and mapping technology.
The CDPP is the French national data centre for natural plasmas of the solar system. The CDPP assures the long term preservation of data obtained primarily from instruments built using French resources, and renders them readily accessible and exploitable by the international community. The CDPP also provides services to enable on-line data analysis (AMDA), 3D data visualization in context (3DView), and a propagation tool which bridges solar perturbations to in-situ measurements. The CDPP is involved in the development of interoperability, participates in several Virtual Observatory projects, and supports data distribution for scientific missions (Solar Orbiter, JUICE).
Presented here are experimental ionization cross sections measured using the Electron-Ion Crossed Beams apparatus in the Multicharged Ion Research Facility (MIRF) at the Physics Division of Oak Ridge National Laboratory (ORNL). The data are given in both graphical and tabular form along with the reference to the original publication of the experimental results. Also presented in the figures are theoretical cross sections supporting the experiments.
BiGG is a knowledgebase of Biochemically, Genetically and Genomically structured genome-scale metabolic network reconstructions. BiGG integrates several published genome-scale metabolic networks into one resource with standard nomenclature which allows components to be compared across different organisms. BiGG can be used to browse model content, visualize metabolic pathway maps, and export SBML files of the models for further analysis by external software packages. Users may follow links from BiGG to several external databases to obtain additional information on genes, proteins, reactions, metabolites and citations of interest.
CDAAC is responsible for processing the science data received from COSMIC. This data is currently being processed not long after the data is received, i.e. approximately eighty percent of radio occultation profiles are delivered to operational weather centers within 3 hours of observation as well as in a more accurate post-processed mode (within 8 weeks of observation).
SuperDARN is an international HF radar network designed to measure global-scale magnetospheric convection by observing plasma motion in the Earth’s upper atmosphere. This network consists of more than 20 radars operating on frequencies between 8 and 20 MHz that look into the polar regions of Earth. These radars can measure the position and velocity of charged particles in our ionosphere, the highest layer of the Earth's atmosphere, and provide scientists with information regarding Earth's interaction with the space environment.
GOBASE is a taxonomically broad organelle genome database that organizes and integrates diverse data related to mitochondria and chloroplasts. GOBASE is currently expanding to include information on representative bacteria that are thought to be specifically related to the bacterial ancestors of mitochondria and chloroplasts
On February 24, 2000, Terra began collecting what will ultimately become a new, 15-year global data set on which to base scientific investigations about our complex home planet. Together with the entire fleet of EOS spacecraft, Terra is helping scientists unravel the mysteries of climate and environmental change. TERRA's data collection instruments include: Advanced Spaceborne Thermal Emission and Reflection Radiometer (ASTER), Clouds and the Earth's Radiant Energy System (CERES), Multi-angle Imaging Spectro-Radiometer (MISR), Moderate-resolution Imaging Spectroradiometer (MODIS), Measurement of Pollution in the Troposphere (MOPITT)
The Erythron Database is a resource dedicated to facilitating better understanding of the cellular and molecular underpinnings of mammalian erythropoiesis. The resource is built upon a searchable database of gene expression in murine primitive and definitive erythroid cells at progressive stages of maturation.
OrtholugeDB contains Ortholuge-based orthology predictions for completely sequenced bacterial and archaeal genomes. It is also a resource for reciprocal best BLAST-based ortholog predictions, in-paralog predictions (recently duplicated genes) and ortholog groups in Bacteria and Archaea. The Ortholuge method improves the specificity of high-throughput orthology prediction.
The UniPROBE (Universal PBM Resource for Oligonucleotide Binding Evaluation) database hosts data generated by universal protein binding microarray (PBM) technology on the in vitro DNA binding specificities of proteins. This initial release of the UniPROBE database provides a centralized resource for accessing comprehensive data on the preferences of proteins for all possible sequence variants ('words') of length k ('k-mers'), as well as position weight matrix (PWM) and graphical sequence logo representations of the k-mer data. In total, the database currently hosts DNA binding data for 406 nonredundant proteins from a diverse collection of organisms, including the prokaryote Vibrio harveyi, the eukaryotic malarial parasite Plasmodium falciparum, the parasitic Apicomplexan Cryptosporidium parvum, the yeast Saccharomyces cerevisiae, the worm Caenorhabditis elegans, mouse, and human. The database's web tools (on the right) include a text-based search, a function for assessing motif similarity between user-entered data and database PWMs, and a function for locating putative binding sites along user-entered nucleotide sequences
Since January 2012, two previously independent resources called "ViFaArt – Virtual Library for Contemporary Art" and " – Virtual Library for Art History" have been joint together, forming a new service called This unique union makes it now possible to research the whole subject spectrum belonging to Art History. The special interest collection of Art History focuses on Medieval and Early European Art History, including art influenced by Europe in the USA, Canada and Australia, continuing chronologically from the Early Christian era until 1945. The special interest collection of Contemporary Art continues the art historical subject spectrum to include European and North American Art History from 1945. contains text and image resources as well as comprehensive, academically relevant information dealing with all media from the Middle Ages up to the present. pools the resources and know-how of the responsible partner institutions, thus making this portal an essential forum for research and teaching.
The Virtual Liver Network (VLN) represents a major research investment by the German Government focusing on work at the “bleeding edge” of Systems Biology and Systems Medicine. This Flagship Programme is tackling one of the major challenges in the life sciences: that is, how to integrate the wealth of data we have acquired post-genome, not just in a mathematical model, but more importantly in a series of models that are linked across scales to represent organ function. As the project is prototyping how to achieve true multi-scale modelling within a single organ and linking this to human physiology, it will be developing tools and protocols that can be applied to other systems, helping to drive forward the application of modelling and simulation to modern medical practice. It is the only programme of its type to our knowledge that bridges investigations from the sub-cellular through to ethically cleared patient and volunteer studies in an integrated workflow. As such, this programme is contributing significantly to the development of a new paradigm in biology and medicine.
>>>!!!<<<Efforts to obtain renewed funding after 2008 were unfortunately not successful. PANDIT has therefore been frozen since November 2008, and its data are not updated since September 2005 when version 17.0 was released (corresponding to Pfam 17.0). The existing data and website remain available from these pages, and should remain stable and, we hope, useful. >>>!!!<<< PANDIT is a collection of multiple sequence alignments and phylogenetic trees. It contains corresponding amino acid and nucleotide sequence alignments, with trees inferred from each alignment. PANDIT is based on the Pfam database (Protein families database of alignments and HMMs), and includes the seed amino acid alignments of most families in the Pfam-A database. DNA sequences for as many members of each family as possible are extracted from the EMBL Nucleotide Sequence Database and aligned according to the amino acid alignment. PANDIT also contains a further copy of the amino acid alignments, restricted to the sequences for which DNA sequences were found.
ERDDAP is a data server that gives you a simple, consistent way to download subsets of gridded and tabular scientific datasets in common file formats and make graphs and maps. This particular ERDDAP installation has oceanographic data (for example, data from satellites and buoys).