Reset all


Content Types


AID systems



Data access

Data access restrictions

Database access

Database access restrictions

Database licenses

Data licenses

Data upload

Data upload restrictions

Enhanced publication

Institution responsibility type

Institution type


Metadata standards

PID systems

Provider types

Quality management

Repository languages



Repository types


  • * at the end of a keyword allows wildcard searches
  • " quotes can be used for searching phrases
  • + represents an AND search (default)
  • | represents an OR search
  • - represents a NOT operation
  • ( and ) implies priority
  • ~N after a word specifies the desired edit distance (fuzziness)
  • ~N after a phrase specifies the desired slop amount
Found 287 result(s)
This classic collection of test cases for validation of turbulence models started as an EU / ERCOFTAC project led by Pr. W. Rodi in 1995. It is maintained by Dr. T. Craft at Manchester since 1999. Initialy limited to experimental data, computational results, and results and conclusions drawn from the ERCOFTAC Workshops on Refined Turbulence Modelling (SIG15). At the moment, each case should contain at least a brief description, some data to download, and references to published work. Some cases contain significantly more information than this.
>>>>!!!!<<<< The Cancer Genomics Hub mission is now completed. The Cancer Genomics Hub was established in August 2011 to provide a repository to The Cancer Genome Atlas, the childhood cancer initiative Therapeutically Applicable Research to Generate Effective Treatments and the Cancer Genome Characterization Initiative. CGHub rapidly grew to be the largest database of cancer genomes in the world, storing more than 2.5 petabytes of data and serving downloads of nearly 3 petabytes per month. As the central repository for the foundational genome files, CGHub streamlined team science efforts as data became as easy to obtain as downloading from a hard drive. The convenient access to Big Data, and the collaborations that CGHub made possible, are now essential to cancer research. That work continues at the NCI's Genomic Data Commons. All files previously stored at CGHub can be found there. The Website for the Genomic Data Commons is here: >>>>!!!!<<<< The Cancer Genomics Hub (CGHub) is a secure repository for storing, cataloging, and accessing cancer genome sequences, alignments, and mutation information from the Cancer Genome Atlas (TCGA) consortium and related projects. Access to CGHub Data: All researchers using CGHub must meet the access and use criteria established by the National Institutes of Health (NIH) to ensure the privacy, security, and integrity of participant data. CGHub also hosts some publicly available data, in particular data from the Cancer Cell Line Encyclopedia. All metadata is publicly available and the catalog of metadata and associated BAMs can be explored using the CGHub Data Browser.
The "Subaru Observatory Project" was originally planned for producing very important output to astronomical society by systematic time allocation and using characteristic functions of Subaru Telescope. The observation time for this project consists of guaranteed time both for telescope builders and for people responsible for telescope operation. 3 proposals were selected for execution during 2002 and 2003 fiscal years. They are Subaru Deep Field (SDF) (PI is Dr. Kashikawa at Mitaka, NAOJ), Subaru XMM-Newton Deep Survey (SXDS) (PI is Dr. Sekiguchi at Hilo, Subaru Telescope), and Disk and Planet Searches (SDPS) (PI is Dr. Hayashi at Hilo, Subaru Telescope). SOAPs web server provide the data (fully reduced images and catalogs) download obtained from SDF and SXDS projects. Raw Data are available at the SMOKA Science Archive:
The UniPROBE (Universal PBM Resource for Oligonucleotide Binding Evaluation) database hosts data generated by universal protein binding microarray (PBM) technology on the in vitro DNA binding specificities of proteins. This initial release of the UniPROBE database provides a centralized resource for accessing comprehensive data on the preferences of proteins for all possible sequence variants ('words') of length k ('k-mers'), as well as position weight matrix (PWM) and graphical sequence logo representations of the k-mer data. In total, the database currently hosts DNA binding data for 406 nonredundant proteins from a diverse collection of organisms, including the prokaryote Vibrio harveyi, the eukaryotic malarial parasite Plasmodium falciparum, the parasitic Apicomplexan Cryptosporidium parvum, the yeast Saccharomyces cerevisiae, the worm Caenorhabditis elegans, mouse, and human. The database's web tools (on the right) include a text-based search, a function for assessing motif similarity between user-entered data and database PWMs, and a function for locating putative binding sites along user-entered nucleotide sequences
The Clouds and the Earth’s Radiant Energy System (CERES) is a key component of the Earth Observing System (EOS) program. CERES instruments provide radiometric measurements of the Earth’s atmosphere from three broadband channels. CERES products include both solar-reflected and Earth-emitted radiation from the top of the atmosphere to the Earth's surface.
The British Geological Survey (BGS), the world’s oldest national geological survey, has over 400 datasets including environmental monitoring data, digital databases, physical collections (borehole core, rocks, minerals and fossils), records and archives.
The Crop EST Database (CR-EST) is a public available online resource providing access to sequence, classification, clustering, and annotation data of crop EST projects at the IPK. A view of these information give the summarized numbers about genomic data of species listed in the adjacent table.
>>>!!!<<< Crystaleye has now been excitingly integrated into the Crystallography Open Database at>>>!!!<<< Crystallography Open Database now is including data and software from CrystalEye, developed by Nick Day at the department of Chemistry, the University of Cambridge under supervision of Peter Murray-Rust. The aim of the CrystalEye project is to aggregate crystallography from web resources, and to provide methods to easily browse, search, and to keep up to date with the latest published information.At present we are aggregating the crystallography from the supplementary data to articles at publishers websites.
Satellite-tracked drifting buoys ("drifters") collect measurements of upper ocean currents and sea surface temperatures (SST) around the world as part of the Global Drifter Program. Drifter locations are estimated from 16-20 satellite fixes per day, per drifter. The Drifter Data Assembly Center (DAC) at NOAA's Atlantic Oceanographic and Meteorological Laboratory (AOML) assembles these raw data, applies quality control procedures, and interpolates them via kriging to regular six-hour intervals. The raw observations and processed data are archived at AOML and at the Marine Environmental Data Services (MEDS) in Canada. Two types of data are available: "metadata" contains deployment location and time, time of drogue (sea anchor) loss, date of final transmission, etc. for each drifter. "Interpolated data" contains the quality-controlled, interpolated drifter observations. is the Centers for Disease Control and Prevention primary online communication channel. provides users with credible, reliable health information on Data and Statistics, Diseases and Conditions, Emergencies and Disasters, Environmental Health, Healthy Living, Injury, Violence and Safety,Life Stages and Populations, Travelers' Health, Workplace Safety and Health
-----<<<<< The repository is no longer available. This record is out-dated. >>>>>----- The Clean Energy Project Database (CEPDB) is a massive reference database for organic semiconductors with a particular emphasis on photovoltaic applications. It was created to store and provide access to data from computational as well as experimental studies, on both known and virtual compounds. It is a free and open resource designed to support researchers in the field of organic electronics in their scientific pursuits. The CEPDB was established as part of the Harvard Clean Energy Project (CEP), a virtual high-throughput screening initiative to identify promising new candidates for the next generation of carbon-based solar cell materials.
The Joint Evaluated Fission and Fusion File (JEFF) project is a collaboration between NEA Data Bank member countries. The JEFF library combines the efforts of the JEFF and EFF/EAF Working Groups to produce a common sets of evaluated nuclear data, mainly for fission and fusion applications. It contains a number of different data types, including neutron and proton interaction data, radioactive decay data, fission yields, and thermal scattering law data
The Barrow, Alaska Observatory (BRW) archives and provides digital access to their findings related to climate change, ozone depletion and baseline air quality. The BRW is part of the National Oceanic and Atmospheric Administration and Earth System Research Laboratory Global Monitoring Division.
Central data management of the USGS for water data that provides access to water-resources data collected at approximately 1.5 million sites in all 50 States, the District of Columbia, Puerto Rico, the Virgin Islands, Guam, American Samoa and the Commonwealth of the Northern Mariana Islands. Includes data on water use and quality, groundwater, and surface water.
The CCKP contains environmental, disaster risk, and socio-economic datasets, as well as synthesis products, such as the Climate Adaptation Country Profiles, which are built and packaged for specific user-focused functions such as climate change indices for a particular country. The portal also provides intelligent links to other resources and tools.
The CliSAP-Integrated Climate Data Center (ICDC) allows easy access to climate relevant data from in-situ measurements and satellite remote sensing. These data are important to determine the status and the changes in the climate system. Additionally some relevant re-analysis data are included, which are modeled on the basis of observational data.
The Global Terrorism Database (GTD) is an open-source database including information on terrorist events around the world from 1970 through 2015 (with annual updates planned for the future). Unlike many other event databases, the GTD includes systematic data on domestic as well as international terrorist incidents that have occurred during this time period and now includes more than 150,000 cases.
Since January 2012, two previously independent resources called "ViFaArt – Virtual Library for Contemporary Art" and " – Virtual Library for Art History" have been joint together, forming a new service called This unique union makes it now possible to research the whole subject spectrum belonging to Art History. The special interest collection of Art History focuses on Medieval and Early European Art History, including art influenced by Europe in the USA, Canada and Australia, continuing chronologically from the Early Christian era until 1945. The special interest collection of Contemporary Art continues the art historical subject spectrum to include European and North American Art History from 1945. contains text and image resources as well as comprehensive, academically relevant information dealing with all media from the Middle Ages up to the present. pools the resources and know-how of the responsible partner institutions, thus making this portal an essential forum for research and teaching.
One of the Bundesbank's tasks is to collect statistical data. Some of these data are then fed into the time series published on the internet in the time series database. In recent years, there have been increased demands for all data providers to make more data available for research purposes. This applies, on the one hand, to the individual data used for the calculation of the published aggregated data. This is because only using these individual data is it possible to accurately reflect the heterogeneity of the relevant feature carriers in academic studies. Due to legal requirements, individual data cannot be made generally available. However, these data are made available under strict conditions and for clearly defined academic research purposes at the Research Centre of the Deutsche Bundesbank. Furthermore, in the area of aggregated data, there is also an increasing demand for real-time data. These data reflect an out-of-date data set and may therefore deviate from data currently available as a result of corrections carried out in the meantime.
The modENCODE Project, Model Organism ENCyclopedia Of DNA Elements, was initiated by the funding of applications received in response to Requests for Applications (RFAs) HG-06-006, entitled Identification of All Functional Elements in Selected Model Organism Genomes and HG-06-007, entitled A Data Coordination Center for the Model Organism ENCODE Project (modENCODE). The modENCODE Project is being run as an open consortium and welcomes any investigator willing to abide by the criteria for participation that have been established for the project. Both computational and experimental approaches are being applied by modENCODE investigators to study the genomes of D. melanogaster and C. elegans. An added benefit of studying functional elements in model organisms is the ability to biologically validate the elements discovered using methods that cannot be applied in humans. The comprehensive dataset that is expected to result from the modENCODE Project will provide important insights into the biology of D. melanogaster and C. elegans as well as other organisms, including humans.
The Virtual Liver Network (VLN) represents a major research investment by the German Government focusing on work at the “bleeding edge” of Systems Biology and Systems Medicine. This Flagship Programme is tackling one of the major challenges in the life sciences: that is, how to integrate the wealth of data we have acquired post-genome, not just in a mathematical model, but more importantly in a series of models that are linked across scales to represent organ function. As the project is prototyping how to achieve true multi-scale modelling within a single organ and linking this to human physiology, it will be developing tools and protocols that can be applied to other systems, helping to drive forward the application of modelling and simulation to modern medical practice. It is the only programme of its type to our knowledge that bridges investigations from the sub-cellular through to ethically cleared patient and volunteer studies in an integrated workflow. As such, this programme is contributing significantly to the development of a new paradigm in biology and medicine.
The Database contains all publicly available HMS LINCS datasets and information for each dataset about experimental reagents (small molecule perturbagens, cells, antibodies, and proteins) and experimental and data analysis protocols.
The Swiss HIV Cohort Study (SHCS), established in 1988, is a systematic longitudinal study enrolling HIV-infected individuals in Switzerland. It is a collaboration of all Swiss University Hospital infectious disease outpatient clinics, two large cantonal hospitals, all with affiliated laboratories, and with affiliated smaller hospitals and private physicians carrying for HIV patients. The Swiss Mother and Child HIV Cohort Study (MoCHiV) is integrated into the SHCS. It aims at preventing mother to child transmission and enrolls HIV-infected pregnant women and their children. The SHCS involves practically all researchers being active in patient-oriented HIV research in Switzerland. The clinics can delegate recruitment of participants and follow-up visits to other outpatient clinics or to specialized private physicians, provided that the requirements of the protocol can be entirely fulfilled and controlled. The laboratories can contract other laboratories for some of the analyses.