Search | re3data.org

SISSA Open Data

Scuola Internazionale Superiore di Studi Avanzati Open Data

Subject(s)

Content type(s)

Country

Italy

SISSA Open Data is the Sissa repository for the research data managment. It is an institutional repository that captures, stores, preserves, and redistributes the data of the SISSA scientific community in digital form. SISSA Open Data is managed by the SISSA Library as a service to the SISSA scientific community.

The Global Proteome Machine

GPM

Subject(s)

Content type(s)

Country

Canada

The Global Proteome Machine (GPM) is a protein identification database. This data repository allows users to post and compare results. GPM's data is provided by contributors like The Informatics Factory, University of Michigan, and Pacific Northwestern National Laboratories. The GPM searchable databases are: GPMDB, pSYT, SNAP, MRM, PEPTIDE and HOT.

NCBI Virus

Subject(s)

Content type(s)

Country

United States

NCBI Virus is a community portal for viral sequence data from RefSeq, GenBank and other NCBI repositories. To find, retrieve and analyze data, choose one of the offered options.

EchoBase

an integrated post-genomic database for E.coli

Subject(s)

Content type(s)

Country

United Kingdom

EchoBase is a database that curates new experimental and bioinformatic information about the genes and gene products of the model bacterium Escherichia coli K-12 strain MG1655.

NCBI Structure

Subject(s)

Content type(s)

Country

United States

The Structure database provides three-dimensional structures of macromolecules for a variety of research purposes and allows the user to retrieve structures for specific molecule types as well as structures for genes and proteins of interest. Three main databases comprise Structure-The Molecular Modeling Database; Conserved Domains and Protein Classification; and the BioSystems Database. Structure also links to the PubChem databases to connect biological activity data to the macromolecular structures. Users can locate structural templates for proteins and interactively view structures and sequence data to closely examine sequence-structure relationships.

RDP

Ribosomal Database Project

Subject(s)

Content type(s)

Country

United States

<<<!!!<<< The RDP website is no longer available. A stand-alone version of the RDP Classifier is available on Sorceforge https://sourceforge.net/projects/rdp-classifier/. Instructions for installing a command-line version of RDP Tools can be found at Dr. J.Quensen's Website https://john-quensen.com/tutorials/tutorial-1/ and https://jfq3.gitbook.io/rdptools-docker/rdptools-docker/readme. >>>!!!>>>

NCBI Datasets

Subject(s)

Content type(s)

Country

United States

NCBI Datasets is a continually evolving platform designed to provide easy and intuitive access to NCBI’s sequence data and metadata. NCBI Datasets is part of the NIH Comparative Genomics Resource (CGR). CGR facilitates reliable comparative genomics analyses for all eukaryotic organisms through an NCBI Toolkit and community collaboration.

Portal de Datos Genómicos del SNDG

Sistema Nacional de Datos Genómicos - Portal de datos

Subject(s)

Content type(s)

Country

Argentina

<<<!!!<<< This repository is no longer available. >>>!!!>>>

MGnify

formerly: EBI Metagenomics

Subject(s)

Content type(s)

Country

MGnify (formerly: EBI Metagenomics) offers an automated pipeline for the analysis and archiving of microbiome data to help determine the taxonomic diversity and functional & metabolic potential of environmental samples. Users can submit their own data for analysis or freely browse all of the analysed public datasets held within the repository. In addition, users can request analysis of any appropriate dataset within the European Nucleotide Archive (ENA). User-submitted or ENA-derived datasets can also be assembled on request, prior to analysis.

miRBase

Subject(s)

Content type(s)

Country

United Kingdom

The miRBase database is a searchable database of published miRNA sequences and annotation. Each entry in the miRBase Sequence database represents a predicted hairpin portion of a miRNA transcript (termed mir in the database), with information on the location and sequence of the mature miRNA sequence (termed miR). Both hairpin and mature sequences are available for searching and browsing, and entries can also be retrieved by name, keyword, references and annotation. All sequence and annotation data are also available for download. The miRBase Registry provides miRNA gene hunters with unique names for novel miRNA genes prior to publication of results.

CLAE

Characterized Lignocellulose-Active Proteins of Fungal Origin

Subject(s)

Content type(s)

Country

<<<!!!<<< This repository is no longer available. >>>!!!>>> Formerly known as mycoCLAP, CLAE is a curated database of Characterized Lignocellulose-Active Enzymes

MalaCards

Human disease Database

Subject(s)

Content type(s)

Country

MalaCards is an integrated database of human maladies and their annotations, modeled on the architecture and richness of the popular GeneCards database of human genes. MalaCards mines and merges varied web data sources to generate a computerized web card for each human disease. Each MalaCard contains disease specific prioritized annotative information, as well as links between associated diseases, leveraging the GeneCards relational database, search engine, and GeneDecks set-distillation tool. As proofs of concept of the search/distill/infer pipeline we find expected elucidations, as well as potentially novel ones.

NCBI Protein

Subject(s)

Content type(s)

Country

United States

The Protein database is a collection of sequences from several sources, including translations from annotated coding regions in GenBank, RefSeq and TPA, as well as records from SwissProt, PIR, PRF, and PDB. Protein sequences are the fundamental determinants of biological structure and function.

NCBI

National Center for Biotechnology Information

Subject(s)

Content type(s)

Country

United States

The National Center for Biotechnology Information advances science and health by providing access to biomedical and genomic information

MetabolomeXchange

Subject(s)

Content type(s)

Country

MetabolomeXchange.org delivers the mechanisms needed for disseminating the data to the metabolomics community at large (both metabolomics researchers and databases). The main objective is to make it easier for metabolomics researchers to become aware of newly released, publicly available, metabolomics datasets that may be useful for their research. MetabolomeXchange contains datasets from different data providers: MetaboLights, Metabolomic Repository Bordeaux, Metabolomics Workbench, and Metabolonote

GeneCards

The Human Gene Database

Subject(s)

Content type(s)

Country

GeneCards is a searchable, integrative database that provides comprehensive, user-friendly information on all annotated and predicted human genes. It automatically integrates gene-centric data from ~125 web sources, including genomic, transcriptomic, proteomic, genetic, clinical and functional information.

cisRED

Databases of genome-wide regulatory module and element predictions

Subject(s)

Content type(s)

Country

Canada

<<<!!!<<< This repository is no longer available. >>>!!!>>>

Mammalian Transcriptomic Database

MTD

Subject(s)

Content type(s)

Country

China

MTD is focused on mammalian transcriptomes with a current version that contains data from humans, mice, rats and pigs. Regarding the core features, the MTD browses genes based on their neighboring genomic coordinates or joint KEGG pathway and provides expression information on exons, transcripts, and genes by integrating them into a genome browser. We developed a novel nomenclature for each transcript that considers its genomic position and transcriptional features.

NCBI Genome

Subject(s)

Content type(s)

Country

United States

<<<!!!<<< Effective May 2024, NCBI's Genome resource will no longer be available. NCBI Genome data can now be found on the NCBI Datasets taxonomy pages. https://www.re3data.org/repository/r3d100014298 >>>!!!>>> The Genome database contains annotations and analysis of eukaryotic and prokaryotic genomes, as well as tools that allow users to compare genomes and gene sequences from humans, microbes, plants, viruses and organelles. Users can browse by organism, and view genome maps and protein clusters.

SWISS-MODEL Repository

SMR

Subject(s)

Content type(s)

Country

Switzerland

The SWISS-MODEL Repository is a database of annotated three-dimensional comparative protein structure models generated by the fully automated homology-modelling pipeline SWISS-MODEL.

CATH

CATH-Gene3D

Subject(s)

Content type(s)

Country

The CATH database is a hierarchical domain classification of protein structures in the Protein Data Bank. Protein structures are classified using a combination of automated and manual procedures. There are four major levels in the CATH hierarchy; Class, Architecture, Topology and Homologous superfamily.

BioModels

Subject(s)

Content type(s)

Country

BioModels is a repository of mathematical models of biological and biomedical systems. It hosts a vast selection of existing literature-based physiologically and pharmaceutically relevant mechanistic models in standard formats. Our mission is to provide the systems modelling community with reproducible, high-quality, freely-accessible models published in the scientific literature.

PDBj

Protein Data Bank Japan

Subject(s)

Content type(s)

Country

PDBj (Protein Data Bank Japan) provides a centralized PDB archive of macromolecular structures, integrated tools for data retrieval, visualization, and functional characterization. PDBj is supported by JST-NBDC and Osaka University.

AnimalGenome.ORG

BioMart AnimalGenome.ORG

Subject(s)

Content type(s)

Country

United States

This is a animal and human genome database that uses the BioMart software.

4DGenome

Chromatin Interaction Database

Subject(s)

Content type(s)

Country

United States

4DGenome is a public database that archives and disseminates chromatin interaction data. Currently, 4DGenome contains over 8,038,247 interactions curated from both experimental studies (high throughput and individual studies) and computational predictions. It covers five organisms, Homo sapiens, Mus musculus, Drosophila melanogaster, Plasmodium falciparum, and Saccharomyces cerevisiae.

Subjects

Content Types

Countries

AID systems

API

Certificates

Data access

Data access restrictions

Database access

Database access restrictions

Database licenses

Data licenses

Data upload

Data upload restrictions

Enhanced publication

Institution responsibility type

Institution type

Keywords

Metadata standards

PID systems

Provider types

Quality management

Repository languages

Software

Syndications

Repository types

Versioning