  • * at the end of a keyword allows wildcard searches
  • " quotes can be used for searching phrases
  • + represents an AND search (default)
  • | represents an OR search
  • - represents a NOT operation
  • ( and ) indicate grouping and precedence
  • ~N after a word specifies the desired edit distance (fuzziness)
  • ~N after a phrase specifies the desired slop amount
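As an illustration of the operators above (the search terms are arbitrary examples, not entries from this registry), queries can be combined as follows:

    linguist*                        wildcard: matches linguist, linguistic, linguistics, ...
    "long-term preservation"         exact phrase
    corpus | corpora                 OR: either term may occur
    (data + repository) -software    grouped AND, excluding records that mention software
    preservaton~1                    fuzzy: within one edit of the typed word
    "research data"~2                phrase with a slop of up to two intervening words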
Found 76 result(s)
The CMU Multi-Modal Activity Database (CMU-MMAC) contains multimodal measures of the human activity of subjects performing the tasks involved in cooking and food preparation. The CMU-MMAC database was collected in Carnegie Mellon's Motion Capture Lab. A kitchen was built, and to date twenty-five subjects have been recorded cooking five different recipes: brownies, pizza, sandwich, salad, and scrambled eggs.
The CLARIN/Text+ repository at the Saxon Academy of Sciences and Humanities in Leipzig offers long-term preservation of digital resources, along with their descriptive metadata. The mission of the repository is to ensure the availability and long-term preservation of resources, to preserve knowledge gained in research, to aid the transfer of knowledge into new contexts, and to integrate new methods and resources into university curricula. Among the resources currently available in the Leipzig repository are a set of corpora of the Leipzig Corpora Collection (LCC), based on newspaper, Wikipedia, and Web text. Furthermore, several REST-based web services are provided for a variety of NLP-relevant tasks. The repository is part of the CLARIN infrastructure and of the NFDI consortium Text+. It is operated by the Saxon Academy of Sciences and Humanities in Leipzig.
Sharing and preserving data are central to protecting the integrity of science. DataHub, a Research Computing endeavor, provides tools and services to meet scientific data challenges at Pacific Northwest National Laboratory (PNNL). DataHub helps researchers address the full data life cycle for their institutional projects and provides a path to creating findable, accessible, interoperable, and reusable (FAIR) data products. Although open science data is a crucial focus of DataHub’s core services, we are interested in working with evidence-based data throughout the PNNL research community.
Stanford Network Analysis Platform (SNAP) is a general-purpose network analysis and graph mining library. It is written in C++ and easily scales to massive networks with hundreds of millions of nodes and billions of edges. It efficiently manipulates large graphs, calculates structural properties, generates regular and random graphs, and supports attributes on nodes and edges. SNAP is also available through NodeXL, a graphical front-end that integrates network analysis into Microsoft Office and Excel. The SNAP library has been actively developed since 2004 and is organically growing as a result of our research pursuits in the analysis of large social and information networks. The largest network we have analyzed so far using the library is the Microsoft Instant Messenger network from 2006, with 240 million nodes and 1.3 billion edges. The datasets available on the website were mostly collected (scraped) for the purposes of our research. The website was launched in July 2009.
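Since the description focuses on SNAP's C++ interface, the following is a minimal sketch, assuming the SNAP sources are on the include path, of how its graph types and analytics are typically combined (building a graph, computing a structural property, generating a random graph). Class and function names follow the publicly documented SNAP API and should be checked against the current release.

    #include "Snap.h"   // core SNAP header; assumes the SNAP sources are on the include path

    int main() {
      // Build a small undirected graph by hand (TUNGraph = undirected graph).
      PUNGraph Graph = TUNGraph::New();
      for (int NId = 0; NId < 5; NId++) { Graph->AddNode(NId); }
      Graph->AddEdge(0, 1);
      Graph->AddEdge(1, 2);
      Graph->AddEdge(2, 3);
      Graph->AddEdge(3, 4);
      Graph->AddEdge(4, 0);

      // Structural properties: node/edge counts and average clustering coefficient.
      printf("nodes: %d  edges: %d\n", Graph->GetNodes(), Graph->GetEdges());
      printf("avg clustering coefficient: %f\n", TSnap::GetClustCf(Graph));

      // Random graph generation: an Erdos-Renyi graph with 1000 nodes and 5000 edges.
      PNGraph Random = TSnap::GenRndGnm<PNGraph>(1000, 5000);
      printf("random graph: %d nodes, %d edges\n", Random->GetNodes(), Random->GetEdges());
      return 0;
    }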
bonndata is the institutional, FAIR-aligned and curated, cross-disciplinary research data repository for the publication of research data for all researchers at the University of Bonn. The repository is fully embedded into the University IT and Data Center and curated by the Research Data Service Center (https://www.forschungsdaten.uni-bonn.de/en). bonndata is based on the open source software Dataverse (https://dataverse.org).
CLARIN is a European Research Infrastructure for the Humanities and Social Sciences, focusing on language resources (data and tools). It is being implemented and constantly improved at leading institutions in a large and growing number of European countries, with the aim of strengthening Europe's multilinguality competence. CLARIN provides several services, such as access to language data and tools to analyze the data, offers the possibility to deposit research data, and gives direct access to knowledge about relevant topics in relation to (research on and with) language resources. The main tool is the 'Virtual Language Observatory', which provides metadata and access to the different national CLARIN centers and their data.
OLAC, the Open Language Archives Community, is an international partnership of institutions and individuals who are creating a worldwide virtual library of language resources by: (i) developing consensus on best current practice for the digital archiving of language resources, and (ii) developing a network of interoperating repositories and services for housing and accessing such resources. In 2016, the OLAC system was integrated with the Linguistic Linked Open Data Cloud.
Note: this repository is no longer available; the record is outdated. GEON is an open collaborative project that is developing cyberinfrastructure for the integration of 3- and 4-dimensional earth science data. GEON will develop services for data integration and model integration, and associated model execution and visualization. The Mid-Atlantic test bed will focus on tectonothermal, paleogeographic, and biotic history from the late Proterozoic to the mid-Paleozoic. The Rockies test bed will focus on the integration of data with dynamic models, to better understand deformation history. GEON will develop the most comprehensive regional datasets in the test bed areas.
The University of Cape Town (UCT) uses Figshare for Institutions for its data repository, which was launched in 2017 and is called ZivaHub: Open Data UCT. ZivaHub serves principal investigators at the University of Cape Town who need a repository to store and openly disseminate the data that support their published research findings. The repository service is provided in terms of the UCT Research Data Management Policy. It provides open access to supplementary research data files and links to their respective scholarly publications (e.g. theses, dissertations, and papers) hosted on other platforms, such as OpenUCT.
Specification Patterns is an online repository for information about property specification for finite-state verification. The intent of this repository is to collect patterns that occur commonly in the specification of concurrent and reactive systems.
Edmond is the institutional repository of the Max Planck Society for public research data. It enables Max Planck scientists to create citable scientific assets by describing, enriching, sharing, exposing, linking, publishing, and archiving research data of all kinds. Furthermore, all objects within Edmond have a unique identifier and can therefore be clearly referenced in publications or reused in other contexts.
Kaggle is a platform for predictive modelling and analytics competitions in which statisticians and data miners compete to produce the best models for predicting and describing the datasets uploaded by companies and users. This crowdsourcing approach relies on the fact that there are countless strategies that can be applied to any predictive modelling task and it is impossible to know beforehand which technique or analyst will be most effective.
An increasing number of Language Resources (LRs) in the various fields of Human Language Technology (HLT) are distributed on behalf of ELRA via its operational body ELDA, thanks to the contribution of various players of the HLT community. Our aim is to provide Language Resources, by means of this repository, so as to spare researchers and developers the effort of rebuilding resources that already exist, and to help them identify and access those resources.
Brainlife promotes engagement and education in reproducible neuroscience. We do this by providing an online platform where users can publish code (Apps) and data, and make them "alive" by integrating various HPC and cloud computing resources to run those Apps. Brainlife also provides mechanisms to publish all research assets associated with a scientific project (data and analyses), embedded in a cloud computing environment and referenced by a single digital object identifier (DOI). The platform is unique because of its focus on supporting scientific reproducibility beyond open code and open data, by providing fundamental smart mechanisms for what we refer to as “Open Services.”
Academic Torrents is a distributed data repository. The Academic Torrents network is built for researchers, by researchers. Its distributed peer-to-peer library system automatically replicates your datasets on many servers, so you don't have to worry about managing your own servers or file availability. Everyone who has data becomes a mirror for those data, so the system is fault-tolerant.