  • * at the end of a keyword allows wildcard searches
  • " quotes can be used for searching phrases
  • + represents an AND search (default)
  • | represents an OR search
  • - represents a NOT operation
  • ( and ) indicate precedence
  • ~N after a word specifies the desired edit distance (fuzziness)
  • ~N after a phrase specifies the desired slop amount
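The operators listed above can be combined into a single query string. The following is a minimal sketch of doing so in Python. All helper names (prefix, phrase, fuzzy, all_of, any_of, exclude) are hypothetical and are not part of any registry API, and the spacing around + and | is an assumption about what the search box accepts.

```python
from typing import Optional


def prefix(word: str) -> str:
    """Wildcard search: * at the end of a keyword."""
    return f"{word}*"


def phrase(text: str, slop: Optional[int] = None) -> str:
    """Phrase search in quotes, optionally followed by ~N (slop amount)."""
    quoted = f'"{text}"'
    return quoted if slop is None else f"{quoted}~{slop}"


def fuzzy(word: str, distance: int) -> str:
    """Fuzzy search: ~N after a word sets the desired edit distance."""
    return f"{word}~{distance}"


def all_of(*terms: str) -> str:
    """AND search (the default), written explicitly with +."""
    return " + ".join(terms)


def any_of(*terms: str) -> str:
    """OR search with |, wrapped in parentheses to set precedence."""
    return "(" + " | ".join(terms) + ")"


def exclude(term: str) -> str:
    """NOT operation: - in front of a term."""
    return f"-{term}"


# Hypothetical example: records matching either a "corp" prefix or the
# phrase "language resources", but not mentioning "museum".
query = all_of(
    any_of(prefix("corp"), phrase("language resources")),
    exclude("museum"),
)
print(query)
```

The helpers only build strings; the resulting query would still have to be pasted into the search box or sent by whatever client is used.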
Found 451 result(s)
Språkbanken was established in 1975 as a national center located in the Faculty of Arts, University of Gothenburg. Sture Allén's groundbreaking corpus linguistic research resulted in the creation of one of the first large electronic text corpora in a language other than English, with one million words of newspaper text. The task of Språkbanken is to collect, develop, and store (Swedish) text corpora, and to make linguistic data extracted from the corpora available to researchers and to the public.
The Repository of the University of Wroclaw is an institutional repository that archives and makes available scientific as well as research and development materials created by the employees, postgraduate students, and (selectively) the students of the University of Wroclaw, or issued at the University of Wroclaw. These materials include, inter alia, dissertations, postdoctoral theses, selected undergraduate and postgraduate theses, research articles, conference papers, monographs or their chapters, didactic materials, posters, and research data. The repository is organized by fields of knowledge, in accordance with the areas represented at the University within its organizational units, such as departments, institutes and other interfaculty units. Its structure is hierarchical, based on groups of subjects, covering a variety of collections.
The CLARIN/Text+ repository at the Saxon Academy of Sciences and Humanities in Leipzig offers long-term preservation of digital resources, along with their descriptive metadata. The mission of the repository is to ensure the availability and long-term preservation of resources, to preserve knowledge gained in research, to aid the transfer of knowledge into new contexts, and to integrate new methods and resources into university curricula. Among the resources currently available in the Leipzig repository is a set of corpora of the Leipzig Corpora Collection (LCC), based on newspaper, Wikipedia and Web text. Furthermore, several REST-based webservices are provided for a variety of NLP-relevant tasks. The repository is part of the CLARIN infrastructure and part of the NFDI consortium Text+. It is operated by the Saxon Academy of Sciences and Humanities in Leipzig.
ORA (Oxford University Research Archive) is the institutional repository for the University of Oxford. ORA was established in 2007 as a permanent and secure online archive of research materials produced by members of the University of Oxford. ORA aims to provide access to the full text of as much of Oxford's academic research as possible. This includes articles, conference papers, theses, research data, working papers, posters and more. Making materials open access removes barriers that restrict access to research, allowing for free dissemination of full text content, available to anyone with Internet access. ORA promotes and encourages the sharing of the scholarly output produced by the members of the University of Oxford that has been published under open access conditions, whilst additionally supporting University compliance with research funder policy and assessment.
The IMLS conducts annual surveys of public and state libraries in the US that have response rates near 100%. Data is compiled for states, library systems, and individual library branches and includes statistics for circulation, visits, staff, expenditures, and more. Data is available in two formats: MS Access and flat file, plain text. Data for museums is now included.
In collaboration with other centres in the Text+ consortium and in the CLARIN infrastructure, CLARIND-UdS enables eHumanities by providing a service for hosting and processing language resources (notably corpora) for members of the research community. The CLARIND-UdS centre thus contributes to reducing the fragmentation of language resources by assisting members of the research community in preparing language materials in such a way that easy discovery is ensured, interchange is facilitated and preservation is enabled. This is done by enriching such materials with meta-information, transforming them into sustainable formats and hosting them. We have an explicit mission to archive language resources, especially multilingual corpora (parallel, comparable) and corpora including specific registers, collected both by associated researchers and by researchers who are not affiliated with us.
The Linguistic Data Consortium (LDC) is an open consortium of universities, libraries, corporations and government research laboratories. It was formed in 1992 to address the critical data shortage then facing language technology research and development. Initially, LDC's primary role was as a repository and distribution point for language resources. Since that time, and with the help of its members, LDC has grown into an organization that creates and distributes a wide array of language resources. LDC also supports sponsored research programs and language-based technology evaluations by providing resources and contributing organizational expertise. LDC is hosted by the University of Pennsylvania and is a center within the University’s School of Arts and Sciences.
eLaborate is an online work environment in which scholars can upload scans, transcribe and annotate text, and publish the results as an online text edition which is freely available to all users. Brief information about, and links to, already published editions are presented on the page Editions under Published. Information about editions currently being prepared is posted on the page Ongoing projects. The eLaborate work environment for the creation and publication of online digital editions is developed by the Huygens Institute for the History of the Netherlands of the Royal Netherlands Academy of Arts and Sciences. Although the institute considers itself primarily a research facility and does not maintain a public collection profile, Huygens ING actively maintains almost 200 digitally available resource collections.
>>>!!!<<< As stated 2017-05-23, Cancer GEnome Mine is no longer available. >>>!!!<<< Cancer GEnome Mine is a public database for storing clinical information about tumor samples and microarray data, with emphasis on array comparative genomic hybridization (aCGH) and data mining of gene copy number changes.
DRO is Deakin University's research repository, providing digital curation by describing and preserving the University's research output and enabling worldwide discovery.
The Medical Expenditure Panel Survey (MEPS) is a set of large-scale surveys of families and individuals, their medical providers, and employers across the United States. MEPS is the most complete source of data on the cost and use of health care and health insurance coverage.
ResearchGate is a network where 15+ million scientists and researchers worldwide connect to share their work. Researchers can upload data of any type and receive DOIs, detailed statistics and real-time feedback. In the Data discovery section of ResearchGate, you can explore the added datasets.
ScholarSphere is an institutional repository managed by Penn State University Libraries. Anyone with a Penn State Access ID can deposit materials relating to the University’s teaching, learning, and research mission to ScholarSphere. All types of scholarly materials, including publications, instructional materials, creative works, and research data are accepted. ScholarSphere supports Penn State’s commitment to open access and open science. Researchers at Penn State can use ScholarSphere to satisfy open access and data availability requirements from funding agencies and publishers.
Arch is an open access repository for the research and scholarly output of Northwestern University. Log in with your NetID to deposit, describe, and organize your research for public access and long-term preservation. We'll use our expertise to help you curate, share, and preserve your work.
data.world is a data repository and social network where researchers can interact and collaborate; it also offers tutorials and datasets for data science learning. "data.world is designed for data and the people who work with data. From professional projects to open data, data.world helps you host and share your data, collaborate with your team, and capture context and conclusions as you work."
The UBIRA eData repository is a multidisciplinary online service for the registration, preservation and publication of research datasets produced or collected at the University of Birmingham. It is part of the University of Birmingham Research Archive (UBIRA).
CHERRY (CHEmistry RepositoRY) is a joint digital repository of all departments of the University of Belgrade - Faculty of Chemistry. CHERRY provides open access to the publications, as well as to other outputs, of the research projects implemented in this institution. The software platform meets the current requirements that apply to the dissemination of scholarly publications, and it is compatible with relevant international infrastructures.
ACU Research Bank is the Australian Catholic University's institutional research repository. It serves to collect, preserve, and showcase the research publications and outputs of ACU staff and higher degree students. Where possible and permissible, a full text version of a research output is available as open access.
Institutional repository of the University of Bern. BORIS Portal allows researchers at the University of Bern to archive and manage research data as well as project and funding information, to make it accessible and clearly identifiable.
The Warwick Research Archive Portal (WRAP) is the home of the University's full text, open access research content and contains journal articles, Warwick doctoral dissertations, book chapters, conference papers, working papers and more.
The UniSC Research Bank is the institutional research repository for the University of the Sunshine Coast. It provides an open access showcase of the University's scholarly research output, ensuring that research is made available to the local, national and international communities. UniSC Research Bank is harvested by search engines, and is also indexed by the National Library of Australia's TROVE. By making research easily accessible, it also facilitates collaboration between researchers. Where possible, access to the full text of the publication is made available, in line with copyright permissions for each output. To access relevant research, use the Browse function, or search for specific records using the search box. Find research data by filtering by resource type 'Research Dataset'.
As a member of SWE-CLARIN, the Humanities Lab will provide tools and expertise related to language archiving, corpus and (meta)data management, with a continued emphasis on multimodal corpora, many of which contain Swedish resources, but also other (often endangered) languages, multilingual or learner corpora. As a CLARIN K-centre we provide advice on multimodal and sensor-based methods, including EEG, eye-tracking, articulography, virtual reality, motion capture, and AV recording. Current work targets automatic data retrieval from multimodal data sets, as well as the linking of measurement data (e.g. EEG, fMRI) or geo-demographic data (GIS, GPS) to language data (audio, video, text, annotations). We also provide assistance with speech and language technology related matters to various projects. A primary resource in the Lab is The Humanities Lab corpus server, containing a varied set of multimodal language corpora with standardised metadata and linked layers of annotations and other resources.
>>>!!!<<< the url is no longer responsive <<<!!!>>> South Australia has considerable potential for petroleum and geothermal energy. The Energy Resources Division provides geoscientific and engineering information and data to support industry exploration and development.