Search | re3data.org

Filter

Reset all

Subjects

Content Types

Countries

AID systems

API

Certificates

Data access

Data access restrictions

Database access

Database access restrictions

Database licenses

Data licenses

Data upload

Data upload restrictions

Enhanced publication

Institution responsibility type

Institution type

Keywords

Metadata standards

PID systems

Provider types

Quality management

Repository languages

Software

Syndications

Repository types

Versioning

Toogle short help

* at the end of a keyword allows wildcard searches
" quotes can be used for searching phrases
+ represents an AND search (default)
| represents an OR search
- represents a NOT operation
( and ) implies priority
~N after a word specifies the desired edit distance (fuzziness)
~N after a phrase specifies the desired slop amount

← Previous
1 (current)
2
3
4
5
6
7
…
25
Next →

Found 607 result(s)

Australian National Corpus

AusNC

Subject(s)

Content type(s)

Country

Australia

The Australian National Corpus collates and provides access to assorted examples of Australian English text, transcriptions, audio and audio-visual materials. Text analysis tools are embedded in the interface allowing analysis and downloads in *.CSV format.

The University of Oxford Text Archive

OTA

Subject(s)

Content type(s)

Country

The University of Oxford Text Archive develops, collects, catalogues and preserves electronic literary and linguistic resources for use in Higher Education, in research, teaching and learning. We also give advice on the creation and use of these resources, and are involved in the development of standards and infrastructure for electronic language resources.

Deutsches Textarchiv

DTA

Subject(s)

Content type(s)

Country

The German Text Archive (Deutsches Textarchiv, DTA) presents online a selection of key German-language works in various disciplines from the 17th to 19th centuries. The electronic full-texts are indexed linguistically and the search facilities tolerate a range of spelling variants. The DTA presents German-language printed works from around 1650 to 1900 as full text and as digital facsimile. The selection of texts was made on the basis of lexicographical criteria and includes scientific or scholarly texts, texts from everyday life, and literary works. The digitalisation was made from the first edition of each work. Using the digital images of these editions, the text was first typed up manually twice (‘double keying’). To represent the structure of the text, the electronic full-text was encoded in conformity with the XML standard TEI P5. The next stages complete the linguistic analysis, i.e. the text is tokenised, lemmatised, and the parts of speech are annotated. The DTA thus presents a linguistically analysed, historical full-text corpus, available for a range of questions in corpus linguistics. Thanks to the interdisciplinary nature of the DTA Corpus, it also offers valuable source-texts for neighbouring disciplines in the humanities, and for scientists, legal scholars and economists.

Repository CLARIN Centre Leipzig

CLARIN repository at the Saxon Academy of Sciences and Humanities

Subject(s)

Content type(s)

Country

The CLARIN/Text+ repository at the Saxon Academy of Sciences and Humanities in Leipzig offers longterm preservation of digital resources, along with their descriptive metadata. The mission of the repository is to ensure the availability and longterm preservation of resources, to preserve knowledge gained in research, to aid the transfer of knowledge into new contexts, and to integrate new methods and resources into university curricula. Among the resources currently available in the Leipzig repository are a set of corpora of the Leipzig Corpora Collection (LCC), based on newspaper, Wikipedia and Web text. Furthermore several REST-based webservices are provided for a variety of different NLP-relevant tasks The repository is part of the CLARIN infrastructure and part of the NFDI consortium Text+. It is operated by the Saxon Academy of Sciences and Humanities in Leipzig.

Kujawsko-Pomorska Digital Library

Kujawsko-Pomorska Biblioteka Cyfrowa

Subject(s)

Content type(s)

Country

Poland

The KPDL covers cultural heritage, scientific and regional collections – digital copies of different forms of publications: books, journals, graphics, articles, leaflets, posters, playbills, photographs, invitations, maps, exhibition catalogues and trade fairs of the region. The Kujawsko-Pomorska Digital Library is to serve scientists, students, schoolchildren and all the citizens of the region.

CLARIN service center of the Zentrum Sprache at the BBAW

CLARIN Center BBAW

Subject(s)

Content type(s)

Structured text

Country

The Berlin-Brandenburg Academy of Sciences and Humanities (BBAW) is a CLARIN partner institution and has been an officially certified CLARIN service center since June 20th, 2013. The CLARIN center at the BBAW focuses on historical text corpora (predominantly provided by the 'Deutsches Textarchiv'/German Text Archive, DTA) as well as on lexical resources (e.g. dictionaries provided by the 'Digitales Wörterbuch der Deutschen Sprache'/Digital Dictionary of the German Language, DWDS).

Tree of Life Web Project

ToL

Subject(s)

Content type(s)

Country

United States

The Tree of Life Web Project is a collection of information about biodiversity compiled collaboratively by hundreds of expert and amateur contributors. Its goal is to contain a page with pictures, text, and other information for every species and for each group of organisms, living or extinct. Connections between Tree of Life web pages follow phylogenetic branching patterns between groups of organisms, so visitors can browse the hierarchy of life and learn about phylogeny and evolution as well as the characteristics of individual groups.

PolMine

PolMine Project

Subject(s)

Content type(s)

Country

The focus of PolMine is on texts published by public institutions in Germany. Corpora of parliamentary protocols are at the heart of the project: Parliamentary proceedings are available for long stretches of time, cover a broad set of public policies and are in the public domain, making them a valuable text resource for political science. The project develops repositories of textual data in a sustainable fashion to suit the research needs of political science. Concerning data, the focus is on converting text issued by public institutions into a sustainable digital format (TEI/XML).

CLARIN-LV repository

CLARIN Latvia repository

Subject(s)

Content type(s)

Country

CLARIN-LV is a national node of Clarin ERIC (Common Language Resources and Technology Infrastructure). The mission of the repository is to ensure the availability and long term preservation of language resources. The data stored in the repository are being actively used and cited in scientific publications.

TextGrid Repository

Virtual research environment for the Humanities

Subject(s)

Content type(s)

Country

The TextGrid Repository is a digital preservation archive for human sciences research data. It offers an extensive searchable and adaptable corpus of XML/TEI encoded texts, pictures and databases. Amongst the continuously growing corpus is the Digital Library of TextGrid, which consists of works of more than 600 authors of fiction (prose verse and drama) as well as nonfiction from the beginning of the printing press to the early 20th century written in or translated into German. The files are saved in different output formats (XML, ePub, PDF), published and made searchable. Different tools e.g. viewing or quantitative text-analysis tools can be used for visualization or to further research the text. The TextGrid Repository is part of the virtual research environment TextGrid, which besides offering digital preservation also offers open-source software for collaborative creations and publications of e.g. digital editions that are based on XML/TEI.

eLaborate

Huygens ING: eLaborate

Subject(s)

Content type(s)

Country

eLaborate is an online work environment in which scholars can upload scans, transcribe and annotate text, and publish the results as on online text edition which is freely available to all users. Short information about and a link to already published editions is presented on the page Editions under Published. Information about editions currently being prepared is posted on the page Ongoing projects. The eLaborate work environment for the creation and publication of online digital editions is developed by the Huygens Institute for the History of the Netherlands of the Royal Netherlands Academy of Arts and Sciences. Although the institute considers itself primarily a research facility and does not maintain a public collection profile, Huygens ING actively maintains almost 200 digitally available resource collections.

Kielipankki

The Language Bank of Finland

Subject(s)

Content type(s)

Country

The Language Bank features text and speech corpora with different kinds of annotations in over 60 languages. There is also a selection of tools for working with them, from linguistic analyzers to programming environments. Corpora are also available via web interfaces, and users can be allowed to download some of them. The IP holders can monitor the use of their resources and view user statistics.

CLARIN INT Portal

CLARIN INT Center

Subject(s)

Content type(s)

Country

The focus of CLARIN INT Portal is on resources that are relevant to the lexicological study of the Dutch language and on resources relevant for research in and development of language and speech technology. For Example: lexicons, lexical databases, text corpora, speech corpora, language and speech technology tools, etc. The resources are: Cornetto-LMF (Lexicon Markup Framework), Corpus of Contemporary Dutch (Corpus Hedendaags Nederlands), Corpus Gysseling, Corpus VU-DNC (VU University Diachronic News text Corpus), Dictionary of the Frisian Language (Woordenboek der Friese Taal), DuELME-LMF (Lexicon Markup Framework), Language Portal (Taalportaal), Namescape, NERD (Named Entity Recognition and Disambiguation) and TICCLops (Text-Induced Corpus Clean-up online processing system).

CanGEM

Cancer GEnome Mine

Subject(s)

Content type(s)

Country

Finland

>>>!!!<<<As stated 2017-05-23 Cancer GEnome Mine is no longer available >>>!!!<<< Cancer GEnome Mine is a public database for storing clinical information about tumor samples and microarray data, with emphasis on array comparative genomic hybridization (aCGH) and data mining of gene copy number changes.

KNMI Climate Explorer

Climate Explorer

Subject(s)

Content type(s)

Country

Netherlands

The KNMI Climate Explorer is a web application to analysis climate data statistically.

Deakin Research Online

DRO

Subject(s)

Content type(s)

Country

Australia

DRO is Deakin University's research repository, providing digital curation by describing and preserving the University's research output and enabling worldwide discovery.

ResearchGate Data

Subject(s)

Content type(s)

Country

Germany

ResearchGate is a network where 15+ million scientists and researchers worldwide connect to share their work. Researchers can upload data of any type and receive DOIs, detailed statistics and real-time feedback. In Data discovery Section of ResearchGate you can explore the added datasets.

DATA.GOV.UK

Opening up Government

Subject(s)

Content type(s)

Country

United Kingdom

The Government is releasing public data to help people understand how government works and how policies are made. Some of this data is already available, but data.gov.uk brings it together in one searchable website. Making this data easily available means it will be easier for people to make decisions and suggestions about government policies based on detailed information.

Astromaterials Data System

AstroMat

Subject(s)

Content type(s)

Country

United States

The Astromaterials Data System (AstroMat) is a data infrastructure to store, curate, and provide access to laboratory data acquired on samples curated in the Astromaterials Collection of the Johnson Space Center. AstroMat is developed and operated at the Lamont-Doherty Earth Observatory of Columbia University and funded by NASA.

ScholarSphere

PennState ScholarSphere

Subject(s)

Content type(s)

Country

United States

ScholarSphere is an institutional repository managed by Penn State University Libraries. Anyone with a Penn State Access ID can deposit materials relating to the University’s teaching, learning, and research mission to ScholarSphere. All types of scholarly materials, including publications, instructional materials, creative works, and research data are accepted. ScholarSphere supports Penn State’s commitment to open access and open science. Researchers at Penn State can use ScholarSphere to satisfy open access and data availability requirements from funding agencies and publishers.

data.world

Subject(s)

Content type(s)

Country

United States

A data repository and social network so that researchers can interact and collaborate, also offers tutorials and datasets for data science learning. "data.world is designed for data and the people who work with data. From professional projects to open data, data.world helps you host and share your data, collaborate with your team, and capture context and conclusions as you work."

OSF - Vrije Universiteit Amsterdam

Subject(s)

Content type(s)

Country

A Research Data Management Tool for the Vrije Universiteit Amsterdam Research Community.

PSLC DataShop

Pittsburgh Science of Learning Center DataShop

Subject(s)

Content type(s)

Country

United States

PSLC DataShop houses datasets in the areas of learning science and educational software. The site also provides online tools for analyzing and reporting the data.

Tropicos®

w³Tropicos

Subject(s)

Content type(s)

Country

United States

Tropicos® was originally created for internal research but has since been made available to the world’s scientific community. All of the nomenclatural, bibliographic, and specimen data accumulated in MBG’s electronic databases during the past 30 years are publicly available here.

Scholar's Bank

Subject(s)

Content type(s)

Country

United States

Scholars' Bank is the open access repository for the intellectual work of faculty, students and staff at the University of Oregon and partner institution collections.

← Previous
1 (current)
2
3
4
5
6
7
…
25
Next →

Current projects
EOSC FAIR-IMPACT

re3data COREF

To the extent possible under law, re3data.org has waived all copyright and related or neighboring rights to the database entries of re3data.org.
Except where otherwise noted, content on this site is licensed under a Creative Commons Attribution 4.0 International License .
Cite this service: re3data.org - Registry of Research Data Repositories. https://doi.org/10.17616/R3D last accessed: 2024-04-23