Filter
Reset all

Subjects

Content Types

Countries

AID systems

API

Certificates

Data access

Data access restrictions

Database access

Database licenses

Data licenses

Data upload

Data upload restrictions

Enhanced publication

Institution responsibility type

Institution type

Keywords

Metadata standards

PID systems

Provider types

Quality management

Repository languages

Software

Syndications

Repository types

Versioning

  • * at the end of a keyword allows wildcard searches
  • " quotes can be used for searching phrases
  • + represents an AND search (default)
  • | represents an OR search
  • - represents a NOT operation
  • ( and ) implies priority
  • ~N after a word specifies the desired edit distance (fuzziness)
  • ~N after a phrase specifies the desired slop amount
  • 1 (current)
Found 16 result(s)
MICASE provides a collection of transcripts of academic speech events recorded at the University of Michigan. The original DAT audiotapes are held in the English Language Institute and may be consulted by bona fide researchers under special arrangements. Additional access: https://lsa.umich.edu/eli/language-resources/micase-micusp.html
ILC-CNR for CLARIN-IT repository is a library for linguistic data and tools. Including: Text Processing and Computational Philology; Natural Language Processing and Knowledge Extraction; Resources, Standards and Infrastructures; Computational Models of Language Usage. The studies carried out within each area are highly interdisciplinary and involve different professional skills and expertises that extend across the disciplines of Linguistics, Computational Linguistics, Computer Science and Bio-Engineering.
Country
The speaking language atlas gives a multimedia impression of the dialects of the state Baden-Württemberg in Germany. The maps of the Speaking Language Atlas of Baden-Württemberg are based on two databases: Südwestdeutschen Sprachatlas (SSA) and the Sprachatlas von Nord Baden-Württemberg (SNBW). The dialect recordings that form the basis for the maps were carried out at the SSA between 1974 and 1986, but at the SNBW between 2009 and 2012. For the southern part, this means that the maps may present a state of affairs that is no longer valid today.
Welcome to the UCLA Phonetics Lab Archive. For over half a century, the UCLA Phonetics Laboratory has collected recordings of hundreds of languages from around the world, providing source materials for phonetic and phonological research, of value to scholars, speakers of the languages, and language learners alike. The materials on this site comprise audio recordings illustrating phonetic structures from over 200 languages with phonetic transcriptions, plus scans of original field notes where relevant.
The English Lexicon Project (supported by the National Science Foundation) affords access to a large set of lexical characteristics, along with behavioral data from visual lexical decision and naming studies of 40,481 words and 40,481 nonwords.
Country
The program "Humanist Virtual Libraries" distributes heritage documents and pursues research associating skills in human sciences and computer science. It aggregates several types of digital documents: A selection of facsimiles of Renaissance works digitized in the Central Region and in partner institutions, the Epistemon Textual Database, which offers digital editions in XML-TEI, and Transcripts or analyzes of notarial minutes and manuscripts
The Alaska Native Language Archive houses documentation of the various Native languages of Alaska and helps to preserve and cultivate this unique heritage for future generations. As the premier repository worldwide for information relating to the Native languages of Alaska, the Archive serves researchers, teachers and students, as well as members of the broader community. The collection includes both published and unpublished materials in or on all of the Alaska Native languages and related languages. The collection has enduring cultural, historic, and intellectual value, particularly for Alaska Native language speakers and their descendants
Content type(s)
A place of living memory, the Phonotheque of the MMSH aims to bring together recordings of the sound heritage that have the value of ethnological, linguistic, historical, musicological or literary information on the Mediterranean area. It documents fields little covered by conventional sources, or completes them with the point of view of actors or witnesses. The collection holds more than 8000 hours of audio archives recorded since the late 1950s concerning all the humanities sciences.
Språkbanken is a collection of Norwegian language technology resources, and a national infrastructure for language technology and research. Our mandate is to collect and develop language resources, and to make these available for researchers, students and the ICT industry which works with the development of language-based ICT solutions. Språkbanken was established as a language policy initiative, designed to ensure that language technology solutions based on the Norwegian language will be developed, and thereby prevent domain loss of Norwegian in technology-dependent areas, cf. Mål og meining (Report 35, 2007 – 2008). As of today the collection contains resources in both Norwegian Bokmål and Nynorsk, as well as in Swedish, Danish and Norwegian Sign Language (NTS).
The South African Centre for Digital Language Resources (SADiLaR) is a national centre supported by the Department of Science and Technology (DST). SADiLaR has an enabling function, with a focus on all official languages of South Africa, supporting research and development in the domains of language technologies and language-related studies in the humanities and social sciences.
The Linguistic Data Consortium (LDC) is an open consortium of universities, libraries, corporations and government research laboratories. It was formed in 1992 to address the critical data shortage then facing language technology research and development. Initially, LDC's primary role was as a repository and distribution point for language resources. Since that time, and with the help of its members, LDC has grown into an organization that creates and distributes a wide array of language resources. LDC also supports sponsored research programs and language-based technology evaluations by providing resources and contributing organizational expertise. LDC is hosted by the University of Pennsylvania and is a center within the University’s School of Arts and Sciences.
Content type(s)
UK RED is a database documenting the history of reading in Britain from 1450 to 1945. Reading experiences of British subjects, both at home and abroad presented in UK RED are drawn from published and unpublished sources as diverse as diaries, commonplace books, memoirs, sociological surveys, and criminal court and prison records.
The University of Oxford Text Archive develops, collects, catalogues and preserves electronic literary and linguistic resources for use in Higher Education, in research, teaching and learning. We also give advice on the creation and use of these resources, and are involved in the development of standards and infrastructure for electronic language resources.
The Phonogrammarchiv is a multi-disciplinary research sound and video archive, covering holdings from all continents. Since its foundation in 1899 the Phonogrammarchiv has been building up its holdings by cooperating with Austrian scholars and archiving their collected material, or by fieldwork conducted by staff members on special topics exploring new fields of methods and contents. The main tasks comprise the production, annotation, cataloguing and long-term preservation of audio-visual field recordings, making the cultural heritage available for future generations and enabling the dissemination of the recordings as well as technical developments in the field of AV recording and storage. Thus the Phonogrammarchiv adds to infrastructural performance valuable to both the scholarly community and the public at large.
By stimulating inspiring research and producing innovative tools, Huygens ING intends to open up old and inaccessible sources, and to understand them better. Huygens ING’s focus is on Digital Humanities, History, History of Science, and Textual Scholarship. Huygens ING pursues research in the fields of History, Literary Studies, the History of Science and Digital Humanities. Huygens ING aims to publish digital sources and data responsibly and with care. Innovative tools are made as widely available as possible. We strive to share the available knowledge at the institute with both academic peers and the wider public.