Item details
Item ID
DGB1-2020corpus_dict
Title 2020 Matukar Panau Corpus and ELAN lexicon file
Description Compiled in October 2020, a Matukar Panau corpus of over 150,000 words and a separate corpus of over 20,000 pared and glossed morphemes. Corpora produced from newly and previously collected data, including words in context, speaker metadat, file metadata and where available parsing and glossing and translations. Also lexicons produced from ELAN. Funded by the Endangered Languages Documentations Program, Project Name MDP0382 Matukar Panau corpus building for the study of language use in context.
Origination date 2020-10-29
Origination date free form
Archive link https://catalog.paradisec.org.au/repository/DGB1/2020corpus_dict
URL
Collector
Danielle Barth
Countries To view related information on a country, click its name
Language as given
Subject language(s) To view related information on a language, click its name
Content language(s) To view related information on a language, click its name
Dialect
Region / village Oceania
Originating university Australian National University
Operator Julia Colleen Miller
Data Categories
Data Types
Discourse type
Roles Danielle Barth : compiler
DOI 10.26278/wqbr-dw76
Cite as Danielle Barth (collector), Danielle Barth (compiler), 2020. 2020 Matukar Panau Corpus and ELAN lexicon file. CSV/PLAIN/XML. DGB1-2020corpus_dict at catalog.paradisec.org.au. https://dx.doi.org/10.26278/wqbr-dw76
Content Files (5)
Filename Type File size Duration File access
DGB1-2020corpus_dict-2020_corpus_by_word.csv text/csv 106 MB
DGB1-2020corpus_dict-2020_corpus_by_word.txt text/plain 106 MB
DGB1-2020corpus_dict-2020_corpus_subset_by_morpheme.csv text/csv 7.96 MB
DGB1-2020corpus_dict-2020_corpus_subset_by_morpheme.txt text/plain 7.96 MB
DGB1-2020corpus_dict-ELAN_lexicon.xml application/xml 656 KB
5 files -- 228 MB -- --

Show 10 Show 50 Show all 5

Collection Information
Collection ID DGB1
Collection title Matukar Panau Language Documentation
Description Recordings collected during 2010-2022 for language documentation. Collection is ongoing. Includes traditional stories, descriptions of traditional practices, family stories, songs, myths and procedural texts including narration of videos of typical village activities.
Countries To view related information on a country, click its name
Languages To view related information on a language, click its name
Access Information
Edit access
View/Download access
Data access conditions Open (subject to agreeing to PDSC access conditions)
Data access narrative
Metadata
RO-Crate Metadata
Comments

Must be logged in to comment


No comments found