Language Data Ontology
This is a language data ontology based on OLAC terms for use in the ATAP and LDaCA projects
Classes
CollectionEvent | CollectionProtocol | PersonSnapshot | DataDepositLicense | DataLicense | DataReuseLicense | DerivedMaterial | Annotation | PrimaryMaterial
Properties
annotationOf | annotationType | annotator | author | channels | collectionEventType | collectionProtocolType | compiler | consultant | dataInputter | depositor | derivationOf | developer | doi | editor | geoJSON | hasAnnotation | hasDerivation | illustrator | indexableText | interpreter | interviewee | interviewer | subjectLanguage | modality | participant | performer | person | photographer | recorder | register | researchParticipant | researcher | responder | signer | singer | speaker | sponsor | transcriber | translator | hasCollectionProtocol | isDeIdentified | access | accessControlList | authorizationWorkflow | openAccessIndex
DefinedTerms
Coded | Dialogue | Drama | ElicitationTask | Formulaic | Gesture | Handwritten | Informational | Interview | Lexicon | Ludic | Oratory | Orthographic | PartOfSpeech | Phonemic | Phonetic | Phonological | Procedural | Prosodic | Report | Semantic | Gestural | Session | SignLanguage | Song | SpokenLanguage | Syntactic | TextSelectionCriteria | Thesaurus | Narrative | Transcription | Translation | Typeset | Typewritten | WrittenLanguage | WhistledLanguage | AccessControlList | AgreeToTerms | AuthorizationByApplication | AuthorizationByInvitation | AuthorizedAccess | SelfAuthorization | OpenAccess | FullText
DefinedTermsSets
CollectionEventTypeTerms | CollectionProtocolTypeTerms | ModalityTerms | AnnotationTypeTerms | WrittenLanguageTypeTerms | LinguisticGenreTerms | AccessTypes | AuthorizationWorkflows | IndexTypes
Defined Term: Coded
The resource contains an analysis or annotations represented by a code (such as the International Phonetic Alphabet).
Top of page
Class: CollectionEvent
A description of an event at which one or more PrimaryMaterials were captured, eg as video or audio
Subclass of:
schema:Event | schema:CreateAction |
Properties
[ collectionEventType ] |
:
Top of page
Defined Term Set: CollectionEventTypeTerms
Set of terms which are expected values for CollectionEvent types
Session ] |
Has defined terms[Top of page
Class: CollectionProtocol
Description of how this Object or Collection was obtained – such as strategy used for selecting written source texts, or the prompts given to participants
Subclass of:
[ http://schema.org/CreativeWork ] |
Properties
[ collectionProtocolType ] |
:
Top of page
Defined Term Set: CollectionProtocolTypeTerms
Set of terms which are expected values for CollectionProtocol types
ElicitationTask ] | [ TextSelectionCriteria ] |
Has defined terms[Top of page
Defined Term Set: ModalityTerms
Set of expected values for modality types
SpokenLanguage ] | [ WrittenLanguage ] | [ Song ] | [ Gesture ] | [ SignLanguage ] | [ WhistledLanguage ] |
Has defined terms[Top of page
Defined Term: Dialogue
An interactive discourse with two or more participants. Examples of dialogues include conversations, interviews, correspondence, consultations, greetings and leave-takings.
Top of page
Defined Term: Drama
A planned, creative, rendition of discourse with two or more participants intended for presentation to an audience.
Same as:
[ text/drama ] |
Top of page
Defined Term: ElicitationTask
The collection protocol includes a task-based prompt to participants
Is an expected value for the following property
[ collectionProtocolType ] |
Top of page
Defined Term: Formulaic
The resource is a ritually or conventionally structured discourse.
Same as:
[ text/formulaic ] |
Top of page
Defined Term: Gesture
The resource contains non-linguistic gestural communication (ie not sign language)
Is an expected value for the following property
[ modality ] |
Top of page
Defined Term: Handwritten
The resource was written using a writing implement such as pen, pencil, brush or computer stylus (From Nyingarn - TODO check this)
Top of page
Defined Term: Informational
Discourse whose primary purpose is to inform the audience about the natural or social world.
Top of page
Defined Term: Interview
The resource is conversation where one or more speakers are directing the conversation
Top of page
Defined Term: Lexicon
The resource includes a systematic listing of lexical items.
Same as:
[ Lexicon ] |
Top of page
Defined Term: Ludic
Ludic discourse is language whose primary function is to be part of play, or a style of speech that involves a creative manipulation of the structures of the language. Examples of ludic discourse are play languages, jokes, secret languages, and speech disguises.
Same as:
[ text/ludic ] |
Top of page
Defined Term: Oratory
The art of public speaking, or of speaking eloquently according to rules or conventions. Examples of oratory include sermons, lectures, political speeches, and invocations.
Same as:
[ text/orratory ] |
Top of page
Defined Term: Orthographic
The resource contains annotations using orthography (a writing system) as opposed to a coded representation such as a phonetic transcription
Same as:
[ description/orthographic ] |
Top of page
Defined Term: PartOfSpeech
An annotation which assigns lexical elements of language to classes on the basis of their distributional properties (for sign languages, the term 'sign class' is appropriate)
Same as:
Top of page
Class: PersonSnapshot
This class represents a snapshot of a Person in time, in their role as a contributor to one or more creative works. The purpose of this class is to capture the metadata that applies to a person at a particular time, as their name, age, gender, social status etc may be different over time.
Subclass of:
schema:Role | [ http://schema.org/Person ] |
Properties
[ person ] |
:
Top of page
Defined Term: Phonemic
An annotation which represents speech in terms of the sound contrasts made in a language.
Is an expected value for the following property
[ annotationType ] |
Same as:
[ description/phonemic ] |
Top of page
Defined Term: Phonetic
A representation of speech in terms of the sounds produced, typically using the International Phonetic Alphabet
Is an expected value for the following property
[ annotationType ] |
Same as:
[ description/phonetic ] |
Top of page
Defined Term: Phonological
An annotation which includes information about the sound system of a language, such as the contrasts between sounds which make up the sound system and the locally conditioned realisations of sounds which characterise speech in the language.
Is an expected value for the following property
[ annotationType ] |
Same as:
[ description/phonological ] |
Top of page
Defined Term: Procedural
An explanation or description of a method, process, or situation having ordered steps.
Same as:
[ text/procedural ] |
Top of page
Defined Term: Prosodic
An annotation which provides a symbolic record of intonation, stress, tone or other suprasegmental features that is expressed independently of regular phonetic transcription.
Is an expected value for the following property
[ annotationType ] |
Same as:
[ description/prosodic ] |
Top of page
Top of page
Defined Term: Semantic
The resource includes annotation or analysis concerning the encoding of meaning.
Is an expected value for the following property
[ annotationType ] |
Same as:
[ description/semantic ] |
Top of page
Defined Term: Gestural
The resource describes the gestural content of the resource it annotates.
Is an expected value for the following property
[ annotationType ] |
Same as:
[ description/gestural ] |
Top of page
Defined Term: Session
A collection event which is a recording or elicitation Session with participants.
Is an expected value for the following property
[ collectionEventType ] |
Same as:
[ https://www.mpi.nl/ISLE/documents/draft/ISLE_MetaData_2.5.pdf ] |
Top of page
Defined Term: SignLanguage
The resource contains data for which the medium of interaction was signing.
Is an expected value for the following property
[ modality ] |
Top of page
Defined Term: Song
"Words or sounds [articulated] in succession with musical inflections or modulations of the voice" OED.
Is an expected value for the following property
[ modality ] |
Same as:
[ text/singing ] |
Top of page
Defined Term: SpokenLanguage
The resource contains data for which the medium of interaction was speech
Is an expected value for the following property
[ modality ] |
Top of page
Defined Term: Syntactic
The resource contains annotation or analysis describing the combinatorial patterns of words in another resource
Is an expected value for the following property
[ annotationType ] |
Same as:
[ description/syntactic ] |
Top of page
Defined Term: TextSelectionCriteria
A description of the criteria used to select texts in a collection
Is an expected value for the following property
[ collectionProtocolType ] |
Top of page
Defined Term: Thesaurus
The resource contains a list or data structure consisting of words or concepts arranged according to sense.
Same as:
[ lexicon/thesaurus ] |
Top of page
Defined Term: Narrative
A discourse, monologic or co-constructed, which represents temporally organized events. Types of narratives include historical, traditional, and personal narratives, myths, folktales, fables, and humorous stories.
Same as:
[ text/narrative ] |
Top of page
Defined Term: Transcription
The resource contains a transcription, which is a written representation (orthographic or coded) of an audio or visual signal.
Is an expected value for the following property
[ annotationType ] |
Same as:
[ transcription ] |
Top of page
Defined Term: Translation
The resource has been translated from one natural language to another.
Is an expected value for the following property
[ annotationType ] |
Same as:
[ annotation/translation ] |
Top of page
Defined Term Set: AnnotationTypeTerms
The set of expected values for annotation types
Gestural ] | [ Prosodic ] | [ Phonemic ] | [ Phonetic ] | [ Phonological ] | [ Syntactic ] | [ Translation ] | [ Semantic ] | [ Transcription ] |
Has defined terms[Top of page
Class: DataDepositLicense
A license document setting out terms for deposit into a repository
Subclass of:
[ DataLicense ] |
:
Top of page
Class: DataLicense
A licence document for data licensing. This is a superclass of DataReuseLicense and DataLicense
Subclass of:
[ http://schema.org/CreativeWork ] |
Same as:
[ License ] |
:
Top of page
Class: DataReuseLicense
A license document, setting out terms for reuse of data
Subclass of:
[ DataLicense ] |
Properties
[ access ] | [ accessControlList ] | [ authorizationWorkflow ] |
Same as:
[ License ] |
:
Top of page
Class: DerivedMaterial
This is derived from another source, such as a Primary Material, via some process, eg a downsampled video or a sample or an abstract of a resource which is not an annotation (an analysis or description)
Subclass of:
[ http://schema.org/CreativeWork ] |
Properties
[ derivationOf ] |
Same as:
[ text ] |
:
Top of page
Defined Term: Typeset
The resource has been formatted for display.
Top of page
Defined Term: Typewritten
The resource contains text produced on a tpyewriter (From Nyingarn - TODO check this)
Top of page
Defined Term: WrittenLanguage
TThe resource contains data for which the medium of interaction was writing.
Is an expected value for the following property
[ modality ] |
Top of page
Defined Term: WhistledLanguage
The resource contains data for which the medium of interaction was whistling.
Is an expected value for the following property
[ modality ] |
Top of page
Property: annotationOf
This resource contains some kind of description which adds information to the resource it references
Values expected to be one of these types:
[ PrimaryMaterial ] |
Used on these types:
Top of page
Property: annotationType
The type of annotation for Annotation resources
Values expected to be one of these types:
Used on these types:
[ Annotation ] |
Values expected to be one of these defined terms:
[ Gestural ] | [ Prosodic ] | [ Phonemic ] | [ Phonetic ] | [ Phonological ] | [ Syntactic ] | [ Translation ] | [ Semantic ] | [ Transcription ] |
Top of page
Property: annotator
The participant produced an annotation of this or a related resource.
Values expected to be one of these types:
[ http://schema.org/Person ] | [ http://schema.org/Organization ] |
Used on these types:
[ http://schema.org/CreativeWork ] |
Same as:
[ annotator ] |
Top of page
Top of page
Property: channels
Number of audio channels this resource contains (eg 1, 2, 5.1)
Values expected to be one of these types:
Used on these types:
Top of page
Property: collectionEventType
An event with a start and end time during which data are gathered from participants, or from other materials
Values expected to be one of these types:
Used on these types:
[ CollectionEvent ] |
Values expected to be one of these defined terms:
[ Session ] |
Top of page
Property: collectionProtocolType
A description of the process used to collect or collate data, such as prompts given to participants, or how texts are selected for inclusion in a collection
Values expected to be one of these types:
Used on these types:
[ CollectionProtocol ] |
Values expected to be one of these defined terms:
[ ElicitationTask ] | [ TextSelectionCriteria ] |
Top of page
Property: compiler
The participant is responsible for collecting the sub-parts of the resource together.
Values expected to be one of these types:
[ http://schema.org/Person ] | [ http://schema.org/Organization ] |
Used on these types:
[ http://schema.org/CreativeWork ] |
Same as:
[ compiler ] |
Top of page
Property: consultant
The participant contributes expertise to the creation of a work.
Values expected to be one of these types:
[ http://schema.org/Person ] | [ http://schema.org/Organization ] |
Used on these types:
[ http://schema.org/CreativeWork ] |
Same as:
[ consultant ] |
Top of page
Property: dataInputter
The participant was responsible for entering, re-typing, and/or structuring the data contained in the resource.
Values expected to be one of these types:
[ http://schema.org/Person ] | [ http://schema.org/Organization ] |
Used on these types:
[ http://schema.org/CreativeWork ] |
Same as:
[ data_inputter ] |
Top of page
Property: depositor
The participant was responsible for depositing the resource in an archive.
Values expected to be one of these types:
[ http://schema.org/Person ] | [ http://schema.org/Organization ] |
Used on these types:
[ http://schema.org/CreativeWork ] |
Same as:
[ depositor ] |
Top of page
Property: derivationOf
This resource references another resource that is derived from it such as a downsampled audio or video file, or text extracted from a PDF
Values expected to be one of these types:
[ PrimaryMaterial ] | [ Annotation ] |
Used on these types:
[ DerivedMaterial ] |
Top of page
Property: developer
The participant developed the methodology or tools that constitute the resource, or that were used to create the resource.
Values expected to be one of these types:
[ http://schema.org/Person ] | [ http://schema.org/Organization ] |
Used on these types:
[ http://schema.org/CreativeWork ] |
Same as:
[ developer ] |
Top of page
Property: doi
A digital Object Identifier
Values expected to be one of these types:
Used on these types:
Top of page
Property: editor
The participant reviewed, corrected, and/or tested the resource.
Values expected to be one of these types:
[ http://schema.org/Organization ] | [ http://schema.org/Person ] |
Used on these types:
[ http://schema.org/CreativeWork ] |
Same as:
[ editor ] |
Top of page
Property: geoJSON
A valid GEOJson feature or feature collection as a string that can be parsed as JSON
Values expected to be one of these types:
Text |
Used on these types:
schema:GeoCoordinates | schema:GeoShape | schema:Language |
Top of page
Property: hasAnnotation
This resource is referenced by another resource that describes it such as a translation, transcription or other analysis
Values expected to be one of these types:
[ Annnotation ] |
Used on these types:
[ PrimaryMaterial ] |
Top of page
Property: hasDerivation
This resource references another resource that is derived from it such as a downsampled audio or video file, or text extracted from a PDF
Values expected to be one of these types:
[ DerivedMaterial ] |
Used on these types:
[ PrimaryMaterial ] |
Top of page
Property: illustrator
The participant contributed drawings or other illustrations to the resource.
Values expected to be one of these types:
[ http://schema.org/Person ] | [ http://schema.org/Organization ] |
Used on these types:
[ http://schema.org/CreativeWork ] |
Same as:
[ illustrator ] |
Top of page
Property: indexableText
Indicates one or more target File that together contain the full text of an item – each file should indicate its language
Values expected to be one of these types:
schema:File |
Used on these types:
Top of page
Property: interpreter
The participant translates in real-time or explains the discourse recorded in the resource.
Values expected to be one of these types:
[ http://schema.org/Person ] | [ http://schema.org/Organization ] |
Used on these types:
[ http://schema.org/CreativeWork ] |
Same as:
[ interpreter ] |
Top of page
Property: interviewee
The participant was a respondent in an interview
Values expected to be one of these types:
[ http://schema.org/Person ] | [ http://schema.org/Organization ] |
Used on these types:
Top of page
Property: interviewer
The participant conducted an interview that forms part of the resource.
Values expected to be one of these types:
[ http://schema.org/Person ] | [ http://schema.org/Organization ] |
Used on these types:
[ http://schema.org/CreativeWork ] |
Same as:
[ interviewer ] |
Top of page
Property: subjectLanguage
The language(s) that this annotation resource is about
Values expected to be one of these types:
[ http://schema.org/Language ] |
Used on these types:
[ Annotation ] |
Top of page
Property: modality
The mode (spoken, written, signed etc) of this resource. There may be more than one value for this property.
Values expected to be one of these types:
Used on these types:
[ http://schema.org/CreativeWork ] |
Values expected to be one of these defined terms:
[ SpokenLanguage ] | [ WrittenLanguage ] | [ Song ] | [ Gesture ] | [ SignLanguage ] | [ WhistledLanguage ] |
Top of page
Property: participant
The participant was present during the creation of the resource, but did not contribute substantially to its content.
Values expected to be one of these types:
[ http://schema.org/Person ] | [ http://schema.org/Organization ] |
Used on these types:
[ http://schema.org/CreativeWork ] |
Same as:
[ participant ] |
Top of page
Property: performer
The participant performed some portion of a recorded, filmed, or transcribed resource.
Values expected to be one of these types:
[ http://schema.org/Person ] | [ http://schema.org/Organization ] |
Used on these types:
[ http://schema.org/CreativeWork ] |
Same as:
[ performer ] |
Top of page
Property: person
This property references a Person item which represents the persistent identity of one or more ContributingPerson items.
Values expected to be one of these types:
[ http://schema.org/Person ] |
Used on these types:
[ PersonSnapshot ] |
Top of page
Property: photographer
The participant took the photograph, or shot the film, that appears in or constitutes the resource.
Values expected to be one of these types:
[ http://schema.org/Person ] | [ http://schema.org/Organization ] |
Used on these types:
[ http://schema.org/CreativeWork ] |
Same as:
[ photographer ] |
Top of page
Property: recorder
The participant operated the recording machinery used to create the resource.
Values expected to be one of these types:
[ http://schema.org/Person ] | [ http://schema.org/Organization ] |
Used on these types:
[ http://schema.org/CreativeWork ] |
Same as:
[ recorder ] |
Top of page
Property: register
Specifies the type of register (any of the varieties of a language that a speaker uses in a particular social context [Merriam-Webster]) of the contents of a language resource.
Values expected to be one of these types:
Used on these types:
[ http://schema.org/CreativeWork ] |
Same as:
Top of page
Property: researchParticipant
The participant acted as a research subject or responded to a questionnaire, the results of which study form the basis of the resource.
Values expected to be one of these types:
[ http://schema.org/Person ] | [ http://schema.org/Organization ] |
Used on these types:
[ http://schema.org/CreativeWork ] |
Same as:
[ research_participant ] |
Top of page
Property: researcher
The resource was created as part of the participant's research, or the research presents interim or final results from the participant's research.
Values expected to be one of these types:
[ http://schema.org/Person ] | [ http://schema.org/Organization ] |
Used on these types:
[ http://schema.org/CreativeWork ] |
Same as:
[ researcher ] |
Top of page
Property: responder
The participant was an interlocutor in some sort of discourse event.
Values expected to be one of these types:
[ http://schema.org/Person ] | [ http://schema.org/Organization ] |
Used on these types:
[ http://schema.org/CreativeWork ] |
Same as:
[ responder ] |
Top of page
Property: signer
The participant was a principal signer in a resource that consists of a recording, a film, or a transcription of a recorded resource.
Values expected to be one of these types:
[ http://schema.org/Organization ] | [ http://schema.org/Person ] |
Used on these types:
[ http://schema.org/CreativeWork ] |
Same as:
[ signer ] |
Top of page
Property: singer
The participant sang, either individually or as part of a group, in a resource that consists of a recording, a film, or a transcription of a recorded resource.
Values expected to be one of these types:
[ http://schema.org/Person ] | [ http://schema.org/Organization ] |
Used on these types:
[ http://schema.org/CreativeWork ] |
Same as:
[ singer ] |
Top of page
Property: speaker
The participant was a principal speaker in a resource that consists of a recording, a film, or a transcription of a recorded resource.
Values expected to be one of these types:
[ http://schema.org/Person ] | [ http://schema.org/Organization ] |
Used on these types:
[ http://schema.org/CreativeWork ] |
Same as:
[ speaker ] |
Top of page
Property: sponsor
The participant contributed financial support to the creation of the resource.
Values expected to be one of these types:
[ http://schema.org/Person ] | [ http://schema.org/Organization ] |
Used on these types:
[ http://schema.org/CreativeWork ] |
Same as:
[ sponsor ] |
Top of page
Property: transcriber
The participant produced a transcription of this or a related resource.
Values expected to be one of these types:
[ http://schema.org/Person ] | [ http://schema.org/Organization ] |
Used on these types:
[ http://schema.org/CreativeWork ] |
Same as:
[ transcriber ] |
Top of page
Property: translator
The participant produced a translation of this or a related resource.
Values expected to be one of these types:
[ http://schema.org/Organization ] | [ http://schema.org/Person ] |
Used on these types:
[ http://schema.org/CreativeWork ] |
Same as:
[ translator ] |
Top of page
Property: hasCollectionProtocol
This resource was assembled or collected according to the linked protocol
Values expected to be one of these types:
[ CollectionProtocol ] |
Used on these types:
Top of page
Property: isDeIdentified
The data in this item has had identifying information removed, or in the case of a person the name is an alias
Values expected to be one of these types:
schema:Boolean |
Used on these types:
[ [{"@id":"schema:CreativeWork"}, {"@id":"schema:Person"}, https://purl.archive.org/language-data-commons/terms#PersonSnapshot] ] |
Top of page
Defined Term Set: WrittenLanguageTypeTerms
Set of expected types for WrittenLanguage modality (this set is incomplete - more work needed)
Handwritten ] | [ Typewritten ] | [ Typeset ] |
Has defined terms[Top of page
Defined Term Set: LinguisticGenreTerms
Set of expected values for linguistic genre of a resource
Formulaic ] | [ Thesaurus ] | [ Dialogue ] | [ Oratory ] | [ Report ] | [ Ludic ] | [ Procedural ] | [ Narrative ] | [ Interview ] | [ Drama ] | [ Informational ] |
Has defined terms[Top of page
Class: Annotation
The resource includes material which adds information to some other linguistic record.
Subclass of:
[ http://schema.org/CreativeWork ] |
Properties
[ annotationType ] | [ linguisticGenre ] |
Same as:
[ annotation ] |
:
Top of page
Class: PrimaryMaterial
The object of study, such as a literary work, film, or recording of natural discourse
Subclass of:
[ http://schema.org/CreativeWork ] |
Properties
[ hasAnnotation ] | [ hasDerivation ] |
Same as:
[ text ] |
:
Top of page
Property: access
Is this an open or restricted access license
Values expected to be one of these types:
Used on these types:
[ DataReuseLicense ] |
Values expected to be one of these defined terms:
[ OpenAccess ] | [ AuthorizedAccess ] |
Top of page
Property: accessControlList
When a license has an authorizationWorkflow property with a value of the DefineTerm AcessControlList this property has a URI value that points to a list of userIDs
Values expected to be one of these types:
[ http://schema.org/URL ] |
Used on these types:
[ DataReuseLicense ] |
Top of page
Defined Term: AccessControlList
License grants access to data based on a list of approved users, specified using the property accessControlList
Top of page
Defined Term: AgreeToTerms
A user is expected to explicitly agree to a set of license terms, this may be combined with AccessControlList - to note that even if a user has been pre-approved for a license they must agree to license terms
Is an expected value for the following property
[ authorizationWorkflow ] |
Top of page
Top of page
Top of page
Top of page
Top of page
Defined Term Set: AccessTypes
Set of defined terms to specify whether a DataReuseLicense allows open or restricted (authorized) access
OpenAccess ] | [ AuthorizedAccess ] |
Has defined terms[Top of page
Top of page
Top of page
Defined Term: OpenAccess
Data covered by this license may be accessed as long as the license is served alongside it, does not require any specific authorization step
Is an expected value for the following property
[ access ] |
Top of page
Defined Term Set: IndexTypes
Set of defined terms for types of indexing, such as FullText
FullText ] |
Has defined terms[Top of page
Defined Term: FullText
A text index which makes the full text of a data resources findable via a search interface
Is an expected value for the following property
[ openAccessIndex ] |
Top of page
Property: openAccessIndex
One or more public index types allowed by a license, eg FullText indexing may be allowed for discovery even when an item is not
Values expected to be one of these types:
Used on these types:
[ http://schema.org/CreativeWork ] |
Values expected to be one of these defined terms:
[ FullText ] |
Top of page