Skip to search
Skip to main content
Skip to first result
Search
Search Results
Publisher:
Universiteit van Amsterdam
Type:
corpus
Description:
Documentation of the Sri Lanka Malay project (DoBeS project)
Rights:
Code of conduct
Publisher:
University of St. Andrews
Type:
corpus
Description:
Collection of Ancient Egyptian texts, containing hieroglyphs, a transliteration and a translation.
Rights:
Not specified
Publisher:
NBG/DBNL/INL; Nicoline van der Sijs
Type:
corpus
Language:
Dutch
Description:
A digitised version of the Statenvertaling (Bible) of 1637
Rights:
Not specified
Type:
corpus
Language:
Swedish
Description:
Interlanguage/Learner corpus (essays written by SL Swedish learners with many native languages); appr. 200 kW; POS tags; base forms of words (in TEI/XCES XML format)
Rights:
Not specified
Publisher:
Center for Dutch Language and Speech, University of Antwerp
Type:
corpus
Description:
audio of Swahili syllables and phonemes
Rights:
Not specified
Type:
corpus
Language:
Swedish
Description:
appr. 100 kW, functional/dependency (one token per line plus its POS and syntactic annotation[s])
Rights:
Not specified
Publisher:
Department of Informatics, Human Language Technology Group, University of Szeged
Format:
application/xml
Type:
corpus
Subject:
monolingual corpus , annotated corpus , and POS annotation
Language:
Hungarian
Description:
written, monolingual, general, manually POS annotated reference corpus; 1,247,546 tokens; MSD tagset, XML (TEIxLite) files
Rights:
Not specified
Publisher:
Department of Informatics, Human Language Technology Group, University of Szeged
Format:
application/xml
Type:
corpus
Subject:
monolingual corpus , annotated corpus , and POS annotation
Language:
Hungarian
Description:
written, monolingual, general, manually POS annotated reference corpus; 1,459,288 tokens; MSD tagset, XML (TEI P4) files
Rights:
Not specified
Publisher:
Department of Informatics, Human Language Technology Group, University of Szeged
Format:
application/xml
Type:
corpus
Language:
Hungarian
Description:
82,000 sentences with shallow syntactic annotation (NP-level).
Rights:
Not specified
Publisher:
Department of Informatics, Human Language Technology Group, University of Szeged
Format:
application/xml
Type:
corpus
Language:
Hungarian
Description:
82,000 sentences with full syntactic annotation.
Rights:
Not specified