Skip to search
Skip to main content
Skip to first result
Search
Search Results
Type:
corpus
Language:
Bulgarian and Croatian
Description:
written; domain-specific (newspaper); diachronic; bilingual; comparable; ca 3,500,000 tokens (393 Kw Bulgarian; 3.1 Mw Croatian)
Rights:
Not specified
Publisher:
University of Zagreb, Faculty of Humanities and Social Sciences
Format:
application/octet-stream
Type:
corpus
Language:
Croatian
Description:
Manually tagged dependency treebank, analytical layer according to the PDT formalism adapted for Croatian
Rights:
Not specified
Publisher:
University of Zagreb, Faculty of Humanities and Social Sciences
Type:
toolService
Language:
Croatian
Description:
On line service for lemmatization, full POS or MSD tagging of Croatian texts.
Rights:
Not specified
Publisher:
University of Zagreb, Faculty of Humanities and Social Sciences
Type:
lexicalConceptualResource
Language:
Croatian
Description:
110,000+ lemmas; 3,900,000+ word-forms, MulText East lexica format
Rights:
Not specified
Publisher:
University of Zagreb, Faculty of Humanities and Social Sciences
Type:
corpus
Language:
Croatian
Description:
This is the reference corpus of standard Croatian. In its 3.0 version, which is accessible via noSketch Engine, it has 216.8 million tokens. In terms of annotation, the corpus is tokenised, lemmatised and tagged for MSDs (morphosyntactic descriptions).
Rights:
Not specified
Type:
corpus
Language:
Croatian and French
Description:
written; domain-specific (fiction); diachronic (the French side); bilingual; parallel; ca 263,000 tokens (148 Kw French; 115 Kw Croatian); XML; S-alignment
Rights:
Not specified