ILSP Dependency Parser is a tool trained on the Greek Dependency Treebank, a resource which comprises data annotated at several linguistic levels. Training data at the level of syntax consisted of ~70 KWords annotated using a dependency-based syntactic scheme that includes 25 main relations.
ILSP FBT Tagger is an adaptation of the Brill tagger trained on Greek text. It uses a PAROLE compatible tagset of 584 different tags which capture the morphosyntactic particularities of the Greek language. Working on the output of a sentence detection and tokenisation tool, the tagger assigns initial tags, looking up in a lexicon created from a manually annotated corpus during training. A suffix lexicon is used for initially tagging unknown words. 799 contextual rules are then applied to improve the initial phase output.
The Imdi Browser enables users to navigate and search swiftly through an IMDI metadata repository. It has two incarnations: a standalone application and a web application.
A corpus of spontaneous discussions and read-aloud performances from native speakers of different ages. Parallel corpus in Russian, Finnish, and Dutch.