Search
Search Results
- Publisher:
- Budapest University of Technology and Economics Media Research (BME MOKK)
- Type:
- lexicalConceptualResource
- Language:
- Hungarian
- Description:
- 100,000 lemmas
- Rights:
- Not specified
- Publisher:
- MTA-SZTE Research Group on Artificial Intelligence
- Type:
- corpus
- Subject:
- speech corpus
- Language:
- Hungarian
- Description:
- spoken, monolingual, manually segmented domain-specific corpus of numbers, 5857 recorded words
- Rights:
- Not specified
- Publisher:
- Department of Informatics, Human Language Technology Group, University of Szeged
- Format:
- application/xml
- Type:
- corpus
- Subject:
- monolingual corpus, annotated corpus, and POS annotation
- Language:
- Hungarian
- Description:
- written, monolingual, general, manually POS annotated reference corpus; 1,247,546 tokens; MSD tagset, XML (TEIxLite) files
- Rights:
- Not specified
- Publisher:
- Department of Informatics, Human Language Technology Group, University of Szeged
- Format:
- application/xml
- Type:
- corpus
- Subject:
- monolingual corpus, annotated corpus, and POS annotation
- Language:
- Hungarian
- Description:
- written, monolingual, general, manually POS annotated reference corpus; 1,459,288 tokens; MSD tagset, XML (TEI P4) files
- Rights:
- Not specified
- Publisher:
- Department of Informatics, Human Language Technology Group, University of Szeged
- Format:
- application/xml
- Type:
- corpus
- Language:
- Hungarian
- Description:
- 82,000 sentences with shallow syntactic annotation (NP-level).
- Rights:
- Not specified
- Publisher:
- Department of Informatics, Human Language Technology Group, University of Szeged
- Format:
- application/xml
- Type:
- corpus
- Language:
- Hungarian
- Description:
- 82,000 sentences with full syntactic annotation.
- Rights:
- Not specified