Language: Hungarian / Rights: Not specified / Type: corpus - LINDAT/CLARIAH-CZ Catalog Search Results

Start Over Language Hungarian Rights Not specified Type corpus

1. Budapest Sociolinguistic Interview (BSI)

Publisher:: Academy of Sciences
Type:: corpus
Language:: Hungarian
Description:: BSI is a large-scale survey which provides reliable data on and analyses of the varieties of Hungarian spoken in Budapest.
Rights:: Not specified

2. Hungarian Historical Corpus

Publisher:: Academy of Sciences
Format:: application/xml
Type:: corpus
Language:: Hungarian
Description:: Containing 27 million running words the Hungarian Historical Corpus provides a valuable basis for research on the history of words of Hungarian between the second half of the 18th century and 2000.
Rights:: Not specified

3. Hungarian National Corpus

Publisher:: Academy of Sciences
Format:: application/xml
Type:: corpus
Subject:: synchronic corpus
Language:: Hungarian
Description:: Written general synchronic reference corpus; 190m tokens; POS annotated XML
Rights:: Not specified

4. Hungarian Web Corpus

Publisher:: Budapest University of Technology and Economics Media Research (BME MOKK)
Type:: corpus
Subject:: Web corpus
Language:: Hungarian
Description:: Monolingual written general; 700 million tokens; Segmentation, disambiguation
Rights:: Not specified

5. JRC-Acquis

Publisher:: Joint Research Centre of the EU
Type:: corpus
Language:: Bulgarian, Czech, Danish, Dutch, English, Estonian, Finnish, French, German, Modern Greek (1453-), Hungarian, Italian, Latvian, Maltese, Norwegian, Polish, Portuguese, Romanian, Slovak, Slovenian, Spanish, and Swedish
Description:: The largest parallel corpus, contains EU law, the Acquis Communautaire in 22 languages.
Rights:: Not specified

6. Oasis Numbers

Publisher:: MTA-SZTE Research Group on Artificial Intelligence
Type:: corpus
Subject:: speech corpus
Language:: Hungarian
Description:: spoken, monolingual, manually segmented domain-specific corpus of numbers, 5857 recorded words
Rights:: Not specified

7. SpeechDat-East databases

Type:: corpus
Subject:: These databases serve as an important resource for the performance of voice driven teleservice systems in practical implementations
Language:: Czech, Hungarian, Polish, Russian, and Slovak
Description:: 5 telephone databases recorded over the PSTN. Contains interesting phonetically rich material. All orthographically transcribed. Speaker information included for gender, age, accent. Including pronunciation lexicon.
Rights:: Not specified

8. Speecon databases

Type:: corpus
Language:: Czech, Danish, Dutch, English, Finnish, French, German, Hungarian, Italian, Polish, Portuguese, Russian, Spanish, Swedish, Turkish, Chinese, Hebrew, Japanese, Korean, and Thai
Description:: 28 speech databases containing broadband recordings from 550 adults and 50 children per language. Contains interesting phonetically rich material. All orthographically transcribed. Speaker information included for gender, age, accent. Including pronunciation lexicon.
Rights:: Not specified

9. Szeged Corpus 1.0

Publisher:: Department of Informatics, Human Language Technology Group, University of Szeged
Format:: application/xml
Type:: corpus
Subject:: monolingual corpus, annotated corpus, and POS annotation
Language:: Hungarian
Description:: written, monolingual, general, manually POS annotated reference corpus; 1,247,546 tokens; MSD tagset, XML (TEIxLite) files
Rights:: Not specified

10. Szeged Corpus 2.0

Publisher:: Department of Informatics, Human Language Technology Group, University of Szeged
Format:: application/xml
Type:: corpus
Subject:: monolingual corpus, annotated corpus, and POS annotation
Language:: Hungarian
Description:: written, monolingual, general, manually POS annotated reference corpus; 1,459,288 tokens; MSD tagset, XML (TEI P4) files
Rights:: Not specified

1. Budapest Sociolinguistic Interview (BSI)

2. Hungarian Historical Corpus

3. Hungarian National Corpus

4. Hungarian Web Corpus

5. JRC-Acquis

6. Oasis Numbers

7. SpeechDat-East databases

8. Speecon databases

9. Szeged Corpus 1.0

10. Szeged Corpus 2.0

Limit your search

Show values starting with

Search

Search Constraints

Search Results

Limit your search

Contributor

Coverage

Format

Language

Show values starting with

Publisher

Rights

Subject

Type

Date

Original context has metadata only

Harvested from