1 - 3 of 3
Number of results to display per page
Search Results
2. DiscoMT 2017 Shared Task on Cross-lingual Pronoun Prediction
- Creator:
- Loáiciga, Sharid, Stymne, Sara, Nakov, Preslav, Hardmeier, Christian, Tiedemann, Jörg, Cettolo, Mauro, and Versley, Yannick
- Publisher:
- Uppsala University
- Type:
- text and corpus
- Subject:
- machine translation, discourse, coreference, and pronouns
- Language:
- English, Spanish, German, and French
- Description:
- Data used in the 2017 shared task on cross-lingual pronoun prediction.
- Rights:
- Creative Commons - Attribution-NonCommercial-NoDerivatives 4.0 International (CC BY-NC-ND 4.0), http://creativecommons.org/licenses/by-nc-nd/4.0/, and PUB
3. Large-Scale Colloquial Persian 0.5
- Creator:
- Abdi Khojasteh, Hadi, Ansari, Ebrahim, and Bohlouli, Mahdi
- Publisher:
- Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL) and Institute for Advanced Studies in Basic Sciences (IASBS)
- Type:
- text and corpus
- Subject:
- PoS tagging, corpus, annotated corpus, multilingual, derivation, dependency parser, machine translation, informal language, spoken language, monolingual corpus, and bilingual corpus annotation
- Language:
- Persian, English, German, Czech, Italian, and Hindi
- Description:
- "Large Scale Colloquial Persian Dataset" (LSCP) is hierarchically organized in asemantic taxonomy that focuses on multi-task informal Persian language understanding as a comprehensive problem. LSCP includes 120M sentences from 27M casual Persian tweets with its dependency relations in syntactic annotation, Part-of-speech tags, sentiment polarity and automatic translation of original Persian sentences in five different languages (EN, CS, DE, IT, HI).
- Rights:
- Creative Commons - Attribution-NonCommercial-NoDerivatives 4.0 International (CC BY-NC-ND 4.0), http://creativecommons.org/licenses/by-nc-nd/4.0/, and PUB