Language: Spanish - LINDAT/CLARIAH-CZ Catalog Search Results

Creator:: Cardenas Acosta, Ronald, Bello Medina, Kevin, Coronado, Alberto, and Villota, Elizabeth
Publisher:: National University of Engineering, Peru
Type:: text and corpus
Subject:: job-advertisement, PoS tagging, and text corpora
Language:: Spanish
Description:: The corpus consists of job ads in Spanish for engineering positions in Peru. The documents were preprocessed and annotated for POS tagging, NER, and topic modeling tasks. The corpus is divided into two components: (1) POS tagging/NER training data, consisting of 800 job ads, each tokenized and manually annotated with POS tag information (EAGLE format) and entity labels in BIO format; (2) topic modeling training data, containing 9000 documents stripped of stopwords. The topic modeling data comes in two formats: whole-text documents, containing all the information originally posted in the ad, and extracted-chunk documents, containing chunks extracted by custom NER models (expected skills, tasks to perform, and preferred major), as described in Improving Topic Coherence Using Entity Extraction Denoising (to appear).
Rights:: Creative Commons - Attribution 4.0 International (CC BY 4.0), http://creativecommons.org/licenses/by/4.0/, and PUB

Creator:: Gutiérrez Rubio, Enrique
Type:: text and monograph
Subject:: Czech, Czech language, language development, semantics, cognitive linguistics, general surveys of the history of the Czech lands (chronological), and language, writing
Language:: Spanish
Description:: Originally published as a doctoral dissertation, Universidad Complutense de Madrid, 2007
Rights:: unknown

Creator:: Agirre, Eneko, Branco, António, Popel, Martin, and Simov, Kiril
Publisher:: University of the Basque Country, UPV/EHU, Faculty of Science, University of Lisbon, FCUL, Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL), and Bulgarian Academy of Sciences, IICT-BAS
Type:: text and corpus
Subject:: annotated corpus and multilingual
Language:: Basque, Bulgarian, Czech, English, Portuguese, and Spanish
Description:: This corpus is part of Deliverable 5.5 of the European Commission project QTLeap FP7-ICT-2013.4.1-610516 (http://qtleap.eu). The texts are sentences from the Europarl parallel corpus (Koehn, 2005). We selected the monolingual sentences from the parallel corpora for the following pairs: Bulgarian-English, Czech-English, Portuguese-English and Spanish-English. The English corpus consists of the English side of the Spanish-English corpus. Basque is not in Europarl; in addition, the corpus contains the Basque and English sides of the GNOME corpus. The texts have been automatically annotated with NLP tools, including Word Sense Disambiguation, Named Entity Disambiguation and Coreference resolution. Please check deliverable D5.6 at http://qtleap.eu/deliverables for more information.
Rights:: Creative Commons - Attribution 4.0 International (CC BY 4.0), http://creativecommons.org/licenses/by/4.0/, and PUB

Creator:: Zavadil, Bohumil
Type:: text and teaching texts
Subject:: Ibero-Romance languages, Curricula. School subjects. Textbooks, Spanish language, history of language, Spain, general surveys of world history (chronological), language, writing, and textbooks and lecture notes, teaching aids
Language:: Spanish
Rights:: unknown

Creator:: Binková, Simona
Type:: text and collective monograph
Subject:: Christian associations, societies and organizations. Religious orders, order, Jesuits, missionaries, missions, written sources, Bohemica, Czech (Czechoslovak) proceedings and collective monographs, Mexico, world history of the modern era (1492-1918), religious orders and congregations, religious brotherhoods, monasteries, Philippines, and Czech lands 1526-1792
Language:: Spanish
Rights:: unknown

Creator:: Binková, Simona
Type:: text and collective monograph
Subject:: Christian associations, societies and organizations. Religious orders, order, Jesuits, missionaries, missions, written sources, Bohemica, Czech (Czechoslovak) proceedings and collective monographs, Mexico, world history of the modern era (1492-1918), religious orders and congregations, religious brotherhoods, monasteries, Philippines, and Czech lands 1526-1792
Language:: Spanish
Rights:: unknown
