Language: Spanish - LINDAT/CLARIAH-CZ Catalog Search Results

221. Corpus documental de Carlos V.

Type:: text and dokumenty
Subject:: Dějiny Evropy, Karel, politika zahraniční, světové dějiny 1492-1648, and politické dějiny, politici
Language:: Spanish
Rights:: unknown

222. Corpus for training and evaluating diacritics restoration systems

Creator:: Náplava, Jakub, Straka, Milan, Hajič, Jan, and Straňák, Pavel
Publisher:: Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
Type:: text and corpus
Subject:: diacritical marks generation and natural language correction
Language:: Czech, Vietnamese, Romanian, Polish, Slovak, Spanish, Croatian, Irish, Latvian, Hungarian, French, and Turkish
Description:: Corpus of texts in 12 languages. For each language, we provide one training, one development and one testing set acquired from Wikipedia articles. Moreover, each language dataset contains (substantially larger) training set collected from (general) Web texts. All sets, except for Wikipedia and Web training sets that can contain similar sentences, are disjoint. Data are segmented into sentences which are further word tokenized. All data in the corpus contain diacritics. To strip diacritics from them, use Python script diacritization_stripping.py contained within attached stripping_diacritics.zip. This script has two modes. We generally recommend using method called uninames, which for some languages behaves better. The code for training recurrent neural-network based model for diacritics restoration is located at https://github.com/arahusky/diacritics_restoration.
Rights:: Creative Commons - Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0), http://creativecommons.org/licenses/by-nc-sa/4.0/, and PUB

223. Corpus PAAU 92

Publisher:: Institut Universitari de Lingüística Aplicada, Universitat Pompeu Fabra
Type:: corpus
Language:: Spanish
Description:: The electronic version of the book “Corpus PAAU 1992: Descriptive Studies, Texts and Vocabulary” includes the texts that have been object of analysis in this project as well as the vocabulary lists that make up the Corpus 92.
Rights:: Not specified

224. Corpus Tècnic de l'IULA

Publisher:: Institut Universitari de Lingüística Aplicada, Universitat Pompeu Fabra
Type:: corpus
Language:: Catalan, English, and Spanish
Description:: domain specific corpus (Law, Economy, Computing, Medicine and Environment as well as a contrastive corpus from the press); EN 3.3 M tokens, SP 33 M tokens, CAT 19 M tokens; EAGLEs pos tagset
Rights:: Not specified

225. CorpusExplorer

Creator:: Rüdiger, Jan Oliver
Publisher:: Jan Oliver Rüdiger
Type:: tool and toolService
Subject:: Corpus Linguisitics, NLP, conll, tei, XML, nlp, Natural Language Processing, linguistics, Linguistics, Computational Linguistics, corpus processing, tagger, POS tagger, lemmatization, text cleaning, CommonCrawl, epub, JSON, Twitter, Pandoc, Wikipedia, digital data, DTA, DSpin, MySQL, ElasticSearch, TextGrid, text corpora, TigerXML, and WeblichtXML
Language:: German, English, French, Italian, Dutch, Spanish, Polish, Arabic, Chinese, and Portuguese
Description:: Software for corpus linguists and text/data mining enthusiasts. The CorpusExplorer combines over 45 interactive visualizations under a user-friendly interface. Routine tasks such as text acquisition, cleaning or tagging are completely automated. The simple interface supports the use in university teaching and leads users/students to fast and substantial results. The CorpusExplorer is open for many standards (XML, CSV, JSON, R, etc.) and also offers its own software development kit (SDK). Source code available at https://github.com/notesjor/corpusexplorer2.0
Rights:: Not specified

226. Correspondencia diplomática de don Baltasar de Zúñiga y El Enbaxador: práctica e ideal de la época sobre aspectos seleccionados de la función informativo-comunicativa de la diplomacia moderna /

Creator:: Bardoňová, Martina
Type:: studie
Subject:: Mezinárodní vztahy, světová politika, Zúñiga, Baltasar de,, vyslanci španělští, dvory panovnické, diplomacie, korespondence diplomatická, české země 1526-1620, Španělsko, světové dějiny 1492-1648, zahraniční politika, mezinárodní vztahy, and literatura, spisovatelé
Language:: Spanish
Rights:: unknown

221. Corpus documental de Carlos V.

222. Corpus for training and evaluating diacritics restoration systems

223. Corpus PAAU 92

224. Corpus Tècnic de l'IULA

225. CorpusExplorer

226. Correspondencia diplomática de don Baltasar de Zúñiga y El Enbaxador: práctica e ideal de la época sobre aspectos seleccionados de la función informativo-comunicativa de la diplomacia moderna /

227. Cortázar poeta /

228. Costa Rica :

229. Criollización y transculturación en la obra de Fernando Ortiz :

230. Crítica literaria y debates políticos en Puerto Rico (1930-1956) /

Limit your search

Show values starting with

Show values starting with

Show values starting with

Show values starting with

Show values starting with

Show values starting with

Show values starting with

Show values starting with

Show values starting with

Search

Search Constraints

Search Results

Limit your search

Contributor

Show values starting with

Coverage

Show values starting with

Creator

Show values starting with

Format

Show values starting with

Language

Show values starting with

Publisher

Show values starting with

Rights

Show values starting with

Subject

Show values starting with

Type

Show values starting with

Date

Original context has metadata only

Harvested from