« Previous |
1 - 100 of 231
|
Next »
Number of results to display per page
Search Results
2. ...e procuri che non mi dimentichino i comuni amici... )...e procurem que não me esqueçam os nossos amigos comuns...) /
- Creator:
- Fričová, Yvonna,
- Subject:
- Frič, Alberto Vojtěch,, Boggiani, Guido,, cestovatelé, cesty výzkumné, cestopisy, cestovatelé, české země 1848-1918, Československo 1918-1992, and dějiny věd o neslovanských oblastech
- Language:
- Portuguese
- Rights:
- unknown
3. 100 anos das relações diplomáticas tcheco-brasileiras /
- Creator:
- Pelant, Matyáš
- Type:
- text and studie
- Subject:
- Světové dějiny, vztahy česko-brazilské, vztahy mezinárodní, diplomacie, vztahy hospodářské, Československo 1918-1992, zahraniční politika, mezinárodní vztahy, hospodářské dějiny, Brazílie, and světové dějiny od r. 1918 do současnosti
- Language:
- Portuguese
- Rights:
- unknown
4. 100 František-Jorge Listopad :
- Creator:
- Válová, Karolina,
- Type:
- text and monografie kolektivní
- Subject:
- Česká literatura (o ní), Listopad, František,, spisovatelé, básníci, literatura česká, život literární, české (československé) sborníky a kolektivní monografie, Československo 1918-1992, české země od r. 1993 do současnosti, and literatura, spisovatelé
- Language:
- Czech and Portuguese
- Rights:
- unknown
5. A Boémia e o terramoto de Lisboa de 1755 /
- Creator:
- Polišenský, Josef,
- Subject:
- Stepling, Josef,, lázně, vztahy česko-portugalské, vztahy kulturní, české země 1620-1740, and lékařství, lázně, nemocnice, špitály
- Language:
- Portuguese
- Rights:
- unknown
6. A descolonização da Guiné no contexto da descolonização portuguesa /
- Creator:
- Klíma, Jan,
- Type:
- studie
- Subject:
- Dějiny Afriky, kolonie portugalské, kolonialismus, dekolonizace, Guinea-Bissau, světové dějiny od r. 1945 do současnosti, and politické dějiny, politici
- Language:
- Portuguese
- Rights:
- unknown
7. A divulgaçäo da Tchecoslováquia e dos seus artigos comercias na América Latina no período entre-guerras :
- Creator:
- Novotný, Jiří,
- Type:
- text and studie
- Subject:
- Světová ekonomika a mezinárodní finance, výstavy zahraniční, veletrhy, výstavnictví, vztahy obchodní, vztahy československo-jihoamerické, vztahy hospodářské, obchod, zahraniční výstavy, and Československo 1918-1938
- Language:
- Portuguese
- Rights:
- unknown
8. A Emigração dos Países Tchecos ao Brasil antes de Originar-se a República Tchecoslovaca /
- Creator:
- Baďura, Bohumil,
- Subject:
- Lorenc, František Vladimír,, emigrace česká, světové dějiny 1789-1918, světové dějiny od r. 1918 do současnosti, Brazílie, migrace, vystěhovalectví, kolonizace, české země 1848-1918, and Československo 1918-1938
- Language:
- Portuguese
- Rights:
- unknown
9. A emigração européia na América Latina: a propaganda italiana no Sul do Brasil /
- Creator:
- Merlotti Herédia, Vania Beatriz
- Subject:
- emigrace, vystěhovalectví, propaganda, Italové, Brazílie, světové dějiny 1789-1918, Itálie, and migrace, vystěhovalectví, kolonizace
- Language:
- Portuguese
- Rights:
- unknown
10. A escravidão entre os imigrantes alemães no Sul do Brasil. (Colônia de São Leopoldo, Brasil - primeira metade do século 19) /
- Creator:
- Tamontini, Marcos Justo
- Subject:
- vztahy brazilsko-německé, emigrace německá, Němci brazilští, světové dějiny 1789-1918, Brazílie, Německo, and migrace, vystěhovalectví, kolonizace
- Language:
- Portuguese
- Rights:
- unknown
11. A exposição Novos mundos - Neue Welten. (Berlin, 24 de Outubro de 2007 - 10 de Fevereiro de 2008) /
- Creator:
- Binková, Simona,
- Subject:
- výstavy zahraniční, amerikanistika, and zahraniční výstavy
- Language:
- Portuguese
- Rights:
- unknown
12. A inserção dos emigrantes italianos na formação econômica da região colonial italiano no RS /
- Creator:
- Herédia, Vania Beatriz Merlotti
- Subject:
- emigrace italská, města brazilská, vztahy brazilsko-italské, dějiny hospodářské, světové dějiny 1789-1918, Brazílie, Itálie, and migrace, vystěhovalectví, kolonizace
- Language:
- Portuguese
- Rights:
- unknown
13. A juventude pelo direito ao trabalho :
- Type:
- text and dokumenty
- Subject:
- Politické strany a hnutí, setkání mládeže, mládež portugalská, projevy politické, Portugalsko, světové dějiny od r. 1945 do současnosti, and politické dějiny, politici
- Language:
- Portuguese
- Rights:
- unknown
14. A Nova África, a Nova Índia e o Novo Mundo - o Brasil - nos escritos quinhentistas checos /
- Creator:
- Binková, Simona,
- Subject:
- cestopisy, cestování, pohled na druhé, kosmologie, cestovatelé, cestopisy, cestovatelé, české země 1526-1620, and literatura, spisovatelé
- Language:
- Portuguese
- Rights:
- unknown
15. A revolução portuguesa :
- Creator:
- Cunhal, Álvaro,
- Type:
- text and monografie
- Subject:
- Dějiny států a území na Pyrenejském poloostrově, Cunhal, Álvaro,, politici portugalští, dějiny politické, Portugalsko, politické dějiny, politici, and světové dějiny od r. 1918 do současnosti
- Language:
- Portuguese
- Rights:
- unknown
16. A Revoluçao Portuguesa :
- Creator:
- Cunhal, Álvaro,
- Type:
- text and spisy
- Subject:
- Politické strany a hnutí, Dějiny států a území na Pyrenejském poloostrově, revoluce, dějiny politické, Portugalsko, vnitřní politika, and světové dějiny od r. 1945 do současnosti
- Language:
- Portuguese
- Rights:
- unknown
17. Acerca de la relevancia história de la emigración austríaca a América Latina /
- Creator:
- Kaller-Dietrich, Martina
- Subject:
- emigrace rakouská, vztahy argentinsko-evropské, světové dějiny 1789-1918, světové dějiny od r. 1918 do současnosti, Habsburská monarchie, Jugoslávie, Maďarsko, Rakousko, and migrace, vystěhovalectví, kolonizace
- Language:
- Portuguese
- Rights:
- unknown
18. Afinidades históricas e culturais entre o Brasil e a República Tcheca /
- Creator:
- Štěpánek, Pavel,
- Type:
- text and monografie
- Subject:
- Dějiny civilizace. Kulturní dějiny, vztahy česko-brazilské, vztahy kulturní, vztahy brazilsko-české, zahraniční politika, mezinárodní vztahy, přehledná zpracování světových dějin (chronologicky), Brazílie, and přehledná zpracování dějin českých zemí (chronologicky)
- Language:
- Portuguese
- Rights:
- unknown
19. Alberto da Veiga Simões :
- Creator:
- Madeira, Lina Alves,
- Type:
- text and biografie
- Subject:
- Dějiny států a území na Pyrenejském poloostrově, Simões, Alberto da Veiga,, diplomaté portugalští, vyslanci, politici portugalští, Portugalsko, politické dějiny, politici, světové dějiny od r. 1918 do současnosti, and světové dějiny 1789-1918
- Language:
- Portuguese
- Rights:
- unknown
20. Alberto Vojtěch Frič 08. 09. 1882, Praga (República Tcheca) - 04. 12. 1944, Praga (República Tcheca). Explorador, botânico, etnógrafo e publicitário /
- Creator:
- Fričová, Yvonna,
- Type:
- text and studie
- Subject:
- Dějiny civilizace. Kulturní dějiny, Frič, Alberto Vojtěch,, vztahy česko-jihoamerické, cestovatelé, botanici, etnografové, české země 1848-1918, Československo 1918-1945, světové dějiny 1789-1918, and dějiny vědy, umění, kultury a techniky, kulturní vztahy
- Language:
- Portuguese
- Rights:
- unknown
21. Amara - universal subtitles
- Type:
- corpus
- Language:
- Arabic, Danish, Dutch, English, German, Modern Greek (1453-), Italian, Japanese, Korean, Portuguese, Russian, Spanish, and Turkish
- Description:
- Large set of subtitles available for download in multiple languages. Can be used as parallel corpus.
- Rights:
- Not specified
22. Annotated corpora and tools of the PARSEME Shared Task on Automatic Identification of Verbal Multiword Expressions (edition 1.0)
- Creator:
- Savary, Agata, Ramisch, Carlos, Cordeiro, Silvio Ricardo, Sangati, Federico, Vincze, Veronika, QasemiZadeh, Behrang, Candito, Marie, Cap, Fabienne, Giouli, Voula, Stoyanova, Ivelina, Doucet, Antoine, Adalı, Kübra, Barbu Mititelu, Verginica, Bejček, Eduard, El Maarouf, Ismail, Eryiğit, Gülşen, Galea, Luke, Ha-Cohen Kerner, Yaakov, Liebeskind, Chaya, Monti, Johanna, Parra Escartín, Carla, Kovalevskaitė, Jolanta, Krek, Simon, van der Plas, Lonneke, Aceta, Cristina, Aduriz, Itziar, Antoine, Jean-Yves, Attard, Greta, Azzopardi, Kirsty, Boizou, Loic, Bonnici, Janice, Boz, Mert, Bumbulienė, Ieva, Busuttil, Jael, Caruso, Valeria, Cherchi, Manuela, Constant, Matthieu, Czerepowicka, Monika, De Santis, Anna, Dimitrova, Tsvetana, Dinç, Tutkum, Elyovich, Hevi, Fabri, Ray, Farrugia, Alison, Findlay, Jamie, Fotopoulou, Aggeliki, Foufi, Vassiliki, Galea, Sara Anne, Gantar, Polona, Gatt, Albert, Gatt, Anabelle, Herrero, Carlos, Iñurrieta, Uxoa, Jagfeld, Glorianna, Hnátková, Milena, Ionescu, Mihaela, Klyueva, Natalia, Koeva, Svetla, Kovács, Viktória, Kuzman, Taja, Leseva, Svetlozara, Louisou, Sevi, Lynn, Teresa, Malka, Ruth, Martínez Alonso, Héctor, McCrae, John, de Medeiros Caseli, Helena, Miral, Ayşenur, Muscat, Amanda, Nivre, Joakim, Oakes, Michael, Onofrei, Mihaela, Parmentier, Yannick, Pasquer, Caroline, Pia di Buono, Maria, Priego Sanchez, Belem, Raffone, Annalisa, Ramisch, Renata, Rimkutė, Erika, Rizea, Monica-Mihaela, Simkó, Katalin, Spagnol, Michael, Stefanova, Valentina, Stymne, Sara, Sulubacak, Umut, Tabone, Nicole, Tanti, Marc, Todorova, Maria, Urešová, Zdenka, Villavicencio, Aline, and Zilio, Leonardo
- Publisher:
- PARSEME
- Type:
- text and corpus
- Subject:
- Multiword expressions, verbal multiword expressions, idioms, light-verb constructions, verb-particle constructions, and inherently reflexive verbs
- Language:
- Bulgarian, Czech, German, Modern Greek (1453-), Spanish, Persian, French, Hebrew, Hungarian, Italian, Lithuanian, Maltese, Polish, Portuguese, Romanian, Slovenian, Swedish, and Turkish
- Description:
- The PARSEME shared task aims at identifying verbal MWEs in running texts. Verbal MWEs include idioms (let the cat out of the bag), light verb constructions (make a decision), verb-particle constructions (give up), and inherently reflexive verbs (se suicider 'to suicide' in French). VMWEs were annotated according to the universal guidelines in 18 languages. The corpora are provided in the parsemetsv format, inspired by the CONLL-U format. For most languages, paired files in the CONLL-U format - not necessarily using UD tagsets - containing parts of speech, lemmas, morphological features and/or syntactic dependencies are also provided. Depending on the language, the information comes from treebanks (e.g., Universal Dependencies) or from automatic parsers trained on treebanks (e.g., UDPipe). This item contains training and test data, tools and the universal guidelines file.
- Rights:
- PARSEME Shared Task Data (v. 1.0) Agreement, https://lindat.mff.cuni.cz/repository/xmlui/page/licence-mwe-1.0, and PUB
23. Annotated corpora and tools of the PARSEME Shared Task on Automatic Identification of Verbal Multiword Expressions (edition 1.1)
- Creator:
- Ramisch, Carlos, Cordeiro, Silvio Ricardo, Savary, Agata, Vincze, Veronika, Barbu Mititelu, Verginica, Bhatia, Archna, Buljan, Maja, Candito, Marie, Gantar, Polona, Giouli, Voula, Güngör, Tunga, Hawwari, Abdelati, Iñurrieta, Uxoa, Kovalevskaitė, Jolanta, Krek, Simon, Lichte, Timm, Liebeskind, Chaya, Monti, Johanna, Parra Escartín, Carla, QasemiZadeh, Behrang, Ramisch, Renata, Schneider, Nathan, Stoyanova, Ivelina, Vaidya, Ashwini, Walsh, Abigail, Aceta, Cristina, Aduriz, Itziar, Antoine, Jean-Yves, Arhar Holdt, Špela, Berk, Gözde, Bielinskienė, Agnė, Blagus, Goranka, Boizou, Loic, Bonial, Claire, Caruso, Valeria, Čibej, Jaka, Constant, Matthieu, Cook, Paul, Diab, Mona, Dimitrova, Tsvetana, Ehren, Rafael, Elbadrashiny, Mohamed, Elyovich, Hevi, Erden, Berna, Estarrona, Ainara, Fotopoulou, Aggeliki, Foufi, Vassiliki, Geeraert, Kristina, van Gompel, Maarten, Gonzalez, Itziar, Gurrutxaga, Antton, Ha-Cohen Kerner, Yaakov, Ibrahim, Rehab, Ionescu, Mihaela, Jain, Kanishka, Jazbec, Ivo-Pavao, Kavčič, Teja, Klyueva, Natalia, Kocijan, Kristina, Kovács, Viktória, Kuzman, Taja, Leseva, Svetlozara, Ljubešić, Nikola, Malka, Ruth, Markantonatou, Stella, Martínez Alonso, Héctor, Matas, Ivana, McCrae, John, de Medeiros Caseli, Helena, Onofrei, Mihaela, Palka-Binkiewicz, Emilia, Papadelli, Stella, Parmentier, Yannick, Pascucci, Antonio, Pasquer, Caroline, Pia di Buono, Maria, Puri, Vandana, Raffone, Annalisa, Ratori, Shraddha, Riccio, Anna, Sangati, Federico, Shukla, Vishakha, Simkó, Katalin, Šnajder, Jan, Somers, Clarissa, Srivastava, Shubham, Stefanova, Valentina, Taslimipoor, Shiva, Theoxari, Natasa, Todorova, Maria, Urizar, Ruben, Villavicencio, Aline, and Zilio, Leonardo
- Publisher:
- PARSEME
- Type:
- text and corpus
- Subject:
- Multiword expressions, verbal multiword expressions, light-verb constructions, verb-particle constructions, inherently reflexive verbs, verbal idioms, and multi-verb constructions
- Language:
- Bulgarian, German, Modern Greek (1453-), Spanish, Persian, French, Hebrew, Hungarian, Italian, Lithuanian, Polish, Portuguese, Romanian, Slovenian, Turkish, Hindi, Basque, English, and Croatian
- Description:
- This multilingual resource contains corpora in which verbal MWEs have been manually annotated. VMWEs include idioms (let the cat out of the bag), light-verb constructions (make a decision), verb-particle constructions (give up), inherently reflexive verbs (help oneself), and multi-verb constructions (make do). VMWEs were annotated according to the universal guidelines in 19 languages. The corpora are provided in the cupt format, inspired by the CONLL-U format. The corpora were used in the 1.1 edition of the PARSEME Shared Task (2018). For most languages, morphological and syntactic information – not necessarily using UD tagsets – including parts of speech, lemmas, morphological features and/or syntactic dependencies are also provided. Depending on the language, the information comes from treebanks (e.g., Universal Dependencies) or from automatic parsers trained on treebanks (e.g., UDPipe). This item contains training, development and test data, as well as the evaluation tools used in the PARSEME Shared Task 1.1 (2018). The annotation guidelines are available online: http://parsemefr.lif.univ-mrs.fr/parseme-st-guidelines/1.1
- Rights:
- PARSEME Shared Task Data (v. 1.1) Agreement, https://lindat.mff.cuni.cz/repository/xmlui/page/licence-mwe-1.1, and PUB
24. Annotated corpora and tools of the PARSEME Shared Task on Semi-Supervised Identification of Verbal Multiword Expressions (edition 1.2)
- Creator:
- Ramisch, Carlos, Guillaume, Bruno, Savary, Agata, Waszczuk, Jakub, Candito, Marie, Vaidya, Ashwini, Barbu Mititelu, Verginica, Bhatia, Archna, Iñurrieta, Uxoa, Giouli, Voula, Güngör, Tunga, Jiang, Menghan, Lichte, Timm, Liebeskind, Chaya, Monti, Johanna, Ramisch, Renata, Stymme, Sara, Walsh, Abigail, Xu, Hongzhi, Palka-Binkiewicz, Emilia, Ehren, Rafael, Stymne, Sara, Constant, Matthieu, Pasquer, Caroline, Parmentier, Yannick, Antoine, Jean-Yves, Carlino, Carola, Caruso, Valeria, Di Buono, Maria Pia, Pascucci, Antonio, Raffone, Annalisa, Riccio, Anna, Sangati, Federico, Speranza, Giulia, Cordeiro, Silvio Ricardo, de Medeiros Caseli, Helena, Miranda, Isaac, Rademaker, Alexandre, Vale, Oto, Villavicencio, Aline, Wick Pedro, Gabriela, Wilkens, Rodrigo, Zilio, Leonardo, Rizea, Monica-Mihaela, Ionescu, Mihaela, Onofrei, Mihaela, Chen, Jia, Ge, Xiaomin, Hu, Fangyuan, Hu, Sha, Li, Minli, Liu, Siyuan, Qin, Zhenzhen, Sun, Ruilong, Wang, Chenweng, Xiao, Huangyang, Yan, Peiyi, Yih, Tsy, Yu, Ke, Yu, Songping, Zeng, Si, Zhang, Yongchen, Zhao, Yun, Foufi, Vassiliki, Fotopoulou, Aggeliki, Markantonatou, Stella, Papadelli, Stella, Louizou, Sevasti, Aduriz, Itziar, Estarrona, Ainara, Gonzalez, Itziar, Gurrutxaga, Antton, Uria, Larraitz, Urizar, Ruben, Foster, Jennifer, Lynn, Teresa, Elyovitch, Hevi, Ha-Cohen Kerner, Yaakov, Malka, Ruth, Jain, Kanishka, Puri, Vandana, Ratori, Shraddha, Shukla, Vishakha, Srivastava, Shubham, Berk, Gozde, Erden, Berna, and Yirmibeşoğlu, Zeynep
- Publisher:
- PARSEME
- Type:
- text and corpus
- Subject:
- multiword expressions, verbal multiword expressions, light verb construction, verb-particle constructions, inherently reflexive verbs, verbal idioms, and multi-verb constructions
- Language:
- German, Modern Greek (1453-), Basque, French, Irish, Hebrew, Hindi, Italian, Polish, Portuguese, Romanian, Swedish, Turkish, and Chinese
- Description:
- This multilingual resource contains corpora in which verbal MWEs have been manually annotated, gathered at the occasion of the 1.2 edition of the PARSEME Shared Task on semi-supervised Identification of Verbal MWEs (2020). VMWEs include idioms (let the cat out of the bag), light-verb constructions (make a decision), verb-particle constructions (give up), inherently reflexive verbs (help oneself), and multi-verb constructions (make do). For the 1.2 shared task edition, the data covers 14 languages, for which VMWEs were annotated according to the universal guidelines. The corpora are provided in the cupt format, inspired by the CONLL-U format. Morphological and syntactic information – not necessarily using UD tagsets – including parts of speech, lemmas, morphological features and/or syntactic dependencies are also provided. Depending on the language, the information comes from treebanks (e.g., Universal Dependencies) or from automatic parsers trained on treebanks (e.g., UDPipe). This item contains training, development and test data, as well as the evaluation tools used in the PARSEME Shared Task 1.2 (2020). The annotation guidelines are available online: http://parsemefr.lif.univ-mrs.fr/parseme-st-guidelines/1.2
- Rights:
- PARSEME Shared Task Data (v. 1.2) Agreement, https://lindat.mff.cuni.cz/repository/xmlui/page/licence-mwe-1.2, and PUB
25. Antecedentes de la emigración masiva: migración en grupos. (Los misioneros jesuitas y los especialistas en minería centroeuropeos en América Latina. Condiciones y resultados) /
- Creator:
- Binková, Simona,
- Subject:
- vztahy evropsko-latinskoamerické, migrace, misionáři, řád, jezuité, bibliografie tematické, světové dějiny 1492-1648, světové dějiny 1648-1789, církevní řády a kongregace, náboženská bratrstva, kláštery, and bibliografie oborové a tematické, rejstříky časopisů
- Language:
- Portuguese
- Description:
- [Bibliografie s. 72-73].
- Rights:
- unknown
26. As Eleiçoes para a Assembleia da República /
- Creator:
- Cunhal, Álvaro,
- Type:
- text and projevy
- Subject:
- Dějiny států a území na Balkánském poloostrově, Cunhal, Álvaro,, politici portugalští, strany politické, strany politické komunistické, Portugalsko, světové dějiny od r. 1918 do současnosti, and politické dějiny, politici
- Language:
- Portuguese
- Rights:
- unknown
27. As greves de 8 e 9 de maio de 1944
- Type:
- text and dokumenty
- Subject:
- Politické strany a hnutí, hnutí dělnické, stávky, Portugalsko, dělnictvo, chudina, and světové dějiny 1939-1945
- Language:
- Portuguese
- Rights:
- unknown
28. As viagens e os viajantes para os portos da Lusofonia /
- Creator:
- Cristóvão, Fernando
- Subject:
- cestovatelé, cestování, přístavy, světové dějiny středověku (do r. 1492), světové dějiny novověku (1492-1918), and doprava, komunikace, pošta, inženýrské sítě
- Language:
- Portuguese
- Rights:
- unknown
29. Atlas Histórico de Portugal e do Ultramar Português /
- Creator:
- Marques, António Henrique R. de Oliveira
- Publisher:
- Centro de Estudos Históricos,
- Subject:
- atlasy, geografie historická, kolonie, objevy zámořské, dějiny států, politické dějiny, politici, přehledná zpracování světových dějin (chronologicky), Portugalsko, and historická geografie, kartografie a topografie
- Language:
- Portuguese
- Rights:
- unknown
30. Atlas Histórico de Portugal e do Ultramar Português /
- Creator:
- Marques, António Henrique R. de Oliveira,
- Type:
- text and atlasy
- Subject:
- Dějiny států a území na Pyrenejském poloostrově, Historická geografie, geografie historická, kolonie, objevy zámořské, dějiny států, Portugalsko, politické dějiny, politici, přehledná zpracování světových dějin (chronologicky), and historická geografie, kartografie a topografie
- Language:
- Portuguese
- Rights:
- unknown
31. Basic vocabulary on the Human Genome
- Publisher:
- Institut Universitari de Lingüística Aplicada, Universitat Pompeu Fabra
- Type:
- lexicalConceptualResource
- Language:
- Catalan, English, French, Galician, Italian, Portuguese, and Spanish
- Description:
- A vocabulary resulting from the cooperation of the groups of REALITER network that collects the basic terminology mostly used in texts about Genomics. It contains equivalents in English, Peninsular and Latinamerican Spanish, French, Italian, Galician, Portuguese and Catalan.
- Rights:
- Not specified
32. Bedřich Katzer 05. 06. 1861, Rokycany (República Tcheca) - 03. 02. 1925, Sarajevo (Bósnia e Herzegovina). Geólogo e viajante /
- Creator:
- Martínek, Jiří,
- Type:
- text and studie
- Subject:
- Vědy o Zemi. Geologické vědy, Katzer, Bedřich,, geologové, cestovatelé, vztahy česko-brazilské, české země 1848-1918, vědy o neživé přírodě, přírodní prostředí, astronomie, Brazílie, Bosna a Hercegovina, světové dějiny 1789-1918, and Habsburská monarchie
- Language:
- Portuguese
- Rights:
- unknown
33. Bohemio-alemanes en Chile. Entre el olvido y la asimilación /
- Creator:
- Witker, Ivan
- Subject:
- emigrace německá, Němci čeští, Němci chilští, světové dějiny od r. 1918 do současnosti, Chile, migrace, vystěhovalectví, kolonizace, and české země 1848-1918
- Language:
- Portuguese
- Rights:
- unknown
34. Brasileiros ilegais em Portugal: uma reflexão sobre as fronteiras nacionais /
- Creator:
- Oliveira, Sergio P.
- Subject:
- imigrace, vztahy brazilsko-portugalské, hranice státní, světové dějiny od r. 1918 do současnosti, Brazílie, Portugalsko, and migrace, vystěhovalectví, kolonizace
- Language:
- Portuguese
- Rights:
- unknown
35. C4Corpus (CC BY-NC part)
- Creator:
- Gurevych, Iryna, Habernal, Ivan, and Zayed, Omnia
- Publisher:
- Technische Universität Darmstadt
- Type:
- text and corpus
- Subject:
- CommonCrawl, Creative Commons, Web corpus, and Amazon Web Services
- Language:
- Afrikaans, Arabic, Bengali, Bulgarian, Czech, Danish, German, Modern Greek (1453-), English, Estonian, Persian, Finnish, French, Hebrew, Hindi, Croatian, Hungarian, Indonesian, Italian, Japanese, Kannada, Korean, Latvian, Lithuanian, Malayalam, Macedonian, Nepali (macrolanguage), Dutch, Norwegian, Panjabi, Polish, Portuguese, Romanian, Russian, Slovak, Slovenian, Somali, Spanish, Albanian, Swahili (macrolanguage), Swedish, Tamil, Telugu, Tagalog, Thai, Turkish, Ukrainian, Undetermined, Vietnamese, and Chinese
- Description:
- A large web corpus (over 10 billion tokens) licensed under CreativeCommons license family in 50+ languages that has been extracted from CommonCrawl, the largest publicly available general Web crawl to date with about 2 billion crawled URLs.
- Rights:
- Creative Commons - Attribution-NonCommercial 4.0 International (CC BY-NC 4.0), http://creativecommons.org/licenses/by-nc/4.0/, and PUB
36. C4Corpus (CC BY-NC-ND part)
- Creator:
- Gurevych, Iryna, Habernal, Ivan, and Zayed, Omnia
- Publisher:
- Technische Universität Darmstadt
- Type:
- text and corpus
- Subject:
- CommonCrawl, Creative Commons, Web corpus, and Amazon Web Services
- Language:
- Afrikaans, Arabic, Bengali, Bulgarian, Czech, Danish, German, Modern Greek (1453-), English, Estonian, Persian, Finnish, French, Gujarati, Hebrew, Hindi, Croatian, Hungarian, Indonesian, Italian, Japanese, Kannada, Korean, Latvian, Lithuanian, Malayalam, Marathi, Macedonian, Nepali (macrolanguage), Dutch, Norwegian, Polish, Portuguese, Romanian, Russian, Slovak, Slovenian, Somali, Spanish, Albanian, Swahili (macrolanguage), Swedish, Tamil, Telugu, Tagalog, Thai, Turkish, Ukrainian, Undetermined, Urdu, Vietnamese, and Chinese
- Description:
- A large web corpus (over 10 billion tokens) licensed under CreativeCommons license family in 50+ languages that has been extracted from CommonCrawl, the largest publicly available general Web crawl to date with about 2 billion crawled URLs.
- Rights:
- Creative Commons - Attribution-NonCommercial-NoDerivatives 4.0 International (CC BY-NC-ND 4.0), http://creativecommons.org/licenses/by-nc-nd/4.0/, and PUB
37. C4Corpus (CC BY-NC-SA part)
- Creator:
- Gurevych, Iryna, Habernal, Ivan, and Zayed, Omnia
- Publisher:
- Technische Universität Darmstadt
- Type:
- text and corpus
- Subject:
- CommonCrawl, Creative Commons, Web corpus, and Amazon Web Services
- Language:
- Afrikaans, Arabic, Bengali, Bulgarian, Czech, Danish, German, Modern Greek (1453-), English, Estonian, Persian, Finnish, French, Gujarati, Hebrew, Hindi, Croatian, Hungarian, Indonesian, Italian, Japanese, Korean, Latvian, Lithuanian, Malayalam, Marathi, Macedonian, Nepali (macrolanguage), Dutch, Norwegian, Polish, Portuguese, Romanian, Russian, Slovak, Slovenian, Somali, Spanish, Albanian, Swahili (macrolanguage), Swedish, Tamil, Telugu, Tagalog, Thai, Turkish, Ukrainian, Undetermined, Urdu, Vietnamese, and Chinese
- Description:
- A large web corpus (over 10 billion tokens) licensed under CreativeCommons license family in 50+ languages that has been extracted from CommonCrawl, the largest publicly available general Web crawl to date with about 2 billion crawled URLs.
- Rights:
- Creative Commons - Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0), http://creativecommons.org/licenses/by-nc-sa/4.0/, and PUB
38. C4Corpus (CC BY-ND part)
- Creator:
- Gurevych, Iryna, Habernal, Ivan, and Zayed, Omnia
- Publisher:
- Technische Universität Darmstadt
- Type:
- text and corpus
- Subject:
- CommonCrawl, Creative Commons, Web corpus, and Amazon Web Services
- Language:
- Afrikaans, Arabic, Bengali, Bulgarian, Czech, Danish, German, Modern Greek (1453-), English, Estonian, Persian, Finnish, French, Gujarati, Hebrew, Hindi, Croatian, Hungarian, Indonesian, Italian, Japanese, Korean, Latvian, Lithuanian, Malayalam, Macedonian, Dutch, Norwegian, Polish, Portuguese, Romanian, Russian, Slovak, Slovenian, Somali, Spanish, Albanian, Swahili (macrolanguage), Swedish, Tamil, Tagalog, Thai, Turkish, Ukrainian, Undetermined, Vietnamese, and Chinese
- Description:
- A large web corpus (over 10 billion tokens) licensed under CreativeCommons license family in 50+ languages that has been extracted from CommonCrawl, the largest publicly available general Web crawl to date with about 2 billion crawled URLs.
- Rights:
- Creative Commons - Attribution-NoDerivatives 4.0 International (CC BY-ND 4.0), http://creativecommons.org/licenses/by-nc/4.0/, and PUB
39. C4Corpus (CC BY-SA part)
- Creator:
- Gurevych, Iryna, Habernal, Ivan, and Zayed, Omnia
- Publisher:
- Technische Universität Darmstadt
- Type:
- text and corpus
- Subject:
- CommonCrawl, Creative Commons, Web corpus, and Amazon Web Services
- Language:
- Afrikaans, Arabic, Bengali, Bulgarian, Czech, Danish, German, Modern Greek (1453-), English, Estonian, Persian, Finnish, French, Gujarati, Hebrew, Hindi, Croatian, Hungarian, Indonesian, Italian, Japanese, Kannada, Korean, Latvian, Lithuanian, Malayalam, Marathi, Macedonian, Nepali (macrolanguage), Dutch, Norwegian, Panjabi, Polish, Portuguese, Romanian, Russian, Slovak, Slovenian, Somali, Spanish, Albanian, Swahili (macrolanguage), Swedish, Tamil, Telugu, Tagalog, Thai, Turkish, Ukrainian, Undetermined, Urdu, Vietnamese, and Chinese
- Description:
- A large web corpus (over 10 billion tokens) licensed under CreativeCommons license family in 50+ languages that has been extracted from CommonCrawl, the largest publicly available general Web crawl to date with about 2 billion crawled URLs.
- Rights:
- Creative Commons - Attribution-ShareAlike 4.0 International (CC BY-SA 4.0), http://creativecommons.org/licenses/by-sa/4.0/, and PUB
40. C4Corpus (CC-BY part)
- Creator:
- Gurevych, Iryna, Habernal, Ivan, and Zayed, Omnia
- Publisher:
- Technische Universität Darmstadt
- Type:
- text and corpus
- Subject:
- CommonCrawl, Creative Commons, Web corpus, and Amazon Web Services
- Language:
- Afrikaans, Arabic, Bengali, Bulgarian, Czech, Danish, German, Modern Greek (1453-), English, Estonian, Persian, Finnish, French, Gujarati, Hebrew, Hindi, Croatian, Hungarian, Indonesian, Italian, Japanese, Kannada, Korean, Latvian, Lithuanian, Malayalam, Marathi, Macedonian, Nepali (macrolanguage), Dutch, Norwegian, Panjabi, Polish, Portuguese, Romanian, Russian, Slovak, Slovenian, Somali, Spanish, Albanian, Swahili (macrolanguage), Swedish, Tamil, Telugu, Tagalog, Thai, Turkish, Ukrainian, Undetermined, Urdu, Vietnamese, and Chinese
- Description:
- A large web corpus (over 10 billion tokens) licensed under CreativeCommons license family in 50+ languages that has been extracted from CommonCrawl, the largest publicly available general Web crawl to date with about 2 billion crawled URLs.
- Rights:
- Creative Commons - Attribution 4.0 International (CC BY 4.0), http://creativecommons.org/licenses/by/4.0/, and PUB
41. C4Corpus (publicdomain part)
- Creator:
- Gurevych, Iryna, Habernal, Ivan, and Zayed, Omnia
- Publisher:
- Technische Universität Darmstadt
- Type:
- text and corpus
- Subject:
- CommonCrawl, Creative Commons, Web corpus, and Amazon Web Services
- Language:
- Afrikaans, Arabic, Bulgarian, Czech, Danish, German, Modern Greek (1453-), English, Estonian, Persian, Finnish, French, Croatian, Hungarian, Indonesian, Italian, Japanese, Korean, Latvian, Lithuanian, Dutch, Norwegian, Polish, Portuguese, Russian, Slovenian, Somali, Spanish, Swahili (macrolanguage), Swedish, Tagalog, Thai, Turkish, Ukrainian, Undetermined, and Vietnamese
- Description:
- A large web corpus (over 10 billion tokens) licensed under CreativeCommons license family in 50+ languages that has been extracted from CommonCrawl, the largest publicly available general Web crawl to date with about 2 billion crawled URLs.
- Rights:
- Public Domain Mark (PD), http://creativecommons.org/publicdomain/mark/1.0/, and PUB
42. Cartas: Recordaçoes e testemunhos do vivenciado /
- Creator:
- Piccolo, Helga Iracema Landgraf,
- Subject:
- emigrace německá, vztahy německo-brazilské, korespondence, světové dějiny 1789-1918, Brazílie, Německo, and migrace, vystěhovalectví, kolonizace
- Language:
- Portuguese
- Rights:
- unknown
43. Čestmír Loukotka 12. 11. 1895, Chrášťany (República Tcheca) - 13. 04. 1966, Praga (República Tcheca). Linguista e antropólogo /
- Creator:
- Křížová, Markéta,
- Type:
- text and studie
- Subject:
- Dějiny civilizace. Kulturní dějiny, Loukotka, Čestmír,, lingvisté, antropologové, Československo 1918-1992, and dějiny vědy, umění, kultury a techniky, kulturní vztahy
- Language:
- Portuguese
- Rights:
- unknown
44. Cien años de rearme - la emigración checo-austro-húngara a Guatemala entre 1880 y 1980 /
- Creator:
- Dietrich, Wolfgang
- Subject:
- emigrace česká, emigrace hospodářská, Češi guatemalští, Guatemala, migrace, vystěhovalectví, kolonizace, české země 1848-1918, and Československo 1918-1992
- Language:
- Portuguese
- Rights:
- unknown
45. Com uma imensa alegria :
- Creator:
- Jorge, Joaquim Pires,
- Type:
- text and autobiografie
- Subject:
- Politika, Jorge, Joaquim Pires,, komunisté portugalští, antifašismus, Portugalsko, světové dějiny od r. 1918 do současnosti, odboj, odpor, antifašismus, antikomunismus, and politické dějiny, politici
- Language:
- Portuguese
- Rights:
- unknown
46. Comenius :
- Creator:
- Covello, Sergio Carlos
- Type:
- text and studie
- Subject:
- Organizace výuky a vzdělávání, Komenský, Jan Amos,, myšlení pedagogické, české země 1526-1792, and školství, pedagogika, učitelé, péče o mládež
- Language:
- Portuguese
- Rights:
- unknown
47. Comenius :
- Creator:
- Kulesza, Wojciech Andrzej
- Type:
- text and studie
- Subject:
- Organizace výuky a vzdělávání, Komenský, Jan Amos,, myšlení pedagogické, pedagogika, české země 1526-1792, and školství, pedagogika, učitelé, péče o mládež
- Language:
- Portuguese
- Rights:
- unknown
48. Comenius no Brasil :
- Creator:
- Araújo Sampaio, Bohumila de
- Type:
- text and studie
- Subject:
- Organizace výuky a vzdělávání, Komenský, Jan Amos,, teologové, filozofové, vztahy česko-brazilské, světové dějiny 1789-1918, světové dějiny od r. 1918 do současnosti, Brazílie, české země 1526-1792, and dějiny vědy, umění, kultury a techniky, kulturní vztahy
- Language:
- Portuguese
- Rights:
- unknown
49. Comenius, o fundador de Pedagogia Moderna e o seu legado para a humanidade /
- Creator:
- Pánek, Jaroslav,
- Type:
- text and studie
- Subject:
- Výchova a vzdělávání, Komenský, Jan Amos,, myšlení pedagogické, filozofové čeští, české země 1526-1792, and školství, pedagogika, učitelé, péče o mládež
- Language:
- Portuguese
- Rights:
- unknown
50. COMPARA : Portuguese - English parallel translation corpus
- Type:
- corpus
- Language:
- English and Portuguese
- Description:
- bi-directional parallel corpus based on an open-ended collection of Portuguese-English and English-Portuguese source-texts and translations. Searchable via the IMS Corpus Query Processor and the DISPARA interface
- Rights:
- Not specified
51. CoNLL 2017 and 2018 Shared Task Blind and Preprocessed Test Data
- Creator:
- Zeman, Daniel and Straka, Milan
- Publisher:
- Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
- Type:
- text and corpus
- Subject:
- tokenization, word segmentation, morphology, tagging, syntax, parsing, and universal dependencies
- Language:
- Afrikaans, Arabic, Breton, Bulgarian, Russia Buriat, Catalan, Czech, Church Slavic, Danish, German, Modern Greek (1453-), English, Estonian, Basque, Faroese, Persian, Finnish, French, Old French (842-ca. 1400), Irish, Galician, Gothic, Ancient Greek (to 1453), Hebrew, Hindi, Croatian, Upper Sorbian, Hungarian, Armenian, Indonesian, Italian, Japanese, Kazakh, Northern Kurdish, Korean, Latin, Latvian, Dutch, Norwegian, Nigerian Pidgin, Polish, Portuguese, Romanian, Russian, Slovak, Slovenian, Northern Sami, Spanish, Serbian, Swedish, Thai, Turkish, Uighur, Ukrainian, Urdu, Vietnamese, and Chinese
- Description:
- CoNLL 2017 and 2018 shared tasks: Multilingual Parsing from Raw Text to Universal Dependencies This package contains the test data in the form in which they ware presented to the participating systems: raw text files and files preprocessed by UDPipe. The metadata.json files contain lists of files to process and to output; README files in the respective folders describe the syntax of metadata.json. For full training, development and gold standard test data, see Universal Dependencies 2.0 (CoNLL 2017) Universal Dependencies 2.2 (CoNLL 2018) See the download links at http://universaldependencies.org/. For more information on the shared tasks, see http://universaldependencies.org/conll17/ http://universaldependencies.org/conll18/ Contents: conll17-ud-test-2017-05-09 ... CoNLL 2017 test data conll18-ud-test-2018-05-06 ... CoNLL 2018 test data conll18-ud-test-2018-05-06-for-conll17 ... CoNLL 2018 test data with metadata and filenames modified so that it is digestible by the 2017 systems.
- Rights:
- Licence Universal Dependencies v2.2, https://lindat.mff.cuni.cz/repository/xmlui/page/licence-UD-2.2, and PUB
52. CoNLL 2017 Shared Task System Outputs
- Creator:
- Zeman, Daniel, Potthast, Martin, Straka, Milan, Popel, Martin, Dozat, Timothy, Qi, Peng, Manning, Christopher, Shi, Tianze, Wu, Felix G., Chen, Xilun, Cheng, Yao, Björkelund, Anders, Falenska, Agnieszka, Yu, Xiang, Kuhn, Jonas, Che, Wanxiang, Guo, Jiang, Wang, Yuxuan, Zheng, Bo, Zhao, Huaipeng, Liu, Yang, Teng, Dechuan, Liu, Ting, Lim, Kyungtae, Poibeau, Thierry, Sato, Motoki, Manabe, Hitoshi, Noji, Hiroshi, Matsumoto, Yuji, Kırnap, Ömer, Önder, Berkay Furkan, Yuret, Deniz, Straková, Jana, Vania, Clara, Zhang, Xingxing, Lopez, Adam, Heinecke, Johannes, Asadullah, Munshi, Kanerva, Jenna, Luotolahti, Juhani, Ginter, Filip, Kuan, Yu, Sofroniev, Pavel, Schill, Erik, Hinrichs, Erhard, Nguyen, Dat Quoc, Dras, Mark, Johnson, Mark, Qian, Xian, Vilares, David, Gómez-Rodríguez, Carlos, Aufrant, Lauriane, Wisniewski, Guillaume, Yvon, François, Dumitrescu, Stefan Daniel, Boroş, Tiberiu, Tufiş, Dan, Das, Ayan, Zaffar, Affan, Sarkar, Sudeshna, Wang, Hao, Zhao, Hai, Zhang, Zhisong, Hornby, Ryan, Taylor, Clark, Park, Jungyeul, de Lhoneux, Miryam, Shao, Yan, Basirat, Ali, Kiperwasser, Eliyahu, Stymne, Sara, Goldberg, Yoav, Nivre, Joakim, Akkuş, Burak Kerim, Azizoglu, Heval, Cakici, Ruket, Moor, Christophe, Merlo, Paola, Henderson, James, Wang, Haozhou, Ji, Tao, Wu, Yuanbin, Lan, Man, de la Clergerie, Eric, Sagot, Benoît, Seddah, Djamé, More, Amir, Tsarfaty, Reut, Kanayama, Hiroshi, Muraoka, Masayasu, Yoshikawa, Katsumasa, Garcia, Marcos, and Gamallo, Pablo
- Publisher:
- Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
- Type:
- text and corpus
- Subject:
- dependency parser and parsebank
- Language:
- Arabic, Bulgarian, Russia Buriat, Czech, Catalan, Church Slavic, Danish, German, Modern Greek (1453-), English, Spanish, Estonian, Basque, Persian, Finnish, French, Irish, Galician, Gothic, Ancient Greek (to 1453), Hebrew, Hindi, Croatian, Upper Sorbian, Hungarian, Indonesian, Italian, Japanese, Kazakh, Northern Kurdish, Korean, Latin, Latvian, Dutch, Norwegian, Polish, Portuguese, Romanian, Russian, Slovak, Slovenian, Northern Sami, Swedish, Turkish, Uighur, Ukrainian, Urdu, Vietnamese, and Chinese
- Description:
- This package contains the system outputs from the CoNLL 2017 Shared Task in Multilingual Parsing from Raw Text to Universal Dependencies.
- Rights:
- Licence Universal Dependencies v2.0, https://lindat.mff.cuni.cz/repository/xmlui/page/licence-UD-2.0, and PUB
53. CoNLL 2018 Shared Task System Outputs
- Creator:
- Zeman, Daniel, Potthast, Martin, Duthoo, Elie, Mesnard, Olivier, Rybak, Piotr, Wróblewska, Alina, Che, Wanxiang, Liu, Yijia, Wang, Yuxuan, Zheng, Bo, Liu, Ting, Li, Zuchao, He, Shexia, Zhang, Zhuosheng, Zhao, Hai, Wu, Yingting, Tong, Jia-Jun, Nguyen, Dat Quoc, Verspoor, Karin, Wan, Hui, Naseem, Tahira, Lee, Young-Suk, Castelli, Vittorio, Ballesteros, Miguel, Hershcovich, Daniel, Abend, Omri, Rappoport, Ari, Smith, Aaron, Bohnet, Bernd, de Lhoneux, Miryam, Nivre, Joakim, Shao, Yan, Stymne, Sara, Kırnap, Ömer, Dayanık, Erenay, Yuret, Deniz, Kanerva, Jenna, Ginter, Filip, Miekka, Niko, Leino, Akseli, Salakoski, Tapio, Lim, KyungTae, Park, Cheoneum, Lee, Changki, Poibeau, Thierry, Bhat, Riyaz Ahmad, Bhat, Irshad, Bangalore, Srinivas, Qi, Peng, Dozat, Timothy, Zhang, Yuhao, Manning, Christopher, Boroș, Tiberiu, Dumitrescu, Stefan Daniel, Burtica, Ruxandra, Arakelyan, Gor, Hambardzumyan, Karen, Khachatrian, Hrant, Rosa, Rudolf, Mareček, David, Straka, Milan, Seker, Amit, More, Amir, Tsarfaty, Reut, Önder, Berkay Furkan, Gümeli, Can, Jawahar, Ganesh, Muller, Benjamin, Fethi, Amal, Martin, Louis, Villemonte de la Clergerie, Eric, Sagot, Benoît, Seddah, Djamé, Özateş, Şaziye Betül, Özgür, Arzucan, Gungor, Tunga, Öztürk, Balkız, Ji, Tao, Liu, Yufang, Wang, Yijun, Wu, Yuanbin, Lan, Man, Chen, Danlu, Lin, Mengxiao, Hu, Zhifeng, and Qiu, Xipeng
- Publisher:
- Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
- Type:
- text and corpus
- Subject:
- parsed data, conllu, and universal dependencies
- Language:
- Afrikaans, Arabic, Breton, Bulgarian, Russia Buriat, Catalan, Czech, Church Slavic, Danish, German, Modern Greek (1453-), English, Estonian, Basque, Faroese, Persian, Finnish, French, Old French (842-ca. 1400), Irish, Galician, Gothic, Ancient Greek (to 1453), Hebrew, Hindi, Croatian, Upper Sorbian, Hungarian, Armenian, Indonesian, Italian, Japanese, Kazakh, Northern Kurdish, Korean, Latin, Latvian, Dutch, Norwegian, Nigerian Pidgin, Polish, Portuguese, Romanian, Russian, Slovak, Slovenian, Northern Sami, Spanish, Serbian, Swedish, Thai, Turkish, Uighur, Ukrainian, Urdu, Vietnamese, and Chinese
- Description:
- Test data parsed by systems submitted to the CoNLL 2018 UD parsing shared task.
- Rights:
- Licence Universal Dependencies v2.2, https://lindat.mff.cuni.cz/repository/xmlui/page/licence-UD-2.2, and PUB
54. CORP-ORAL Spontaneous Speech Corpus
- Publisher:
- Instituto de Linguística Teórica e Computacional
- Type:
- corpus
- Language:
- Portuguese
- Description:
- The aim of the CORP-ORAL project is to build a corpus of spontaneous European Portuguese speech available for the training of speech synthesis and recognition systems as well as phonetic, phonological, lexical, morphological and syntactic studies. The corpus contains the recording of 60 hours of conversations between two European Portuguese speakers per conversation (at a time). The entire corpus will be completed with orthographic transcription and the prosodic marking of speech breaks/boundaries as well as phonetic transcription of a selection of chunks. CORP-ORAL is built from scratch with the explicit goal of becoming entirely available on the internet to the scientific community and the public in general.
- Rights:
- Not specified
55. Corpus CLUVI
- Publisher:
- TALG Research Group (University of Vigo)
- Type:
- corpus
- Language:
- Basque, Catalan, English, French, Galician, German, Portuguese, and Spanish
- Description:
- Parallel corpus, 22 million words
- Rights:
- Not specified
56. CorpusExplorer
- Creator:
- Rüdiger, Jan Oliver
- Publisher:
- Jan Oliver Rüdiger
- Type:
- tool and toolService
- Subject:
- Corpus Linguisitics, NLP, conll, tei, XML, nlp, Natural Language Processing, linguistics, Linguistics, Computational Linguistics, corpus processing, tagger, POS tagger, lemmatization, text cleaning, CommonCrawl, epub, JSON, Twitter, Pandoc, Wikipedia, digital data, DTA, DSpin, MySQL, ElasticSearch, TextGrid, text corpora, TigerXML, and WeblichtXML
- Language:
- German, English, French, Italian, Dutch, Spanish, Polish, Arabic, Chinese, and Portuguese
- Description:
- Software for corpus linguists and text/data mining enthusiasts. The CorpusExplorer combines over 45 interactive visualizations under a user-friendly interface. Routine tasks such as text acquisition, cleaning or tagging are completely automated. The simple interface supports the use in university teaching and leads users/students to fast and substantial results. The CorpusExplorer is open for many standards (XML, CSV, JSON, R, etc.) and also offers its own software development kit (SDK). Source code available at https://github.com/notesjor/corpusexplorer2.0
- Rights:
- Not specified
57. Crise e queda dos governos PS.
- Creator:
- Cunhal, Álvaro,
- Type:
- text and monografie
- Subject:
- Politické strany a hnutí, Dějiny států a území na Pyrenejském poloostrově, Cunhal, Álvaro,, dějiny hospodářské, dějiny sociální, Portugalsko, politické dějiny, politici, světové dějiny od r. 1945 do současnosti, and hospodářské dějiny
- Language:
- Portuguese
- Rights:
- unknown
58. DaMuEL 1.0: A Large Multilingual Dataset for Entity Linking
- Creator:
- Kubeša, David and Straka, Milan
- Publisher:
- Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
- Type:
- text and corpus
- Subject:
- entity linking, NEL, NER, dataset, and knowledge base
- Language:
- Afrikaans, Arabic, Armenian, Basque, Belarusian, Bulgarian, Catalan, Chinese, Croatian, Czech, Danish, Dutch, English, Estonian, Finnish, French, Galician, German, Hebrew, Hindi, Hungarian, Indonesian, Irish, Italian, Japanese, Korean, Latin, Latvian, Lithuanian, Maltese, Marathi, Modern Greek (1453-), Northern Sami, Norwegian Nynorsk, Persian, Polish, Portuguese, Romanian, Russian, Scottish Gaelic, Serbian, Slovak, Slovenian, Spanish, Swedish, Tamil, Telugu, Uighur, Ukrainian, Urdu, Vietnamese, and Wolof
- Description:
- We present DaMuEL, a large Multilingual Dataset for Entity Linking containing data in 53 languages. DaMuEL consists of two components: a knowledge base that contains language-agnostic information about entities, including their claims from Wikidata and named entity types (PER, ORG, LOC, EVENT, BRAND, WORK_OF_ART, MANUFACTURED); and Wikipedia texts with entity mentions linked to the knowledge base, along with language-specific text from Wikidata such as labels, aliases, and descriptions, stored separately for each language. The Wikidata QID is used as a persistent, language-agnostic identifier, enabling the combination of the knowledge base with language-specific texts and information for each entity. Wikipedia documents deliberately annotate only a single mention for every entity present; we further automatically detect all mentions of named entities linked from each document. The dataset contains 27.9M named entities in the knowledge base and 12.3G tokens from Wikipedia texts. The dataset is published under the CC BY-SA licence.
- Rights:
- Creative Commons - Attribution-ShareAlike 4.0 International (CC BY-SA 4.0), http://creativecommons.org/licenses/by-sa/4.0/, and PUB
59. De Hislampa a Porthum. De la delimitación del corpus del humanismo en Portugal /
- Creator:
- Sánchez Tarrio, Ana María
- Type:
- studie
- Subject:
- Filozofie, dějiny portugalské, humanismus, historiografie, Portugalsko, světové dějiny 1492-1648, světové dějiny středověku (do r. 1492), dějiny ideí, ideologie, and historiografie, vědecké projekty
- Language:
- Portuguese
- Rights:
- unknown
60. Deep Universal Dependencies 2.4
- Creator:
- Zeman, Daniel and Droganova, Kira
- Publisher:
- Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
- Type:
- text and corpus
- Subject:
- semantic dependency and universal dependencies
- Language:
- Afrikaans, Assyrian Neo-Aramaic, Akkadian, Amharic, Arabic, Belarusian, Breton, Bulgarian, Russia Buriat, Catalan, Czech, Church Slavic, Mandarin Chinese, Coptic, Welsh, Danish, German, Modern Greek (1453-), English, Estonian, Basque, Faroese, Finnish, French, Irish, Gothic, Ancient Greek (to 1453), Mbyá Guaraní, Hebrew, Hindi, Croatian, Upper Sorbian, Hungarian, Armenian, Indonesian, Italian, Japanese, Kazakh, Northern Kurdish, Korean, Komi-Zyrian, Karelian, Latin, Latvian, Lithuanian, Literary Chinese, Marathi, Erzya, Dutch, Norwegian, Old Russian, Nigerian Pidgin, Polish, Portuguese, Romanian, Russian, Sanskrit, Slovak, Slovenian, Northern Sami, Spanish, Serbian, Swedish, Tamil, Tagalog, Turkish, Ukrainian, Urdu, Vietnamese, Warlpiri, Wolof, Yoruba, and Galician
- Description:
- Deep Universal Dependencies is a collection of treebanks derived semi-automatically from Universal Dependencies (http://hdl.handle.net/11234/1-2988). It contains additional deep-syntactic and semantic annotations. Version of Deep UD corresponds to the version of UD it is based on. Note however that some UD treebanks have been omitted from Deep UD.
- Rights:
- Licence Universal Dependencies v2.4, https://lindat.mff.cuni.cz/repository/xmlui/page/licence-UD-2.4, and PUB
61. Deep Universal Dependencies 2.5
- Creator:
- Zeman, Daniel and Droganova, Kira
- Publisher:
- Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
- Type:
- text and corpus
- Subject:
- semantic dependency and universal dependencies
- Language:
- Afrikaans, Assyrian Neo-Aramaic, Akkadian, Amharic, Arabic, Belarusian, Breton, Bulgarian, Russia Buriat, Catalan, Czech, Church Slavic, Mandarin Chinese, Coptic, Welsh, Danish, German, Modern Greek (1453-), English, Estonian, Basque, Faroese, Finnish, French, Irish, Gothic, Ancient Greek (to 1453), Mbyá Guaraní, Hebrew, Hindi, Croatian, Upper Sorbian, Hungarian, Armenian, Indonesian, Italian, Japanese, Kazakh, Northern Kurdish, Korean, Komi-Zyrian, Karelian, Latin, Latvian, Lithuanian, Literary Chinese, Marathi, Erzya, Dutch, Norwegian, Old Russian, Nigerian Pidgin, Polish, Portuguese, Romanian, Russian, Sanskrit, Slovak, Slovenian, Northern Sami, Spanish, Serbian, Swedish, Tamil, Tagalog, Turkish, Ukrainian, Urdu, Vietnamese, Warlpiri, Wolof, Yoruba, Galician, Bhojpuri, Komi-Permyak, Livvi, Moksha, Scottish Gaelic, and Skolt Sami
- Description:
- Deep Universal Dependencies is a collection of treebanks derived semi-automatically from Universal Dependencies (http://hdl.handle.net/11234/1-3105). It contains additional deep-syntactic and semantic annotations. Version of Deep UD corresponds to the version of UD it is based on. Note however that some UD treebanks have been omitted from Deep UD.
- Rights:
- Licence Universal Dependencies v2.5, https://lindat.mff.cuni.cz/repository/xmlui/page/licence-UD-2.5, and PUB
62. Deep Universal Dependencies 2.6
- Creator:
- Zeman, Daniel and Droganova, Kira
- Publisher:
- Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
- Type:
- text and corpus
- Subject:
- semantic dependency and universal dependencies
- Language:
- Afrikaans, Assyrian Neo-Aramaic, Akkadian, Amharic, Arabic, Belarusian, Breton, Bulgarian, Russia Buriat, Catalan, Czech, Church Slavic, Mandarin Chinese, Coptic, Welsh, Danish, German, Modern Greek (1453-), English, Estonian, Basque, Faroese, Finnish, French, Irish, Gothic, Ancient Greek (to 1453), Mbyá Guaraní, Hebrew, Hindi, Croatian, Upper Sorbian, Hungarian, Armenian, Indonesian, Italian, Japanese, Kazakh, Northern Kurdish, Korean, Komi-Zyrian, Karelian, Latin, Latvian, Lithuanian, Literary Chinese, Marathi, Erzya, Dutch, Norwegian, Old Russian, Nigerian Pidgin, Polish, Portuguese, Romanian, Russian, Sanskrit, Slovak, Slovenian, Northern Sami, Spanish, Serbian, Swedish, Tamil, Tagalog, Turkish, Ukrainian, Urdu, Vietnamese, Warlpiri, Wolof, Yoruba, Galician, Bhojpuri, Komi-Permyak, Livvi, Moksha, Scottish Gaelic, Skolt Sami, Icelandic, Albanian, and Persian
- Description:
- Deep Universal Dependencies is a collection of treebanks derived semi-automatically from Universal Dependencies (http://hdl.handle.net/11234/1-3226). It contains additional deep-syntactic and semantic annotations. Version of Deep UD corresponds to the version of UD it is based on. Note however that some UD treebanks have been omitted from Deep UD.
- Rights:
- Licence Universal Dependencies v2.6, https://lindat.mff.cuni.cz/repository/xmlui/page/license-ud-2.6, and PUB
63. Deep Universal Dependencies 2.7
- Creator:
- Zeman, Daniel and Droganova, Kira
- Publisher:
- Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
- Type:
- text and corpus
- Subject:
- semantic dependency and universal dependencies
- Language:
- Afrikaans, Assyrian Neo-Aramaic, Akkadian, Amharic, Arabic, Belarusian, Breton, Bulgarian, Russia Buriat, Catalan, Czech, Church Slavic, Mandarin Chinese, Coptic, Welsh, Danish, German, Modern Greek (1453-), English, Estonian, Basque, Faroese, Finnish, French, Irish, Gothic, Ancient Greek (to 1453), Mbyá Guaraní, Hebrew, Hindi, Croatian, Upper Sorbian, Hungarian, Armenian, Indonesian, Italian, Japanese, Kazakh, Northern Kurdish, Korean, Komi-Zyrian, Karelian, Latin, Latvian, Lithuanian, Literary Chinese, Marathi, Erzya, Dutch, Norwegian, Old Russian, Nigerian Pidgin, Polish, Portuguese, Romanian, Russian, Sanskrit, Slovak, Slovenian, Northern Sami, Spanish, Serbian, Swedish, Tamil, Tagalog, Turkish, Ukrainian, Urdu, Vietnamese, Warlpiri, Wolof, Yoruba, Galician, Bhojpuri, Komi-Permyak, Livvi, Moksha, Scottish Gaelic, Skolt Sami, Icelandic, Albanian, Persian, Akuntsu, Apurinã, Khunsari, Manx, Mundurukú, Nayini, Soi, South Levantine Arabic, and Tupinambá
- Description:
- Deep Universal Dependencies is a collection of treebanks derived semi-automatically from Universal Dependencies (http://hdl.handle.net/11234/1-3424). It contains additional deep-syntactic and semantic annotations. Version of Deep UD corresponds to the version of UD it is based on. Note however that some UD treebanks have been omitted from Deep UD.
- Rights:
- Licence Universal Dependencies v2.7, https://lindat.mff.cuni.cz/repository/xmlui/page/license-ud-2.7, and PUB
64. Deep Universal Dependencies 2.8
- Creator:
- Zeman, Daniel and Droganova, Kira
- Publisher:
- Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
- Type:
- text and corpus
- Subject:
- semantic dependency and universal dependencies
- Language:
- Afrikaans, Assyrian Neo-Aramaic, Akkadian, Amharic, Arabic, Belarusian, Breton, Bulgarian, Russia Buriat, Catalan, Czech, Church Slavic, Mandarin Chinese, Coptic, Welsh, Danish, German, Modern Greek (1453-), English, Estonian, Basque, Faroese, Finnish, French, Irish, Gothic, Ancient Greek (to 1453), Mbyá Guaraní, Hebrew, Hindi, Croatian, Upper Sorbian, Hungarian, Armenian, Indonesian, Italian, Japanese, Kazakh, Northern Kurdish, Korean, Komi-Zyrian, Karelian, Latin, Latvian, Lithuanian, Literary Chinese, Marathi, Erzya, Dutch, Norwegian, Old Russian, Nigerian Pidgin, Polish, Portuguese, Romanian, Russian, Sanskrit, Slovak, Slovenian, Northern Sami, Spanish, Serbian, Swedish, Tamil, Tagalog, Turkish, Ukrainian, Urdu, Vietnamese, Warlpiri, Wolof, Yoruba, Galician, Bhojpuri, Komi-Permyak, Livvi, Moksha, Scottish Gaelic, Skolt Sami, Icelandic, Albanian, Persian, Akuntsu, Apurinã, Khunsari, Manx, Mundurukú, Nayini, Soi, South Levantine Arabic, Tupinambá, Beja, Western Frisian, Urubú-Kaapor, Kangri, K'iche', Low German, Makuráp, Western Armenian, and Central Siberian Yupik
- Description:
- Deep Universal Dependencies is a collection of treebanks derived semi-automatically from Universal Dependencies (http://hdl.handle.net/11234/1-3687). It contains additional deep-syntactic and semantic annotations. Version of Deep UD corresponds to the version of UD it is based on. Note however that some UD treebanks have been omitted from Deep UD.
- Rights:
- Licence Universal Dependencies v2.8, https://lindat.mff.cuni.cz/repository/xmlui/page/license-ud-2.8, and PUB
65. Deltacorpus
- Creator:
- Mareček, David, Yu, Zhiwei, Zeman, Daniel, and Žabokrtský, Zdeněk
- Publisher:
- Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
- Type:
- text and corpus
- Subject:
- part of speech, tagging, semi-supervised, and cross-language
- Language:
- Belarusian, Bosnian, Bulgarian, Czech, Serbo-Croatian, Croatian, Upper Sorbian, Macedonian, Polish, Russian, Slovak, Slovenian, Serbian, Ukrainian, Latvian, Lithuanian, Afrikaans, Danish, German, English, Faroese, Western Frisian, Swiss German, Icelandic, Limburgan, Luxembourgish, Low German, Dutch, Norwegian Nynorsk, Norwegian, Scots, Swedish, Yiddish, Aragonese, Asturian, Catalan, French, Galician, Haitian, Italian, Latin, Lombard, Neapolitan, Piemontese, Portuguese, Romanian, Spanish, Venetian, Walloon, Breton, Welsh, Scottish Gaelic, Irish, Modern Greek (1453-), Armenian, Albanian, Dimli (individual language), Persian, Gilaki, Kurdish, Tajik, Bengali, Bishnupriya, Gujarati, Fiji Hindi, Hindi, Marathi, Nepali (macrolanguage), Urdu, Amharic, Arabic, Egyptian Arabic, Hebrew, Estonian, Finnish, Hungarian, Basque, Georgian, Chuvash, Azerbaijani, Turkish, Uzbek, Kazakh, Tatar, Yakut, Korean, Mongolian, Telugu, Kannada, Malayalam, Tamil, Newari, Vietnamese, Indonesian, Javanese, Malagasy, Maori, Malay (macrolanguage), Pampanga, Sundanese, Tagalog, Waray (Philippines), Swahili (macrolanguage), Esperanto, Ido, Interlingua (International Auxiliary Language Association), and Volapük
- Description:
- Texts in 107 languages from the W2C corpus (http://hdl.handle.net/11858/00-097C-0000-0022-6133-9), first 1,000,000 tokens per language, tagged by the delexicalized tagger described in Yu et al. (2016, LREC, Portorož, Slovenia).
- Rights:
- Creative Commons - Attribution-ShareAlike 4.0 International (CC BY-SA 4.0), http://creativecommons.org/licenses/by-sa/4.0/, and PUB
66. Deltacorpus 1.1
- Creator:
- Mareček, David, Yu, Zhiwei, Zeman, Daniel, and Žabokrtský, Zdeněk
- Publisher:
- Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
- Type:
- text and corpus
- Subject:
- part of speech, tagging, semi-supervised, and cross-language
- Language:
- Belarusian, Bosnian, Bulgarian, Czech, Serbo-Croatian, Croatian, Upper Sorbian, Macedonian, Polish, Russian, Slovak, Slovenian, Serbian, Ukrainian, Latvian, Lithuanian, Afrikaans, Danish, German, English, Faroese, Western Frisian, Swiss German, Icelandic, Limburgan, Luxembourgish, Low German, Dutch, Norwegian Nynorsk, Norwegian, Scots, Swedish, Yiddish, Aragonese, Asturian, Catalan, French, Galician, Haitian, Italian, Latin, Lombard, Neapolitan, Piemontese, Portuguese, Romanian, Spanish, Venetian, Walloon, Breton, Welsh, Scottish Gaelic, Irish, Modern Greek (1453-), Armenian, Albanian, Dimli (individual language), Persian, Gilaki, Kurdish, Tajik, Bengali, Bishnupriya, Gujarati, Fiji Hindi, Hindi, Marathi, Nepali (macrolanguage), Urdu, Amharic, Arabic, Egyptian Arabic, Hebrew, Estonian, Finnish, Hungarian, Basque, Georgian, Chuvash, Azerbaijani, Turkish, Uzbek, Kazakh, Tatar, Yakut, Korean, Mongolian, Telugu, Kannada, Malayalam, Tamil, Newari, Vietnamese, Indonesian, Javanese, Malagasy, Maori, Malay (macrolanguage), Pampanga, Sundanese, Tagalog, Waray (Philippines), Swahili (macrolanguage), Esperanto, Ido, Interlingua (International Auxiliary Language Association), and Volapük
- Description:
- Texts in 107 languages from the W2C corpus (http://hdl.handle.net/11858/00-097C-0000-0022-6133-9), first 1,000,000 tokens per language, tagged by the delexicalized tagger described in Yu et al. (2016, LREC, Portorož, Slovenia). Changes in version 1.1: 1. Universal Dependencies tagset instead of the older and smaller Google Universal POS tagset. 2. SVM classifier trained on Universal Dependencies 1.2 instead of HamleDT 2.0. 3. Balto-Slavic languages, Germanic languages and Romance languages were tagged by classifier trained only on the respective group of languages. Other languages were tagged by a classifier trained on all available languages. The "c7" combination from version 1.0 is no longer used.
- Rights:
- Creative Commons - Attribution-ShareAlike 4.0 International (CC BY-SA 4.0), http://creativecommons.org/licenses/by-sa/4.0/, and PUB
67. Der Subjektivismus der Husserlschen und die Forderung einer asubjektiven Phänomenologie
- Creator:
- Jan Patočka
- Publisher:
- Sborník prací filosofické fakulty brněnské university 19–20 (1971), Řada uměnovědná (F), č. 14–15, str. 11–26. Stať. něm.
- Type:
- Text
- Subject:
- 1970/6, 1971, 1988/30, 1991/2, 2004/10, 2009/1, cs, es, fr, pt, SS-7/Fen-II, and Stať. něm.
- Language:
- Czech, French, Portuguese, and Spanish
- Rights:
- open access and Rights holder: Archiv Jana Patočky, z.s.
68. Der Subjektivismus der Husserlschen und die Möglichkeit einer asubjektiven Phänomenologie
- Creator:
- Jan Patočka
- Publisher:
- Philosophische Perspektiven, ein Jahrbuch, sv. 2, ed. R. Berlinger a E. Fink, Frankfurt/M. (v. Klostermann) 1970, str. 317–334. Stať. něm.
- Type:
- Text
- Subject:
- 1970, 1988/30, 1991/2, 2004/10, 2009/1, cn, cs, de, es, fr, hu, pt, SS-7/Fen-II, and Stať. něm.
- Language:
- German, Czech, French, Hungarian, Portuguese, and Spanish
- Rights:
- open access and Rights holder: Archiv Jana Patočky, z.s.
69. Do 25 de Novembro às Eleiçoes para a Assembleia da República /
- Creator:
- Cunhal, Álvaro,
- Type:
- text and projevy
- Subject:
- Dějiny států a území na Pyrenejském poloostrově, Cunhal, Álvaro,, politici portugalští, projevy politické, strany politické, strany politické komunistické, and Portugalsko
- Language:
- Portuguese
- Rights:
- unknown
70. Do Moldava ao Douro :
- Creator:
- Burmester, Elisabeth,
- Type:
- text and paměti
- Subject:
- Dějiny Česka a Slovenska, Ringhofferové (rod), podnikatelé průmysloví, rody a rodiny, Československo 1918-1992, šlechta, buržoazie, měšťanstvo, podnikatelé, and české země 1792-1918
- Language:
- Portuguese
- Rights:
- unknown
71. DOESTE v0.5
- Creator:
- Martins, Mário, Janssen, Maarten, Santos, Taiza, Lopes, Raquel, and Souza, Thiago
- Publisher:
- Federal Rural University of the Semiarid Region
- Type:
- text and corpus
- Subject:
- Developmental corpus, Writing development, and School-age language development
- Language:
- Portuguese
- Description:
- DOESTE v0.5 is a set of developmental corpora of texts written by Brazilian and Portuguese school-age children and adolescents. It is a work in progress. The texts written by monolingual children and adolescents in European Portuguese were collected between September 2011 and January 2012, from different public schools in Lisbon (Portugal). It is composed of 244 narrative (n=122) and argumentative (n=122) texts. The subjects (51% female and 49% male) are students enroled in the 5th grade (n=52; mean age=10.19), in the 7th grade (n=92; mean age=12.33) and in the 10th grade (n=100; mean age=15.16) from the Portuguese basic schooling. The subcorpus of Portuguese texts is fully tokenized and morphologically annotated, in addition to presenting the sentence occurrences. The texts written by monolingual children and adolescents in Brazilian Portuguese have been collected since 2017, from different public schools in three cities in Rio Grande do Norte (Brazil). It is currently composed of narrative (n=225) and argumentative (n=225) texts. The subjects (53% female and 47% male) are students enroled in the 5th grade (n=68; mean age=11.13), in the 9th grade (n=82; mean age=15.32) and in the 12th grade (n=224; mean age=17.96) from the Brazilian basic schooling. The subcorpora of Brazilian texts is still in the compilation, but a large part is already searchable, being tokenized and morphologically annotated. The Brazilian subcorpus also presents itself with the original transcripts, along original images. Portuguese and Brazilian texts were collected from similar tasks: Narrative-based task: Tell a remarkable story (real or imagined) that you and your best friend lived during the last school vacation. Argumentative based-task: Do you think social networks (Facebook, Twitter, Google+, Windows Live Space, etc.) are important today? Write a text to be published on your school's blog where you express your opinion on social networks. In this text, you must say whether you are for or against the existence of social networks. Don't forget to justify your opinion! The next version of DOESTE intends to present semantic annotations and clause and t-unit segmentation. DOESTE v0.5 is developed and maintained by the Educational Linguistics Research Group (LEd), based at the Federal Rural University of the Semiarid Region (UFERSA). DOESTE v0.5 by Mário Martins et al. is licensed under CC BY-NC-ND 4.0.
- Rights:
- Creative Commons - Attribution-NonCommercial-NoDerivatives 4.0 International (CC BY-NC-ND 4.0), http://creativecommons.org/licenses/by-nc-nd/4.0/, and PUB
72. Dois movimentos integralistas em Portugal e no Brasil /
- Creator:
- Vrbata, Aleš,
- Subject:
- integralismus, ideologie, hnutí politická, politické dějiny, politici, světové dějiny 1789-1918, světové dějiny 1918-1945, Brazílie, and Portugalsko
- Language:
- Portuguese
- Rights:
- unknown
73. Don Juan Szychowski, Un pionero polaco, un nuevo modelo de trabajo para Argentina /
- Creator:
- Kojrowicz, Claudia Stefanetti
- Subject:
- Szychowski, Juan,, emigrace polská, Poláci argentinští, vztahy argentinsko-polské, světové dějiny od r. 1918 do současnosti, Argentina, Polsko, and migrace, vystěhovalectví, kolonizace
- Language:
- Portuguese
- Rights:
- unknown
74. ECI Multilingual Text
- Publisher:
- HCRC
- Type:
- corpus
- Language:
- Portuguese
- Description:
- Parallel corpus
- Rights:
- Not specified
75. Economia portuguesa :
- Creator:
- Pimenta, Carlos
- Type:
- text and monografie
- Subject:
- Dějiny států a území na Pyrenejském poloostrově, ekonomové, dějiny hospodářské, Portugalsko, hospodářské dějiny, and světové dějiny od r. 1945 do současnosti
- Language:
- Portuguese
- Rights:
- unknown
76. Eduard Ingriš 11. 02. 1905, Zlonice (República Tcheca) - 12. 01. 1991, Reno (EUA). Compositor, maestro, explorador, documentarista, cineasta e fotógrafo /
- Creator:
- Náplava, Miroslav,
- Type:
- text and studie
- Subject:
- Dějiny civilizace. Kulturní dějiny, Ingriš, Eduard,, hudebníci, skladatelé, cestovatelé, fotografové, Československo 1918-1992, světové dějiny od r. 1918 do současnosti, and dějiny vědy, umění, kultury a techniky, kulturní vztahy
- Language:
- Portuguese
- Rights:
- unknown
77. El interés por el Brasil en la literatura checa y eslovaca entre las dos guerras mundiales /
- Creator:
- Binková, Simona,
- Subject:
- vztahy česko-brazilské, vztahy slovensko-brazilské, dějiny literatury, literatura slovenská, pohled na druhé, přehledná zpracování (tematicky), zahraniční politika, mezinárodní vztahy, světové dějiny od r. 1918 do současnosti, Brazílie, Československo 1918-1945, and literatura, spisovatelé
- Language:
- Portuguese
- Rights:
- unknown
78. Emigração alemã para o sul do Brasil no século XIX: propaganda e expectativas. Experiências de imigrantes no Rio Grande do Sul /
- Creator:
- Piccolo, Helga Iracema Landgraf,
- Subject:
- emigrace, vystěhovalectví, Němci, propaganda, světové dějiny 1789-1918, Německo, Brazílie, and migrace, vystěhovalectví, kolonizace
- Language:
- Portuguese
- Rights:
- unknown
79. Enrique Stanko Vráz 1860 - 20. 02. 1932, Praga (Réepública Tcheca). Viajante e fotógrafo tcheco, autor do livro Através de América Equatorial /
- Creator:
- Kázecký, Stanislav,
- Type:
- text and studie
- Subject:
- Dějiny civilizace. Kulturní dějiny, Vráz, Enrique Stanko,, cestovatelé, fotografové, etnografie, cestopisy, vztahy česko-jihoamerické, české země 1848-1918, světové dějiny 1789-1918, dějiny vědy, umění, kultury a techniky, kulturní vztahy, and Československo 1918-1938
- Language:
- Portuguese
- Rights:
- unknown
80. Entre duas eleições /
- Creator:
- Cunhal, Álvaro,
- Type:
- text and spisy
- Subject:
- Dějiny států a území na Pyrenejském poloostrově, Cunhal, Álvaro,, politici portugalští, projevy politické, strany politické, strany politické komunistické, Portugalsko, světové dějiny od r. 1945 do současnosti, and politické dějiny, politici
- Language:
- Portuguese
- Rights:
- unknown
81. Europarl QTLeap WSD/NED corpus
- Creator:
- Agirre, Eneko, Branco, António, Popel, Martin, and Simov, Kiril
- Publisher:
- University of the Basque Country, UPV/EHU, Faculty of Science, Univeristy of Lisbon, FCUL, Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL), and Bulgarian Academy of Sciences, IICT-BAS
- Type:
- text and corpus
- Subject:
- annotated corpus and multilingual
- Language:
- Basque, Bulgarian, Czech, English, Portuguese, and Spanish
- Description:
- This corpora is part of Deliverable 5.5 of the European Commission project QTLeap FP7-ICT-2013.4.1-610516 (http://qtleap.eu). The texts are sentences from the Europarl parallel corpus (Koehn, 2005). We selected the monolingual sentences from parallel corpora for the following pairs: Bulgarian-English, Czech-English, Portuguese-English and Spanish-English. The English corpus is comprised by the English side of the Spanish-English corpus. Basque is not in Europarl. In addition, it contains the Basque and English sides of the GNOME corpus. The texts have been automatically annotated with NLP tools, including Word Sense Disambiguation, Named Entity Disambiguation and Coreference resolution. Please check deliverable D5.6 in http://qtleap.eu/deliverables for more information.
- Rights:
- Creative Commons - Attribution 4.0 International (CC BY 4.0), http://creativecommons.org/licenses/by/4.0/, and PUB
82. Europarl: European Parliament Proceedings Parallel Corpus 1996-2003
- Type:
- corpus
- Language:
- Portuguese
- Description:
- Parallel corpus
- Rights:
- Not specified
83. European expansion 1494-1519 :
- Type:
- text and edice
- Subject:
- Geografie jako věda. Výzkum. Cestování, cesty objevné, rukopisy, pohled na druhé, cesty námořní, historická geografie, kartografie a topografie, světové dějiny 1492-1648, and dějiny vědy, umění, kultury a techniky, kulturní vztahy
- Language:
- English and Portuguese
- Description:
- Mapy na předsádkách and Frontispis
- Rights:
- unknown
84. Europeos en la Araucanía. Los colonos del Budi a principios del siglo 20 /
- Creator:
- Chávez, Jaine Flores
- Subject:
- vztahy Evropané-Indiáni, imigrace, světové dějiny 1789-1918, světové dějiny od r. 1918 do současnosti, and migrace, vystěhovalectví, kolonizace
- Language:
- Portuguese
- Rights:
- unknown
85. Faleceu o professor Jaromír Tláskal /
- Creator:
- Jindrová, Jaroslava,
- Type:
- nekrology
- Subject:
- Filologie, Tláskal, Jaromír,, filologové, romanisté, bibliografie personální, historici (jubilea, nekrology apod.), and personální bibliografie
- Language:
- Portuguese
- Rights:
- unknown
86. Formação do PCB 1922-1928 :
- Creator:
- Pereira, Astrojildo,
- Type:
- text, prameny, and dokumenty
- Subject:
- Politické strany a hnutí, strany politické komunistické, Brazílie, světové dějiny 1918-1945, and politické strany a hnutí, volby
- Language:
- Portuguese
- Rights:
- unknown
87. František Čech-Vyšata 14. 02. 1881, Chlumany (República Tcheca) - 03. 10. 1942, Sobíňov (República Tcheca). Escritor e viajante /
- Creator:
- Tkadlečková, Věra,
- Type:
- text and studie
- Subject:
- Dějiny civilizace. Kulturní dějiny, Čech-Vyšata, František,, cestovatelé, spisovatelé, cestopisy, vztahy česko-jihoamerické, české země 1848-1918, Československo 1918-1945, světové dějiny 1789-1918, světové dějiny 1918-1945, and dějiny vědy, umění, kultury a techniky, kulturní vztahy
- Language:
- Portuguese
- Rights:
- unknown
88. FreeLing
- Publisher:
- Centro de Tecnologías y Aplicaciones del Lenguaje y del Habla (TALP)
- Type:
- toolService
- Language:
- Catalan, English, Galician, Italian, Portuguese, and Welsh
- Description:
- Open source language analysis tool suite: tokenizer, stemmer/lemmatizer, named entity recognizer, chunker/segmenter, morphosyntactic tagger, syntactic tagger, corpus processer, morphological tagger, semantic tagger, analyzer, Word Sense Disambiguator.
- Rights:
- Not specified
89. HamleDT 2.0
- Creator:
- Zeman, Daniel, Mareček, David, Mašek, Jan, Popel, Martin, Ramasamy, Loganathan, Rosa, Rudolf, Štěpánek, Jan, and Žabokrtský, Zdeněk
- Publisher:
- Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
- Type:
- text and corpus
- Subject:
- treebank, Stanford dependencies, Prague dependencies, harmonization, common annotation style, and Interset
- Language:
- Arabic, Bulgarian, Bengali, Catalan, Czech, Danish, German, Modern Greek (1453-), English, Spanish, Estonian, Basque, Persian, Finnish, Ancient Greek (to 1453), Hindi, Hungarian, Italian, Japanese, Latin, Dutch, Portuguese, Romanian, Russian, Slovak, Slovenian, Swedish, Tamil, Telugu, and Turkish
- Description:
- HamleDT 2.0 is a collection of 30 existing treebanks harmonized into a common annotation style, the Prague Dependencies, and further transformed into Stanford Dependencies, a treebank annotation style that became popular recently. We use the newest basic Universal Stanford Dependencies, without added language-specific subtypes.
- Rights:
- HamleDT 2.0 Licence Agreement, https://lindat.mff.cuni.cz/repository/xmlui/page/licence-hamledt-2.0, and ACA
90. HamleDT 3.0
- Creator:
- Zeman, Daniel, Mareček, David, Mašek, Jan, Popel, Martin, Ramasamy, Loganathan, Rosa, Rudolf, Štěpánek, Jan, and Žabokrtský, Zdeněk
- Publisher:
- Charles University
- Type:
- text and corpus
- Subject:
- annotated corpus, morphology, syntax, dependency, treebank, harmonized annotation, and common annotation style
- Language:
- Arabic, Basque, Bengali, Bulgarian, Catalan, Croatian, Czech, Danish, Dutch, English, Estonian, Finnish, French, German, Modern Greek (1453-), Ancient Greek (to 1453), Hebrew, Hindi, Hungarian, Indonesian, Irish, Italian, Japanese, Latin, Persian, Polish, Portuguese, Romanian, Russian, Slovak, Slovenian, Spanish, Swedish, Tamil, Telugu, and Turkish
- Description:
- HamleDT (HArmonized Multi-LanguagE Dependency Treebank) is a compilation of existing dependency treebanks (or dependency conversions of other treebanks), transformed so that they all conform to the same annotation style. This version uses Universal Dependencies as the common annotation style. Update (November 1017): for a current collection of harmonized dependency treebanks, we recommend using the Universal Dependencies (UD). All of the corpora that are distributed in HamleDT in full are also part of the UD project; only some corpora from the Patch group (where HamleDT provides only the harmonizing scripts but not the full corpus data) are available in HamleDT but not in UD.
- Rights:
- HamleDT 3.0 License Terms, https://lindat.mff.cuni.cz/repository/xmlui/page/licence-hamledt-3.0, and PUB
91. História da Madeira /
- Creator:
- Vieira, Alberto,
- Type:
- text and monografie
- Subject:
- Dějiny států a území na Pyrenejském poloostrově, dějiny zemí, dějiny regionální, Portugalsko, politické dějiny, politici, and přehledná zpracování světových dějin (chronologicky)
- Language:
- Portuguese
- Rights:
- unknown
92. História do Brasil /
- Creator:
- Fausto, Boris,
- Type:
- text and monografie
- Subject:
- Dějiny Jižní Ameriky. Latinská Amerika, dějiny států, Brazílie, přehledná zpracování (tematicky), and přehledná zpracování světových dějin (chronologicky)
- Language:
- Portuguese
- Rights:
- unknown
93. Ibero-Americana Pragensia :
- Type:
- text and sborníky
- Subject:
- Iberorománské jazyky, Dějiny Jižní Ameriky. Latinská Amerika, hispanistika, iberoamerikanistika, and česká periodika
- Language:
- Spanish and Portuguese
- Rights:
- unknown
94. Ibero-Americana Pragensia :
- Publisher:
- Karolinum,
- Type:
- sborníky
- Subject:
- Iberorománské jazyky, Dějiny Jižní Ameriky. Latinská Amerika, hispanistika, iberoamerikanistika, and česká periodika
- Language:
- Spanish and Portuguese
- Rights:
- unknown
95. Ibero-Americana Pragensia :
- Publisher:
- Karolinum,
- Type:
- sborníky
- Subject:
- Iberorománské jazyky, Dějiny Jižní Ameriky. Latinská Amerika, hispanistika, iberoamerikanistika, and česká periodika
- Language:
- Spanish and Portuguese
- Rights:
- unknown
96. Ibero-Americana Pragensia :
- Publisher:
- Karolinum,
- Type:
- sborníky
- Subject:
- Iberorománské jazyky, Dějiny Jižní Ameriky. Latinská Amerika, hispanistika, iberoamerikanistika, and česká periodika
- Language:
- Spanish and Portuguese
- Rights:
- unknown
97. Ibero-Americana Pragensia :
- Publisher:
- Karolinum,
- Type:
- sborníky
- Subject:
- Iberorománské jazyky, Dějiny Jižní Ameriky. Latinská Amerika, hispanistika, iberoamerikanistika, and česká periodika
- Language:
- Spanish and Portuguese
- Rights:
- unknown
98. Ibero-Americana Pragensia :
- Publisher:
- Karolinum,
- Subject:
- Iberorománské jazyky, Dějiny Jižní Ameriky. Latinská Amerika, hispanistika, iberoamerikanistika, and česká periodika
- Language:
- Spanish and Portuguese
- Rights:
- unknown
99. Ibero-Americana Pragensia :
- Publisher:
- Karolinum,
- Type:
- sborníky
- Subject:
- Iberorománské jazyky, Dějiny Jižní Ameriky. Latinská Amerika, hispanistika, iberoamerikanistika, and česká periodika
- Language:
- Spanish and Portuguese
- Rights:
- unknown
100. Ibero-Americana Pragensia :
- Type:
- text and sborníky
- Subject:
- Iberorománské jazyky, Dějiny Jižní Ameriky. Latinská Amerika, hispanistika, iberoamerikanistika, and česká periodika
- Language:
- Spanish and Portuguese
- Rights:
- unknown
- « Previous
- Next »
- 1
- 2
- 3