Number of results to display per page
Search Results
362. Pamiętniki włościanina :
- Creator:
- Słomka, Jan,
- Type:
- text and vzpomínky
- Subject:
- Dějiny Evropy, zemědělci, obyvatelstvo venkovské, život každodenní, Polsko, zemědělci, řemeslníci, poddaní, and světové dějiny 1789-1918
- Language:
- Polish
- Rights:
- unknown
363. Pamiętniki.
- Creator:
- Mickiewicz, Władysław Józef,
- Type:
- text and paměti
- Subject:
- Dějiny Evropy, Mickiewicz, Władysław Józef,, spisovatelé polští, Polsko, dějiny vědy, umění, kultury a techniky, kulturní vztahy, světové dějiny 1789-1918, and světové dějiny 1918-1945
- Language:
- Polish
- Rights:
- unknown
364. Pamiętniki.
- Creator:
- Mickiewicz, Władysław Józef,
- Type:
- text and paměti
- Subject:
- Dějiny Evropy, Polská literatura, Mickiewicz, Władysław Józef,, spisovatelé polští, Polsko, světové dějiny 1789-1918, and dějiny vědy, umění, kultury a techniky, kulturní vztahy
- Language:
- Polish
- Rights:
- unknown
365. Pamietniki.
- Creator:
- Daszyński, Ignacy,
- Type:
- text and paměti
- Subject:
- Dějiny zemí střední Evropy, Daszyński, Ignacy,, socialismus, politici polští, Polsko, politické dějiny, politici, and světové dějiny 1789-1918
- Language:
- Polish
- Rights:
- unknown
366. Pamiętniki.
- Creator:
- Mickiewicz, Władysław Józef,
- Type:
- text and paměti
- Subject:
- Dějiny Evropy, Polská literatura (o ní), Mickiewicz, Władysław Józef,, spisovatelé polští, Polsko, dějiny vědy, umění, kultury a techniky, kulturní vztahy, and světové dějiny 1789-1918
- Language:
- Polish
- Rights:
- unknown
367. Pamiętny sejm /
- Creator:
- Sobieski, Wacław,
- Type:
- text and monografie
- Subject:
- Dějiny zemí střední Evropy, sněmy zemské, vztahy panovník-šlechta, Polsko, politické dějiny, politici, and světové dějiny 1492-1648
- Language:
- Polish
- Rights:
- unknown
368. Pani Masarykowa :
- Creator:
- Doležal, Jaromír,
- Type:
- text and biografie
- Subject:
- Dějiny Česka a Slovenska, Masaryková-Garrigue, Charlotte,, ženy, biografie, politici, ženská otázka, rodina, děti, životní úroveň, české země 1848-1918, and Československo 1918-1938
- Language:
- Polish
- Description:
- Charlotta Garrigue Masaryková
- Rights:
- unknown
369. ParaCrawl Corpus version 1.0
- Creator:
- Koehn, Philipp, Heafield, Kenneth, Forcada, Mikel L., Esplà-Gomis, Miquel, Ortiz-Rojas, Sergio, Sánchez, Gema Ramírez, Cartagena, Víctor M. Sánchez, Haddow, Barry, Bañón, Marta, Střelec, Marek, Samiotou, Anna, and Kamran, Amir
- Publisher:
- ParaCrawl
- Type:
- text and corpus
- Subject:
- ParaCrawl, parallel corpus, CommonCrawl, machine translation, and text corpora
- Language:
- English, German, French, Spanish, Italian, Portuguese, Dutch, Polish, Czech, Romanian, Finnish, Latvian, Russian, and Estonian
- Description:
- The January 2018 release of the ParaCrawl is the first version of the corpus. It contains parallel corpora for 11 languages paired with English, crawled from a large number of web sites. The selection of websites is based on CommonCrawl, but ParaCrawl is extracted from a brand new crawl which has much higher coverage of these selected websites than CommonCrawl. Since the data is fairly raw, it is released with two quality metrics that can be used for corpus filtering. An official "clean" version of each corpus uses one of the metrics. For more details and raw data download please visit: http://paracrawl.eu/releases.html
- Rights:
- Public Domain Dedication (CC Zero), http://creativecommons.org/publicdomain/zero/1.0/, and PUB
370. PARSEME corpora annotated for verbal multiword expressions (version 1.3)
- Creator:
- Savary, Agata, Ramisch, Carlos, Guillaume, Bruno, Hawwari, Abdelati, Walsh, Abigail, Fotopoulou, Aggeliki, Bielinskienė, Agnė, Estarrona, Ainara, Gatt, Albert, Butler, Alexandra, Rademaker, Alexandre, Maldonado, Alfredo, Villavicencio, Aline, Farrugia, Alison, Muscat, Amanda, Gatt, Anabelle, Antić, Anđela, De Santis, Anna, Raffone, Annalisa, Riccio, Anna, Pascucci, Antonio, Gurrutxaga, Antton, Bhatia, Archna, Vaidya, Ashwini, Miral, Ayşenur, QasemiZadeh, Behrang, Priego Sanchez, Belem, Griciūtė, Bernadeta, Erden, Berna, Parra Escartín, Carla, Herrero, Carlos, Carlino, Carola, Pasquer, Caroline, Liebeskind, Chaya, Wang, Chenweng, Ben Khelil, Chérifa, Bonial, Claire, Somers, Clarissa, Aceta, Cristina, Krstev, Cvetana, Bejček, Eduard, Lindqvist, Ellinor, Erenmalm, Elsa, Palka-Binkiewicz, Emilia, Rimkute, Erika, Petterson, Eva, Cap, Fabienne, Hu, Fangyuan, Sangati, Federico, Wick Pedro, Gabriela, Speranza, Giulia, Jagfeld, Glorianna, Blagus, Goranka, Berk, Gözde, Attard, Greta, Eryiğit, Gülşen, Finnveden, Gustav, Martínez Alonso, Héctor, de Medeiros Caseli, Helena, Elyovich, Hevi, Xu, Hongzhi, Xiao, Huangyang, Miranda, Isaac, Jaknić, Isidora, El Maarouf, Ismail, Aduriz, Itziar, Gonzalez, Itziar, Matas, Ivana, Stoyanova, Ivelina, Jazbec, Ivo-Pavao, Busuttil, Jael, Waszczuk, Jakub, Findlay, Jamie, Bonnici, Janice, Šnajder, Jan, Antoine, Jean-Yves, Foster, Jennifer, Chen, Jia, Nivre, Joakim, Monti, Johanna, McCrae, John, Kovalevskaitė, Jolanta, Jain, Kanishka, Simkó, Katalin, Yu, Ke, Azzopardi, Kirsty, Adalı, Kübra, Uria, Larraitz, Zilio, Leonardo, Boizou, Loïc, van der Plas, Lonneke, Galea, Luke, Sarlak, Mahtab, Buljan, Maja, Cherchi, Manuela, Tanti, Marc, Di Buono, Maria Pia, Todorova, Maria, Candito, Marie, Constant, Matthieu, Shamsfard, Mehrnoush, Jiang, Menghan, Boz, Mert, Spagnol, Michael, Onofrei, Mihaela, Li, Minli, Elbadrashiny, Mohamed, Diab, Mona, Rizea, Monica-Mihaela, Hadj Mohamed, Najet, Theoxari, Natasa, Schneider, Nathan, Tabone, Nicole, Ljubešić, Nikola, Vale, Oto, Cook, Paul, Yan, Peiyi, Gantar, Polona, Ehren, Rafael, Fabri, Ray, Ibrahim, Rehab, Ramisch, Renata, Walles, Rinat, Wilkens, Rodrigo, Urizar, Ruben, Sun, Ruilong, Malka, Ruth, Galea, Sara Anne, Stymne, Sara, Louizou, Sevasti, Hu, Sha, Taslimipoor, Shiva, Ratori, Shraddha, Srivastava, Shubham, Cordeiro, Silvio Ricardo, Krek, Simon, Liu, Siyuan, Zeng, Si, Yu, Songping, Arhar Holdt, Špela, Markantonatou, Stella, Papadelli, Stella, Leseva, Svetlozara, Kuzman, Taja, Kavčič, Teja, Lynn, Teresa, Lichte, Timm, Pickard, Thomas, Dimitrova, Tsvetana, Yih, Tsy, Güngör, Tunga, Dinç, Tutkum, Iñurrieta, Uxoa, Tajalli, Vahide, Stefanova, Valentina, Caruso, Valeria, Puri, Vandana, Foufi, Vassiliki, Barbu Mititelu, Verginica, Vincze, Veronika, Kovács, Viktória, Shukla, Vishakha, Giouli, Voula, Ge, Xiaomin, Ha-Cohen Kerner, Yaakov, Öztürk, Yağmur, Yarandi, Yalda, Parmentier, Yannick, Zhang, Yongchen, Zhao, Yun, Urešová, Zdeňka, Yirmibeşoğlu, Zeynep, Qin, Zhenzhen, Stank, Cristescu, Mihaela, Zgreabăn, Bianca-Mădălina, Bărbulescu, Elena-Andreea, and Stanković, Ranka
- Publisher:
- PARSEME
- Type:
- text and corpus
- Subject:
- multiword expressions, verbal multiword expressions, light verb construction, verb-particle constructions, inherently reflexive verbs, verbal idioms, and multi-verb constructions
- Language:
- Arabic, Bulgarian, Czech, German, Modern Greek (1453-), English, Spanish, Basque, Persian, French, Irish, Hebrew, Hindi, Croatian, Hungarian, Lithuanian, Italian, Maltese, Polish, Portuguese, Romanian, Slovenian, Serbian, Swedish, Turkish, and Chinese
- Description:
- This multilingual resource contains corpora in which verbal MWEs have been manually annotated. VMWEs include idioms (let the cat out of the bag), light-verb constructions (make a decision), verb-particle constructions (give up), inherently reflexive verbs (help oneself), and multi-verb constructions (make do). This is the first release of the corpora without an associated shared task. Previous version (1.2) was associated with the PARSEME Shared Task on semi-supervised Identification of Verbal MWEs (2020). The data covers 26 languages corresponding to the combination of the corpora for all previous three editions (1.0, 1.1 and 1.2) of the corpora. VMWEs were annotated according to the universal guidelines. The corpora are provided in the cupt format, inspired by the CONLL-U format. Morphological and syntactic information, including parts of speech, lemmas, morphological features and/or syntactic dependencies, are also provided. Depending on the language, the information comes from treebanks (e.g., Universal Dependencies) or from automatic parsers trained on treebanks (e.g., UDPipe). All corpora are split into training, development and test data, following the splitting strategy adopted for the PARSEME Shared Task 1.2. The annotation guidelines are available online: https://parsemefr.lis-lab.fr/parseme-st-guidelines/1.3 The .cupt format is detailed here: https://multiword.sourceforge.net/cupt-format/
- Rights:
- PARSEME Corpora v. 1.3 - Licence Agreement, https://lindat.mff.cuni.cz/repository/xmlui/page/licence-mwe-1.3, and PUB