Language: Polish - LINDAT/CLARIAH-CZ Catalog Search Results


751. Corpus for training and evaluating diacritics restoration systems

Creator:: Náplava, Jakub, Straka, Milan, Hajič, Jan, and Straňák, Pavel
Publisher:: Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
Type:: text and corpus
Subject:: diacritical marks generation and natural language correction
Language:: Czech, Vietnamese, Romanian, Polish, Slovak, Spanish, Croatian, Irish, Latvian, Hungarian, French, and Turkish
Description:: Corpus of texts in 12 languages. For each language, we provide one training, one development, and one testing set acquired from Wikipedia articles. In addition, each language dataset contains a substantially larger training set collected from general Web texts. All sets are disjoint, except that the Wikipedia and Web training sets may contain similar sentences. Data are segmented into sentences, which are further word-tokenized. All data in the corpus contain diacritics. To strip diacritics from them, use the Python script diacritization_stripping.py contained in the attached stripping_diacritics.zip. The script has two modes; we generally recommend the method called uninames, which behaves better for some languages. The code for training a recurrent-neural-network-based model for diacritics restoration is located at https://github.com/arahusky/diacritics_restoration.
Rights:: Creative Commons - Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0), http://creativecommons.org/licenses/by-nc-sa/4.0/, and PUB
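
The record above mentions a name-based ("uninames") stripping mode but does not show it. The following is a minimal sketch of how such a mode could work, not the contents of diacritization_stripping.py: the function names strip_by_uniname and strip_by_decomposition are hypothetical, and the decomposition variant is included only as a contrast, on the assumption that it resembles a common alternative approach. It may help explain why a name-based method can behave better for some languages: letters such as the Polish Ł have no canonical Unicode decomposition, so a decomposition-based approach leaves them unchanged.

```python
import unicodedata


def strip_by_uniname(text: str) -> str:
    """Map each character to the base letter named before ' WITH ' in its Unicode name.

    Hypothetical helper for illustration; not the script shipped with the corpus.
    """
    out = []
    for ch in text:
        name = unicodedata.name(ch, "")
        base_name, sep, _ = name.partition(" WITH ")
        if sep:
            try:
                ch = unicodedata.lookup(base_name)
            except KeyError:
                pass  # no character with the shortened name; keep the original
        out.append(ch)
    return "".join(out)


def strip_by_decomposition(text: str) -> str:
    """Decompose to NFD and drop combining marks (contrast case, also hypothetical)."""
    decomposed = unicodedata.normalize("NFD", text)
    return "".join(c for c in decomposed if not unicodedata.combining(c))


if __name__ == "__main__":
    print(strip_by_uniname("Łódź"))        # Lodz
    print(strip_by_decomposition("Łódź"))  # Łodz (Ł has no canonical decomposition)
```

In this sketch, characters whose Unicode name contains no " WITH " part (plain letters, digits, punctuation) pass through unchanged.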

752. CorpusExplorer

Creator:: Rüdiger, Jan Oliver
Publisher:: Jan Oliver Rüdiger
Type:: tool and toolService
Subject:: Corpus Linguistics, NLP, conll, tei, XML, nlp, Natural Language Processing, linguistics, Linguistics, Computational Linguistics, corpus processing, tagger, POS tagger, lemmatization, text cleaning, CommonCrawl, epub, JSON, Twitter, Pandoc, Wikipedia, digital data, DTA, DSpin, MySQL, ElasticSearch, TextGrid, text corpora, TigerXML, and WeblichtXML
Language:: German, English, French, Italian, Dutch, Spanish, Polish, Arabic, Chinese, and Portuguese
Description:: Software for corpus linguists and text/data mining enthusiasts. The CorpusExplorer combines over 45 interactive visualizations under a user-friendly interface. Routine tasks such as text acquisition, cleaning, or tagging are completely automated. The simple interface supports use in university teaching and leads users and students to fast and substantial results. The CorpusExplorer supports many standards (XML, CSV, JSON, R, etc.) and also offers its own software development kit (SDK). Source code is available at https://github.com/notesjor/corpusexplorer2.0
Rights:: Not specified

753. Crescente cottidie malicia perversorum. Notka o wpływie statutów Jakuba Świnki na czeskie prawodawstwo kościelne

Creator:: Krafl, Pavel
Type:: text and study
Subject:: History of the Christian Church, diocesan statutes, Czech-Polish relations, Czech lands 1306-1526, canon law, Inquisition, Poland, and world history of the Middle Ages (to 1492)
Language:: Polish
Rights:: unknown

754. Crimina et mores. Prawo karne i obyczaje w starożytnym Rzymie

Publisher:: Wydawn. Univ. Marii Curie-Skłodowskiej
Subject:: history of law, Roman law, criminal law, constitutional and legal history, and world history - prehistory and antiquity
Language:: Polish
Rights:: unknown

755. Cronica monasterii canonicorum regularium (S. Augustini) in Glacz. = Kronika klasztoru kanoników regularnych (Św. Augustyna) w Kłodzku

Publisher:: Universitas Wratislaviensis, Institutum studiorum Silesiacorum et Bohemicorum
Subject:: Czacheritz, Michał, monastic chronicles, editions, order, Augustinian canons, medieval chronicles, Poland, world history of the Middle Ages (to 1492), religious orders and congregations, religious brotherhoods, monasteries, and historiography, historical sciences, historians
Language:: Polish and Latin
Rights:: unknown

756. Črty uhlem

Creator:: Sienkiewicz, Henryk and Moudrý, Cyril S.
Publisher:: J.F. Kubeš
Format:: print, Text, regular print, and 104 p. ; 19 cm
Type:: model:monograph and TEXT
Subject:: 821.162.1-32 and (0:82-32)
Language:: Czech and Polish
Description:: Converted from MARCXML to MODS version 3.5 using MARC21slim2MODS3-5.xsl (Revision 1.106 2014/12/19)(EE patch 2015/05/15), pictures of country life by Henryk Sienkiewicz ; translated from Polish by Cyril S. Moudrý, Year of publication and original title taken from the bibliographic catalogue of the 19th century, Bound with: Historické arabesky / Z. Winter, and Converted from MODS 3.5 to DC version 1.8 (EE patch 2015/06/25)
Rights:: http://creativecommons.org/publicdomain/mark/1.0/ and policy:public

757. CUBBITT Translation Models (en-pl) (v1.0)

Creator:: Popel, Martin, Tomková, Markéta, Tomek, Jakub, Kaiser, Łukasz, Uszkoreit, Jakob, Bojar, Ondřej, and Žabokrtský, Zdeněk
Publisher:: Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
Type:: tool and toolService
Subject:: machine translation, neural machine translation, transformer, and cubbitt
Language:: English and Polish
Description:: CUBBITT En-Pl translation models, exported via TensorFlow Serving, available in the Lindat translation service (https://lindat.mff.cuni.cz/services/translation/). Models are compatible with Tensor2Tensor version 1.6.6. For details about the model training (data, model hyper-parameters), please contact the archive maintainer. Evaluation on newstest2020 (BLEU): en->pl: 12.3, pl->en: 20.0 (evaluated using multeval: https://github.com/jhclark/multeval)
Rights:: Creative Commons - Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0), http://creativecommons.org/licenses/by-nc-sa/4.0/, and PUB

758. Cudze chwalicie... Perypetie inżynierów z rodziny Strakowskich w Gdańsku w XVII wieku

Creator:: Dybaś, Bogusław
Subject:: Strackwitz (family), engineers, technical intelligentsia, lineages and families, world history of the modern era (1492-1918), Poland, and intelligentsia, officials, other social groups
Language:: Polish
Description:: You praise what is foreign... The vicissitudes of the engineers of the Strackwitz family in Danzig during the 17th century.
Rights:: unknown

759. Cudzoziemcy w polskim ruchu oporu 1939-1945

Creator:: Okęcki, Stanisław
Type:: text and monograph
Subject:: History of Central European countries, anti-fascist resistance, Second World War (1939-1945), international cooperation, Poland, resistance, opposition, anti-fascism, anti-communism, and world history 1939-1945
Language:: Polish
Rights:: unknown

760. Cuius ius? O istocie władzy - dyskusja między Luksemburgami i śląskimi Piastami

Creator:: Wiszewski, Przemysław
Type:: text and study
Subject:: International relations, world politics, Luxembourgs (dynasty), Piasts (dynasty), ruling dynasties, inheritance law, rulers, monarchical rule, Czech lands 1306-1419, Poland, world history of the Middle Ages (to 1492), rulers, ruling dynasties, courts, and political history, politicians
Language:: Polish
Description:: Cuius ius? On the essence of government - the debate between the Luxembourgs and the Piast dynasty of Silesia.
Rights:: unknown


