Number of results to display per page
Search Results
842. Julius Stoklasa (agrobiologist)
- Creator:
- Veselý, Bohumil
- Publisher:
- Národní filmový archiv
- Type:
- video and clip
- Subject:
- Galerie osobností, Places::Praha::Nové Město::Školská::pavlač domu, and People::Stoklasa Julius (1857-1936)
- Language:
- No linguistic content
- Description:
- Professor and agrobiologist Julius Stoklasa in the Botanical Garden. Stoklasa with his colleagues by a greenhouse.
- Rights:
- http://creativecommons.org/licenses/by-nc-nd/4.0/, PUB, and Creative Commons - Attribution-NonCommercial-NoDerivatives 4.0 International (CC BY-NC-ND 4.0)
843. jusText
- Creator:
- Pomikálek, Jan
- Publisher:
- Masaryk University, NLP Centre
- Type:
- toolService and tool
- Subject:
- boilerplate, web documents, text cleaning, boilerplate removal, and text corpora
- Language:
- English
- Description:
- jusText is a heuristic based boilerplate removal tool useful for cleaning documents in large textual corpora. The tool has been implemented in Python, licensed under New BSD License and made an open source software (available for download including the source code at http://code.google.com/p/justext/). It is successfully used for cleaning large textual corpora at Natural language processing centre at Faculty of informatics, Masaryk university Brno and it's industry partners. The research leading to this piece of software was published in author's Ph.D. thesis "Removing Boilerplate and Duplicate Content from Web Corpora". The boilerplate removal algorithm is able to remove most of non-grammatical sentences from a web page like navigation, advertisements, tables, short notes and so on. It has been shown it overperforms or at least keeps up with it's competitors (according to comparison with participants of Cleaneval competition in author's Ph.D. thesis). The precise removal of unwanted content and scalability of the algorithm has been demonstrated while building corpora of American Spanish, Arabic, Czech, French, Japanese, Russian, Tajik, and six Turkic languages consisting --- over 20 TB of HTML pages were processed resulting in corpora of 70 billions tokens altogether. and PRESEMT, Lexical Computing Ltd
- Rights:
- Attribution-ShareAlike 3.0 Unported (CC BY-SA 3.0), http://creativecommons.org/licenses/by-sa/3.0/, and PUB
844. K. M. Walló (poet, screenwriter)
- Creator:
- Veselý, Bohumil
- Publisher:
- Národní filmový archiv
- Type:
- video and clip
- Subject:
- Galerie osobností, Places::Praha::Nové Město::Školská::pavlač domu, and People::Walló K. M. (1914-1990)
- Language:
- No linguistic content
- Description:
- Poet and screenwriter K. M. Walló on Bohumil Veselý's balcony.
- Rights:
- http://creativecommons.org/licenses/by-nc-nd/4.0/, PUB, and Creative Commons - Attribution-NonCommercial-NoDerivatives 4.0 International (CC BY-NC-ND 4.0)
845. Kamil Hilbert (architect)
- Creator:
- Veselý, Bohumil
- Publisher:
- Národní filmový archiv
- Type:
- video and clip
- Subject:
- Galerie osobností, Places::Praha::Hradčany::Pražský hrad::katedrála sv. Víta /ext./, and People::Hilbert Kamil (1869-1933)
- Language:
- No linguistic content
- Description:
- Architect Kamil Hilbert in front of St. Vitus Cathedral at Prague Castle.
- Rights:
- http://creativecommons.org/licenses/by-nc-nd/4.0/, PUB, and Creative Commons - Attribution-NonCommercial-NoDerivatives 4.0 International (CC BY-NC-ND 4.0)
846. Kamil Lhoták (painter)
- Creator:
- Krátký film and Veselý, Bohumil
- Publisher:
- Národní filmový archiv
- Type:
- video and clip
- Subject:
- Galerie osobností, Places::Praha::Nové Město::Školská::pavlač domu, People::Lhoták Kamil (1912-1990), People::Brůžek (neuvedeno-), and Československý filmový týdeník 1957/2
- Language:
- Czech
- Description:
- Painter Kamil Lhoták on Bohumil Veselý's balcony. Kamil Lhoták with gilder Brůžek while choosing a frame in a fragmented segment from Československý filmový týdeník (Czechoslovak Film Weekly Newsreel) 1957, issue no. 2.
- Rights:
- http://creativecommons.org/licenses/by-nc-nd/4.0/, PUB, and Creative Commons - Attribution-NonCommercial-NoDerivatives 4.0 International (CC BY-NC-ND 4.0)
847. Kamila Ungrová (opera singer)
- Creator:
- Veselý, Bohumil
- Publisher:
- Národní filmový archiv
- Type:
- video and clip
- Subject:
- nápis Veselý, Galerie osobností, Places::Praha::Nové Město::Školská::pavlač domu, and People::Ungrová Kamila (1887-1972)
- Language:
- No linguistic content
- Description:
- Opera singer Kamila Ungrová with an unidentified man on Bohumil Veselý's balcony.
- Rights:
- http://creativecommons.org/licenses/by-nc-nd/4.0/, PUB, and Creative Commons - Attribution-NonCommercial-NoDerivatives 4.0 International (CC BY-NC-ND 4.0)
848. KAMOKO-Digitalizer
- Creator:
- Rüdiger, Jan Oliver
- Publisher:
- Rüdiger, Jan Oliver
- Type:
- tool and toolService
- Subject:
- learner corpus, corpus, and annotation
- Language:
- German
- Description:
- This editor was developed especially for the needs of the KAMOKO project (https://lindat.mff.cuni.cz/repository/xmlui/handle/11372/LRT-3261). The editor allows the quick entry of example sentences and sentence variants as well as the corresponding speaker ratings.
- Rights:
- Affero General Public License 3 (AGPL-3.0), http://opensource.org/licenses/AGPL-3.0, and PUB
849. KAMOKO: KAsseler MOrgenstern KOrpus
- Creator:
- Schrott, Angela, Wieders-Lohéac, Aline, and Rüdiger, Jan Oliver
- Publisher:
- Universität Kassel - Institut für Romanistik
- Type:
- text and corpus
- Subject:
- corpus, annotated corpus, French, learner corpus, XML, CorpusExplorer, TXM, and WeblichtXML
- Language:
- French
- Description:
- KAMOKO is a structured and commented french learner-corpus. It addresses the central structures of the French language from a linguistic perspective (18 different courses). The text examples in this corpus are annotated by native speakers. This makes this corpus a valuable resource for (1) advanced language practice/teaching and (2) linguistics research. The KAMOKO corpus can be used free of charge. Information on the structure of the corpus and instructions on how to use it are presented in detail in the KAMOKO Handbook and a video-tutorial (both in german). In addition to the raw XML-data, we also offer various export formats (see ZIP files – supported file formats: CorpusExplorer, TXM, WebLicht, TreeTagger, CoNLL, SPEEDy, CorpusWorkbench and TXT).
- Rights:
- Creative Commons - Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0), http://creativecommons.org/licenses/by-nc-sa/4.0/, and PUB
850. KAMOKO: KAsseler MOrgenstern KOrpus (2021-02-09)
- Creator:
- Schrott, Angela, Wieders-Lohéac, Aline, and Rüdiger, Jan Oliver
- Publisher:
- Universität Kassel - Institut für Romanistik
- Type:
- text and corpus
- Subject:
- corpus, annotated corpus, French, learner corpus, XML, CorpusExplorer, TXM, and WeblichtXML
- Language:
- French
- Description:
- KAMOKO is a structured and commented french learner-corpus. It addresses the central structures of the French language from a linguistic perspective (18 different courses). The text examples in this corpus are annotated by native speakers. This makes this corpus a valuable resource for (1) advanced language practice/teaching and (2) linguistics research. The KAMOKO corpus can be used free of charge. Information on the structure of the corpus and instructions on how to use it are presented in detail in the KAMOKO Handbook and a video-tutorial (both in german). In addition to the raw XML-data, we also offer various export formats (see ZIP files – supported file formats: CorpusExplorer, TXM, WebLicht, TreeTagger, CoNLL, SPEEDy, CorpusWorkbench and TXT).
- Rights:
- Creative Commons - Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0), http://creativecommons.org/licenses/by-nc-sa/4.0/, and PUB