Search Results
912. Indonesian web corpus (idWac)
- Creator:
- Medveď, Marek and Suchomel, Vít
- Publisher:
- Natural Language Processing Centre, Faculty of Informatics, Masaryk University
- Type:
- text and corpus
- Subject:
- corpus, lemmatization, and PoS tagging
- Language:
- Indonesian
- Description:
- Indonesian text corpus from the web. Crawled by SpiderLing in 2017 and filtered with JusText and Onion (see http://corpus.tools/ for details). Tagged and lemmatized by MorphInd (http://septinalarasati.com/morphind/).
- Rights:
- NLP Centre Web Corpus License, https://lindat.mff.cuni.cz/repository/xmlui/page/license-NLPC-WeC, and ACA
913. Information extraction from EIA documents
- Creator:
- Lukšová, Ivana and Hladká, Barbora
- Publisher:
- Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
- Type:
- tool and toolService
- Subject:
- information extraction and rule-based extraction
- Language:
- Czech
- Description:
- Environmental impact assessment (EIA) is the formal process used to predict the environmental consequences of a plan. We present a rule-based extraction system to mine Czech EIA documents. The extraction rules work with a set of documents enriched with morphological information and with manually created vocabularies of the terms to be extracted from the documents, e.g. basic information about the project (address, company ID, ...), data on the impacts and outcomes (waste substances, endangered species, ...), and a final opinion. The Notice of Intent documents contain the section BI2, which gives information on the scope (capacity) of the plan.
- Rights:
- GNU General Public Licence, version 3, http://opensource.org/licenses/GPL-3.0, and PUB
914. Intas corpus
- Publisher:
- Department of Languages, University of Jyväskylä
- Type:
- corpus
- Language:
- Dutch, Finnish, and Russian
- Description:
- A corpus of spontaneous discussions and read-aloud performances from native speakers of different ages. Parallel corpus in Russian, Finnish, and Dutch.
- Rights:
- Not specified
915. Integrated lexicographic platform for Russian
- Creator:
- Rambousek, Adam
- Publisher:
- Masaryk University, NLP Centre
- Type:
- toolService
- Subject:
- lexicography platform, russian, and web dictionary
- Language:
- Russian
- Description:
- Integrated lexicographic platform for Russian.
- Rights:
- Attribution-NonCommercial-NoDerivs 3.0 Unported (CC BY-NC-ND 3.0), http://creativecommons.org/licenses/by-nc-nd/3.0/, and PUB
916. INTERA Terminological Lexicon
- Type:
- lexicalConceptualResource
- Language:
- Bulgarian, English, Modern Greek (1453-), Serbian, and Slovenian
- Description:
- 17357 terms, XML
- Rights:
- Not specified
917. Interment of the remains of 42 legionaries executed on the Italia
- Creator:
- (:unav) Unknown author
- Publisher:
- Národní filmový archiv
- Type:
- video and clip
- Subject:
- pohřeb legionářský, generálové českoslovenští, pohřeb 42 italských legionářů, slavnost legionářská, legie československé, projevy veřejné, vyznamenání vojenská, slavnost vojenská, průvod pohřební, akt pietní legionáři, přehlídka legionářská, Významné pohřby, Places::Praha::Staré Město::Staroměstské náměstí, and Places::Praha::Staré Město::Staroměstské náměstí::Husův pomník
- Language:
- No linguistic content
- Description:
- The segment captures the reverential act of depositing the remains of forty-two Italian legionnaires who were executed for deserting from the Austrian army to join the Italian legions in the summer of 1918. The coffins with their bodies were temporarily placed at the military cemetery in Milovice and later unearthed and transported to Prague, where a day-long funeral ceremony was held on 24 April 1921. The camera focuses on military troops lined up on Old Town Square and Italian and Czechoslovak officers. The ceremony is witnessed by Minister of National Defence Otakar Husák and the General Inspector of the Czechoslovak Army, the poet Josef Svatopluk Machar. Shots of speeches given by Josef Rotnágl, a member of the Revolutionary National Assembly, and General Otakar Husák, who delivers a message from the President of Czechoslovakia (silent). This is followed by speeches given by the Senator of the National Assembly, Václav Klofáč, Deputy of the National Assembly František Udržal, and the President of the Italian-Czechoslovak League, Prince Pietro Lanza di Scalea, whose speech is interpreted by diplomat Jan Šeba. Shot of the commander of the funeral procession, General Karel Voženílek, on horseback. General Otakar Husák and Josef Svatopluk Machar receive Italian military honours. After the solemn ceremony on Old Town Square, the coffins with the remains of the executed legionnaires were taken to the military burial ground at Olšany Cemetery.
- Rights:
- http://creativecommons.org/licenses/by-nc-nd/4.0/, PUB, and Creative Commons - Attribution-NonCommercial-NoDerivatives 4.0 International (CC BY-NC-ND 4.0)
918. Interment of the remains of poet Karel Hynek Mácha at Vyšehrad
- Creator:
- Aktualita
- Publisher:
- Národní filmový archiv
- Type:
- video and clip
- Subject:
- pohřeb Mácha Karel Hynek, akt pietní Mácha Karel Hynek, portrét Mácha Karel Hynek, věnce smuteční, rakev zahalená vlajkou českou, stráž čestná u rakve, žena křižující se, strážník, hodnostáři církevní na pohřbu, ministranti na pohřbu, rakev na katafalku, oheň věčný, pomník sv. Václav, lidé přihlížející, vůz pohřební tažený koňmi, projevy smuteční, vlajky české, rakev vynášení, architektura novogotická, věnec smuteční ve tvaru lyry, družičky na pohřbu, automobily v pohřebním průvodu, hřbitov, lidé v krojích, kameraman, mikrofon, chlapec křižující se, hrob Mácha Karel Hynek, náhrobek Mácha Karel Hynek, nápis Dalekáť cesta má! Marné volání!, Významné pohřby, People::Kapras Jan (1880-1947), People::Klapka Otakar (1891-1941), People::Vydra Václav ml. (1902-1979), People::Novák Vítězslav (1870-1949), People::Švabinský Max (1873-1962), People::Halas František (1901-1949), People::Hora Josef (1891-1945), People::Wünsch Antonín (1864-1953), People::Medek Rudolf (1890-1940), People::Kohout Eduard (1889-1976), and Český zvukový týdeník Aktualita::1939/20
- Language:
- Czech
- Description:
- Segment of the Český zvukový týdeník Aktualita (Czech Aktualita Sound Newsreel) 1939 No. 20 captures the solemn event of the interment of the remains of poet Karel Hynek Mácha at Vyšehrad Cemetery in Prague on 7 May 1939. Mourners walk past the coffin with the poet's remains in the Pantheon of the National Museum. The large funeral procession starts on Wenceslaus Square and continues along National Street, Masaryk Embankment and narrow alleys to Vyšehrad. The streets are lined with crowds of people. The film footage is accompanied by the recitation of the fourth canto of the poem May delivered by Václav Vydra Jr., an actor of the National Theatre. This is followed by images from the solemn ceremony in the Slavín Tomb at Vyšehrad Cemetery. The coffin with the poet's remains is lowered into the grave. Rudolf Medek bids farewell to Mácha on behalf of Czech writers. Actor Eduard Kohout recites 7 May 1939, a poem by Josef Hora. People walk past the grave, placing flowers on it, some crossing themselves. The mourners include composer Vítězslav Novák, painter Max Švabinský, Minister of Education and National Enlightenment Jan Kapras and the Mayor of Prague Otakar Klapka.
- Rights:
- http://creativecommons.org/licenses/by-nc-nd/4.0/, PUB, and Creative Commons - Attribution-NonCommercial-NoDerivatives 4.0 International (CC BY-NC-ND 4.0)
919. International Corpus of English: East Africa (ICE-EA)
- Publisher:
- Technische Universität Chemnitz, Universität Bayreuth
- Type:
- corpus
- Subject:
- corpus
- Language:
- English
- Description:
- One million words of spoken and written English from Kenya and Tanzania. Part of the ICE project.
- Rights:
- Not specified
920. International Corpus of English: Great Britain (ICE-GB)
- Publisher:
- Survey of English Usage, University College London
- Type:
- corpus
- Language:
- English
- Description:
- One million words of written and spoken English from Great Britain. Transcriptions aligned with digitised speech recordings. POS-tagged and parsed. Part of the International Corpus of English project. Custom-made search software: ICE-CUP.
- Rights:
- Not specified