Skip to search
Skip to main content
Skip to first result
Search
Search Results
Type:
corpus
Language:
English
Description:
General reference corpus; 100 million words; POS, lemma, descriptive metadata
Rights:
Not specified
Type:
lexicalConceptualResource
Subject:
Germanistik
Language:
German
Description:
5. Aufl. 1911; Fokus auf Politik, Wirtschaft, Kultur und Technik zu Beginn des 20. Jahrhunderts
Rights:
Not specified
Creator:
Ouamer, meriem , Bouzoubaa, Karim , and Tajmout, rachida
Publisher:
ALELM research group
Type:
text , wordList , and lexicalConceptualResource
Subject:
Broken plural
Language:
Arabic
Description:
An LMF conformant XML-based file containing a comprehensive Arabic broken plural list. The file contains 12,249 singular words with their corresponding BPs
Rights:
Creative Commons - Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0) , http://creativecommons.org/licenses/by-nc-sa/4.0/ , and PUB
Creator:
Veselý, Bohumil
Publisher:
Národní filmový archiv
Type:
video and clip
Subject:
Galerie osobností , Places::Praha::Nové Město::Školská::pavlač domu , and People::Chorovič Bronislav (1888-1980)
Language:
No linguistic content
Description:
Opera singer Bronislav Chorovič on Bohumil Veselý's balcony.
Rights:
http://creativecommons.org/licenses/by-nc-nd/4.0/ , PUB , and Creative Commons - Attribution-NonCommercial-NoDerivatives 4.0 International (CC BY-NC-ND 4.0)
Publisher:
Academy of Sciences
Type:
corpus
Language:
Hungarian
Description:
BSI is a large-scale survey which provides reliable data on and analyses of the varieties of Hungarian spoken in Budapest.
Rights:
Not specified
Type:
corpus
Language:
Bulgarian
Description:
Written, synchronic, general (newspapers)
Rights:
Not specified
Type:
corpus
Language:
Bulgarian and Croatian
Description:
written; domain-specific (newspaper); diachronic; bilingual; comparable; ca 3,500,000 tokens (393 Kw Bulgarian; 3.1 Mw Croatian)
Rights:
Not specified
Type:
corpus
Language:
Bulgarian
Description:
HPSG-based annotation including: constituent structure, dependency relations, named entities (classified as person, organisation, location or other names), coreferential relations. Annotation in XML
Rights:
Not specified
Type:
lexicalConceptualResource
Language:
Bulgarian
Description:
100 000 most frequent Cyrillic tokens in the BulTreeBank text archive, UTF-16 list of token-frequency pairs
Rights:
Not specified
Creator:
Simov, Kiril and Osenova, Petya
Publisher:
Linguistic Modeling Department, IPP, Bulgarian Academy of Sciences
Type:
toolService
Description:
It is used morphological lexicon of Bulgarian (100 000 lemmas) compiled as a finite-state automaton in CLaRK System. It requires the text to be first tokenized and it is applied in each token. Includes also guessers for unknown words and Named Entities gazetteers. If the corresponding resources are available for a different language, then it can be tuned to it.
Rights:
Not specified