1 - 10 of 10
Number of results to display per page
Search Results
2. Botanicus Digital Library
- Type:
- corpus
- Subject:
- Germanistik
- Language:
- Chinese, Czech, English, French, German, Latin, and Spanish
- Description:
- Digital copies of historical botanic papers from the Missouri Botanical Garden Library; Bilddigitalisate von historischen botanischen Schriften; deutschsprachige Texte stellen nur einen Teilbereich dar
- Rights:
- Not specified
3. Chinese history and literature :
- Creator:
- Průšek, Jaroslav,
- Type:
- text and soubory studií
- Subject:
- Sino-tibetské literatury (o nich), literatura čínská, dějiny literatury, Čína, literatura, spisovatelé, přehledná zpracování světových dějin (chronologicky), and politické dějiny, politici
- Language:
- English, French, and Chinese
- Rights:
- unknown
4. Considering the end :
- Creator:
- Chan, Timothy Wai Keung
- Type:
- text
- Subject:
- Literatura různých forem a žánrů (o ní), literatura čínská, poezie, Čína, světové dějiny středověku (do r. 1492), and literatura, spisovatelé
- Language:
- English and Chinese
- Rights:
- unknown
5. CorpusExplorer
- Creator:
- Rüdiger, Jan Oliver
- Publisher:
- Jan Oliver Rüdiger
- Type:
- tool and toolService
- Subject:
- Corpus Linguisitics, NLP, conll, tei, XML, nlp, Natural Language Processing, linguistics, Linguistics, Computational Linguistics, corpus processing, tagger, POS tagger, lemmatization, text cleaning, CommonCrawl, epub, JSON, Twitter, Pandoc, Wikipedia, digital data, DTA, DSpin, MySQL, ElasticSearch, TextGrid, text corpora, TigerXML, and WeblichtXML
- Language:
- German, English, French, Italian, Dutch, Spanish, Polish, Arabic, Chinese, and Portuguese
- Description:
- Software for corpus linguists and text/data mining enthusiasts. The CorpusExplorer combines over 45 interactive visualizations under a user-friendly interface. Routine tasks such as text acquisition, cleaning or tagging are completely automated. The simple interface supports the use in university teaching and leads users/students to fast and substantial results. The CorpusExplorer is open for many standards (XML, CSV, JSON, R, etc.) and also offers its own software development kit (SDK). Source code available at https://github.com/notesjor/corpusexplorer2.0
- Rights:
- Not specified
6. OmegaWiki
- Publisher:
- Universität Bamberg, World Language Documentation Centre
- Format:
- application/octet-stream
- Type:
- lexicalConceptualResource
- Language:
- Afrikaans, Arabic, Basque, Bulgarian, Catalan, Chinese, Czech, Danish, Dutch, English, Esperanto, Estonian, Finnish, French, Galician, Georgian, Modern Greek (1453-), Hebrew, Hungarian, Icelandic, Indonesian, Interlingua (International Auxiliary Language Association), Irish, Italian, Japanese, Khmer, Norwegian, Polish, Portuguese, Romanian, Russian, Serbian, Slovak, Spanish, Swedish, Turkish, Ukrainian, and Welsh
- Rights:
- GFDL or CC and http://www.omegawiki.org/Licensing
7. Speecon databases
- Type:
- corpus
- Language:
- Czech, Danish, Dutch, English, Finnish, French, German, Hungarian, Italian, Polish, Portuguese, Russian, Spanish, Swedish, Turkish, Chinese, Hebrew, Japanese, Korean, and Thai
- Description:
- 28 speech databases containing broadband recordings from 550 adults and 50 children per language. Contains interesting phonetically rich material. All orthographically transcribed. Speaker information included for gender, age, accent. Including pronunciation lexicon.
- Rights:
- Not specified
8. Stříbrná a modrá :
- Creator:
- Heroldová, Helena,
- Type:
- text, statický obraz, and monografie
- Subject:
- Oděv, móda, ozdoby, Dějiny Číny, Mongolska a Koreje, šperky, řemesla umělecká, sbírky muzejní, sbírky umělecké, Čína, přehledná zpracování světových dějin (chronologicky), and hmotná kultura, umělecká řemesla
- Language:
- Czech, English, and Chinese
- Rights:
- unknown
9. Subtitle Word Frequencies
- Publisher:
- Center for Reading Research, Ghent University
- Type:
- lexicalConceptualResource
- Language:
- Chinese, Dutch, English, German, Modern Greek (1453-), and Spanish
- Rights:
- Not specified
10. Wortschatz
- Publisher:
- University of Leipzig
- Type:
- corpus
- Language:
- Afrikaans, Albanian, Bulgarian, Catalan, Chinese, Croatian, Czech, Danish, Dutch, English, Esperanto, Estonian, Finnish, French, German, Hungarian, Icelandic, Indonesian, Italian, Japanese, Korean, Latin, Latvian, Lithuanian, Malay (macrolanguage), Norwegian, Occitan (post 1500), Romanian, Russian, Slovak, Slovenian, Spanish, Sundanese, Swedish, Tagalog, Turkish, Vietnamese, and Welsh
- Description:
- Collected from newspaper texts, webcrawling, etc.: words (+frequency), cooccurrences (+graph), left/right neighbours, example sentences
- Rights:
- Not specified