1 - 7 of 7
Number of results to display per page
Search Results
2. CorpusExplorer
- Creator:
- Rüdiger, Jan Oliver
- Publisher:
- Jan Oliver Rüdiger
- Type:
- tool and toolService
- Subject:
- Corpus Linguisitics, NLP, conll, tei, XML, nlp, Natural Language Processing, linguistics, Linguistics, Computational Linguistics, corpus processing, tagger, POS tagger, lemmatization, text cleaning, CommonCrawl, epub, JSON, Twitter, Pandoc, Wikipedia, digital data, DTA, DSpin, MySQL, ElasticSearch, TextGrid, text corpora, TigerXML, and WeblichtXML
- Language:
- German, English, French, Italian, Dutch, Spanish, Polish, Arabic, Chinese, and Portuguese
- Description:
- Software for corpus linguists and text/data mining enthusiasts. The CorpusExplorer combines over 45 interactive visualizations under a user-friendly interface. Routine tasks such as text acquisition, cleaning or tagging are completely automated. The simple interface supports the use in university teaching and leads users/students to fast and substantial results. The CorpusExplorer is open for many standards (XML, CSV, JSON, R, etc.) and also offers its own software development kit (SDK). Source code available at https://github.com/notesjor/corpusexplorer2.0
- Rights:
- Not specified
3. Dutch Bilingualism Data Base (DBD)
- Publisher:
- Radboud University Nijmegen, Max Planck Institute for Psycholinguistics, Meertens Institute KNAW The Netherlands, and Babylon Centre for Studies of Multilingualism in the Multicultural Society
- Type:
- corpus
- Language:
- Arabic, Dutch, and Turkish
- Description:
- Audio recordings, transcripts,
- Rights:
- Not specified
4. ElixirFM
- Creator:
- Smrž, Otakar, Bielický, Viktor, and Buckwalter, Tim
- Publisher:
- Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
- Type:
- toolService
- Subject:
- Arabic morphology and ElixirFM
- Language:
- Arabic
- Description:
- ElixirFM is a high-level implementation of Functional Arabic Morphology documented at http://elixir-fm.wiki.sourceforge.net/. The core of ElixirFM is written in Haskell, while interfaces in Perl support lexicon editing and other interactions.
- Rights:
- http://opensource.org/licenses/GPL-3.0
5. JIRS
- Publisher:
- Grid and High Performance Computing Group, ITACA, Universidad Politécnica de Valencia and Universidad de Alicante
- Type:
- toolService
- Language:
- Arabic, English, French, Italian, Oromo, and Urdu
- Description:
- JIRS is a Passage Retrieval system specially suited for Question Answering. It could be adapted to others languages very easily. ask (Written Language): Information Retrieval Applications Question/Answering Environment: OS-independent Access: GPLv3
- Rights:
- Not specified
6. OmegaWiki
- Publisher:
- Universität Bamberg, World Language Documentation Centre
- Format:
- application/octet-stream
- Type:
- lexicalConceptualResource
- Language:
- Afrikaans, Arabic, Basque, Bulgarian, Catalan, Chinese, Czech, Danish, Dutch, English, Esperanto, Estonian, Finnish, French, Galician, Georgian, Modern Greek (1453-), Hebrew, Hungarian, Icelandic, Indonesian, Interlingua (International Auxiliary Language Association), Irish, Italian, Japanese, Khmer, Norwegian, Polish, Portuguese, Romanian, Russian, Serbian, Slovak, Spanish, Swedish, Turkish, Ukrainian, and Welsh
- Rights:
- GFDL or CC and http://www.omegawiki.org/Licensing
7. TITUS Arabic
- Format:
- text/html
- Type:
- corpus
- Language:
- Arabic
- Description:
- ca. 100.000 tokens; linked with relational database; XML-encoding in progress
- Rights:
- http://titus.uni-frankfurt.de/texte/texte2.htm#Estart