Harvested from: LINDAT/CLARIAH-CZ repository - LINDAT/CLARIAH-CZ Catalog Search Results

2041. TreeTagger

Publisher:: University of Stuttgart
Type:: toolService
Subject:: POS tagger
Language:: Bulgarian, Dutch, English, French, German, Modern Greek (1453-), Italian, Portuguese, Russian, Spanish, and Swahili (macrolanguage)
Description:: A part-of-speech tagger and lemmatizer for several languages.
Rights:: Not specified

2042. Treex::Web

Creator:: Sedlák, Michal
Publisher:: Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
Type:: toolService and service
Subject:: Treex, Perl, REST, web service, and machine translation
Language:: English and Czech
Description:: Treex::Web is a web frontend for running Treex applications from your browser. Treex (formerly TectoMT) is a highly modular NLP framework implemented in Perl programming language. It is primarily aimed at Machine Translation, making use of the ideas and technology created during the Prague Dependency Treebank project.
Rights:: Not specified

2043. Trhový Štěpánov

Creator:: Aktualita
Publisher:: Národní filmový archiv
Type:: video and clip
Subject:: oběd v provizorním obydlí, obydlí provizorní, vagony železniční, uprchlíci z pohraničí, dohoda Mnichovská následky, děti uprchlíků, pohraničí 1938, děti uprchlíků z pohraničí, bydlení provizorní, Mnichovská dohoda, and Československý zvukový týdeník Aktualita::1938/43
Language:: Czech
Description:: The segment of Československý zvukový týdeník Aktualita (Czechoslovak Aktualita Sound Newsreel), 1938, issue no. 43 offers an insight into the lives of Czech railway workers in the aftermath of the Sudetenland annexation after they find makeshift shelter in decommissioned railway carriages in the Posázaví region. The footage also shows railway workers from Obrnice u Mostu living in railway carriages.
Rights:: http://creativecommons.org/licenses/by-nc-nd/4.0/, PUB, and Creative Commons - Attribution-NonCommercial-NoDerivatives 4.0 International (CC BY-NC-ND 4.0)

2044. Trova

Publisher:: Max Planck Institute for Psycholinguistics
Type:: toolService
Subject:: search engine and corpus search
Description:: Trova is a search engine for annotation content archived at The Language Archive. Searchable formats include ELAN EAF, Childes CHAT, Toolbox, PDF, SubRip, Praat TextGrid and others.
Rights:: Not specified

2045. Turkish Natural Language Processing Pipeline

Publisher:: Natural Language Processing Group, Computer Science Department, Istanbul Technical University
Type:: toolService
Language:: Turkish
Description:: This is a state-of-the-art pipeline of Turkish NLP tools (sentence splitting, tokenisation, normalisation, deasciification, vowelisation, spelling correction, morphological analysis/disambiguation, named entity recognition, dependency parsing). The platform operates as a SaaS (Software as a Service) and provides the researchers and the students the state of the art NLP tools in many layers: preprocessing, morphology, syntax and entity recognition. The users may communicate with the platform via three channels: via a user friendly web interface, file uploads, AP.
Rights:: Not specified

2046. TXM

Creator:: Heiden, Serge
Publisher:: ENS de Lyon - CNRS, ICAR Laboratory and Université de Franche-Compté, laboratoire ELLIADD (Edition, Littératures, Langages, Informatique, Arts, Didactique, Discours)
Type:: tool and toolService
Subject:: textometry, xml, tei, nlp, cqp, r, textual data analysis, statistical text analysis, text mining, and concordance
Description:: TXM is a free and open-source cross-platform Unicode & XML based text/corpus analysis environment and graphical client, supporting Windows, Linux and Mac OS X. It can also be used online as a J2EE standard compliant web portal (GWT based) with access control built in.
Rights:: Not specified

2047. TXM 0.6

Publisher:: ENS de Lyon - CNRS, ICAR Laboratory
Type:: toolService
Description:: TXM is a Unicode - XML & TEI compatible text/corpus analysis environment and graphical client based on the CQP search engine and the R statistical environment (http://textometrie.ens-lyon.fr/?lang=en).
Rights:: EPL V1.0; GNU GPL V2.0; GNU GPL V3.0; GNU LGPL V2.1 and Copyright © 2010-2013 ENS de Lyon; Copyright © 2007-2010 ENS de Lyon, CNRS, INRP, University of Lyon 2, University of Franche-Comté, University of Nice Sophia Antipolis, University of Paris 3.

2048. Typological Database System

Publisher:: Max Planck Institute for Psycholinguistics, University of Utrecht/Netherlands Graduate School of Linguistics, Data Archiving and Networked Services, and Meertens Institute KNAW The Netherlands
Type:: toolService
Subject:: typological database
Language:: English
Description:: The Typological Database System (TDS) is a web-based service that provides integrated access to a collection of independently created typological databases. It was developed with support from NWO grant 380-30-004 / INV-03-12 and from participating universities, and provides continued availability and extended documentation for its component databases, through a uniform structure and search interface. Web technologies evolve rapidly, and the system had begun to show its age even before the end of the project in 2009, motivating migration of the data collection to an archival platform. Through its Project Call 1, CLARIN-NL granted funding for migrating the resource to a durable, archival environment and converting it to a true web service architecture.
Rights:: Not specified

2049. UDify Pretrained Model

Creator:: Kondratyuk, Dan and Straka, Milan
Publisher:: Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
Type:: tool and toolService
Subject:: syntax, dependency parser, and universal dependencies
Language:: Ancient Greek (to 1453), Arabic, Basque, Bulgarian, Croatian, Czech, Danish, Dutch, English, Estonian, Finnish, French, German, Gothic, Modern Greek (1453-), Hebrew, Hindi, Hungarian, Indonesian, Irish, Italian, Japanese, Latin, Norwegian, Church Slavic, Persian, Polish, Portuguese, Romanian, Slovenian, Spanish, Swedish, Tamil, Catalan, Chinese, Galician, Kazakh, Latvian, Russian, Turkish, Coptic, Sanskrit, Slovak, Ukrainian, Uighur, Vietnamese, Belarusian, Korean, Lithuanian, Urdu, Russia Buriat, Northern Kurdish, Northern Sami, Upper Sorbian, Afrikaans, Yue Chinese, Marathi, Serbian, Swedish Sign Language, Telugu, Amharic, Armenian, Breton, Faroese, Komi-Zyrian, Nigerian Pidgin, Old French (842-ca. 1400), Tagalog, Thai, Warlpiri, Yoruba, Akkadian, Bambara, Erzya, and Maltese
Description:: Pretrained model weights for the UDify model, and extracted BERT weights in pytorch-transformers format. Note that these weights slightly differ from those used in the paper.
Rights:: Creative Commons - Attribution-ShareAlike 4.0 International (CC BY-SA 4.0), http://creativecommons.org/licenses/by-sa/4.0/, and PUB

2050. UDPipe

Creator:: Straka, Milan and Straková, Jana
Publisher:: Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
Type:: tool and toolService
Subject:: tokenizer, POS tagger, tagger, lemmatization, parser, dependency parser, and CoNLL-U
Language:: English
Description:: UDPipe is an trainable pipeline for tokenization, tagging, lemmatization and dependency parsing of CoNLL-U files. UDPipe is language-agnostic and can be trained given only annotated data in CoNLL-U format. Trained models are provided for nearly all UD treebanks. UDPipe is available as a binary, as a library for C++, Python, Perl, Java, C#, and as a web service. UDPipe is a free software under Mozilla Public License 2.0 (http://www.mozilla.org/MPL/2.0/) and the linguistic models are free for non-commercial use and distributed under CC BY-NC-SA (http://creativecommons.org/licenses/by-nc-sa/4.0/) license, although for some models the original data used to create the model may impose additional licensing conditions. UDPipe is versioned using Semantic Versioning (http://semver.org/). UDPipe website http://ufal.mff.cuni.cz/udpipe contains download links of both the released packages and trained models, hosts documentation and offers online demo. UDPipe development repository http://github.com/ufal/udpipe is hosted on GitHub.
Rights:: Mozilla Public License 2.0, http://opensource.org/licenses/MPL-2.0, and PUB

2041. TreeTagger

2042. Treex::Web

2043. Trhový Štěpánov

2044. Trova

2045. Turkish Natural Language Processing Pipeline

2046. TXM

2047. TXM 0.6

2048. Typological Database System

2049. UDify Pretrained Model

2050. UDPipe

Limit your search

Show values starting with

Show values starting with

Show values starting with

Show values starting with

Show values starting with

Show values starting with

Show values starting with

Show values starting with

Search

Search Constraints

Search Results

Limit your search

Contributor

Show values starting with

Coverage

Show values starting with

Creator

Show values starting with

Format

Language

Show values starting with

Publisher

Show values starting with

Rights

Show values starting with

Subject

Show values starting with

Type

Show values starting with

Date

Original context has metadata only

Harvested from