Number of results to display per page
Search Results
2042. Treex::Web
- Creator:
- Sedlák, Michal
- Publisher:
- Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
- Type:
- toolService and service
- Subject:
- Treex, Perl, REST, web service, and machine translation
- Language:
- English and Czech
- Description:
- Treex::Web is a web frontend for running Treex applications from your browser. Treex (formerly TectoMT) is a highly modular NLP framework implemented in Perl programming language. It is primarily aimed at Machine Translation, making use of the ideas and technology created during the Prague Dependency Treebank project.
- Rights:
- Not specified
2043. Trhový Štěpánov
- Creator:
- Aktualita
- Publisher:
- Národní filmový archiv
- Type:
- video and clip
- Subject:
- oběd v provizorním obydlí, obydlí provizorní, vagony železniční, uprchlíci z pohraničí, dohoda Mnichovská následky, děti uprchlíků, pohraničí 1938, děti uprchlíků z pohraničí, bydlení provizorní, Mnichovská dohoda, and Československý zvukový týdeník Aktualita::1938/43
- Language:
- Czech
- Description:
- The segment of Československý zvukový týdeník Aktualita (Czechoslovak Aktualita Sound Newsreel), 1938, issue no. 43 offers an insight into the lives of Czech railway workers in the aftermath of the Sudetenland annexation after they find makeshift shelter in decommissioned railway carriages in the Posázaví region. The footage also shows railway workers from Obrnice u Mostu living in railway carriages.
- Rights:
- http://creativecommons.org/licenses/by-nc-nd/4.0/, PUB, and Creative Commons - Attribution-NonCommercial-NoDerivatives 4.0 International (CC BY-NC-ND 4.0)
2044. Trova
- Publisher:
- Max Planck Institute for Psycholinguistics
- Type:
- toolService
- Subject:
- search engine and corpus search
- Description:
- Trova is a search engine for annotation content archived at The Language Archive. Searchable formats include ELAN EAF, Childes CHAT, Toolbox, PDF, SubRip, Praat TextGrid and others.
- Rights:
- Not specified
2045. Turkish Natural Language Processing Pipeline
- Publisher:
- Natural Language Processing Group, Computer Science Department, Istanbul Technical University
- Type:
- toolService
- Language:
- Turkish
- Description:
- This is a state-of-the-art pipeline of Turkish NLP tools (sentence splitting, tokenisation, normalisation, deasciification, vowelisation, spelling correction, morphological analysis/disambiguation, named entity recognition, dependency parsing). The platform operates as a SaaS (Software as a Service) and provides the researchers and the students the state of the art NLP tools in many layers: preprocessing, morphology, syntax and entity recognition. The users may communicate with the platform via three channels: via a user friendly web interface, file uploads, AP.
- Rights:
- Not specified
2046. TXM
- Creator:
- Heiden, Serge
- Publisher:
- ENS de Lyon - CNRS, ICAR Laboratory and Université de Franche-Compté, laboratoire ELLIADD (Edition, Littératures, Langages, Informatique, Arts, Didactique, Discours)
- Type:
- tool and toolService
- Subject:
- textometry, xml, tei, nlp, cqp, r, textual data analysis, statistical text analysis, text mining, and concordance
- Description:
- TXM is a free and open-source cross-platform Unicode & XML based text/corpus analysis environment and graphical client, supporting Windows, Linux and Mac OS X. It can also be used online as a J2EE standard compliant web portal (GWT based) with access control built in.
- Rights:
- Not specified
2047. TXM 0.6
- Publisher:
- ENS de Lyon - CNRS, ICAR Laboratory
- Type:
- toolService
- Description:
- TXM is a Unicode - XML & TEI compatible text/corpus analysis environment and graphical client based on the CQP search engine and the R statistical environment (http://textometrie.ens-lyon.fr/?lang=en).
- Rights:
- EPL V1.0; GNU GPL V2.0; GNU GPL V3.0; GNU LGPL V2.1 and Copyright © 2010-2013 ENS de Lyon; Copyright © 2007-2010 ENS de Lyon, CNRS, INRP, University of Lyon 2, University of Franche-Comté, University of Nice Sophia Antipolis, University of Paris 3.
2048. Typological Database System
- Publisher:
- Max Planck Institute for Psycholinguistics, University of Utrecht/Netherlands Graduate School of Linguistics, Data Archiving and Networked Services, and Meertens Institute KNAW The Netherlands
- Type:
- toolService
- Subject:
- typological database
- Language:
- English
- Description:
- The Typological Database System (TDS) is a web-based service that provides integrated access to a collection of independently created typological databases. It was developed with support from NWO grant 380-30-004 / INV-03-12 and from participating universities, and provides continued availability and extended documentation for its component databases, through a uniform structure and search interface. Web technologies evolve rapidly, and the system had begun to show its age even before the end of the project in 2009, motivating migration of the data collection to an archival platform. Through its Project Call 1, CLARIN-NL granted funding for migrating the resource to a durable, archival environment and converting it to a true web service architecture.
- Rights:
- Not specified
2049. UDify Pretrained Model
- Creator:
- Kondratyuk, Dan and Straka, Milan
- Publisher:
- Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
- Type:
- tool and toolService
- Subject:
- syntax, dependency parser, and universal dependencies
- Language:
- Ancient Greek (to 1453), Arabic, Basque, Bulgarian, Croatian, Czech, Danish, Dutch, English, Estonian, Finnish, French, German, Gothic, Modern Greek (1453-), Hebrew, Hindi, Hungarian, Indonesian, Irish, Italian, Japanese, Latin, Norwegian, Church Slavic, Persian, Polish, Portuguese, Romanian, Slovenian, Spanish, Swedish, Tamil, Catalan, Chinese, Galician, Kazakh, Latvian, Russian, Turkish, Coptic, Sanskrit, Slovak, Ukrainian, Uighur, Vietnamese, Belarusian, Korean, Lithuanian, Urdu, Russia Buriat, Northern Kurdish, Northern Sami, Upper Sorbian, Afrikaans, Yue Chinese, Marathi, Serbian, Swedish Sign Language, Telugu, Amharic, Armenian, Breton, Faroese, Komi-Zyrian, Nigerian Pidgin, Old French (842-ca. 1400), Tagalog, Thai, Warlpiri, Yoruba, Akkadian, Bambara, Erzya, and Maltese
- Description:
- Pretrained model weights for the UDify model, and extracted BERT weights in pytorch-transformers format. Note that these weights slightly differ from those used in the paper.
- Rights:
- Creative Commons - Attribution-ShareAlike 4.0 International (CC BY-SA 4.0), http://creativecommons.org/licenses/by-sa/4.0/, and PUB
2050. UDPipe
- Creator:
- Straka, Milan and Straková, Jana
- Publisher:
- Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
- Type:
- tool and toolService
- Subject:
- tokenizer, POS tagger, tagger, lemmatization, parser, dependency parser, and CoNLL-U
- Language:
- English
- Description:
- UDPipe is an trainable pipeline for tokenization, tagging, lemmatization and dependency parsing of CoNLL-U files. UDPipe is language-agnostic and can be trained given only annotated data in CoNLL-U format. Trained models are provided for nearly all UD treebanks. UDPipe is available as a binary, as a library for C++, Python, Perl, Java, C#, and as a web service. UDPipe is a free software under Mozilla Public License 2.0 (http://www.mozilla.org/MPL/2.0/) and the linguistic models are free for non-commercial use and distributed under CC BY-NC-SA (http://creativecommons.org/licenses/by-nc-sa/4.0/) license, although for some models the original data used to create the model may impose additional licensing conditions. UDPipe is versioned using Semantic Versioning (http://semver.org/). UDPipe website http://ufal.mff.cuni.cz/udpipe contains download links of both the released packages and trained models, hosts documentation and offers online demo. UDPipe development repository http://github.com/ufal/udpipe is hosted on GitHub.
- Rights:
- Mozilla Public License 2.0, http://opensource.org/licenses/MPL-2.0, and PUB