Zobrazit minimální záznam
dc.contributor.author |
Simov, Kiril |
dc.contributor.author |
Osenova, Petya |
dc.contributor.other |
Simov, Kiril |
dc.date.accessioned |
2014-07-30T21:33:43Z |
dc.date.available |
2014-07-30T21:33:43Z |
dc.date.issued |
2014-07-30 |
dc.identifier.uri |
http://hdl.handle.net/11372/LRT-1241 |
dc.description |
It is used morphological lexicon of Bulgarian (100 000 lemmas) compiled as a finite-state automaton in CLaRK System. It requires the text to be first tokenized and it is applied in each token. Includes also guessers for unknown words and Named Entities gazetteers. If the corresponding resources are available for a different language, then it can be tuned to it. |
dc.publisher |
Linguistic Modeling Department, IPP, Bulgarian Academy of Sciences |
dc.title |
BulTreeBank Morphological Analyzer |
dc.type |
toolService |
has.files |
no |
additional.metadata |
Language(s) of input data (field_tool_input_language):Bulgarian
Implementation language(s) (field_tool_implementation_langu):Java
Software requirements (field_tool_software_requirement):Implemented in CLaRK
Webservice link (field_tool_webservice_link):http://www.bultreebank.org/clark/index.html
Availibility (field_tool_availibility):Free for use on request, but can not be distributed. It will be provided as a web service within CLARIN.
Nid:994
System requirements (field_tool_system_requirements):Java
Platform(s) (field_tool_platform):Used under Windows, Linux
Character encoding of output data (field_tool_char_encoding_output):Unicode (UTF-8)
Documentation link (field_tool_document_link):not available
Approach (field_tool_aproach):finite-state
Open source code (field_tool_open_source_code):no
Language(s) of output data (field_tool_output_language):Bulgarian
Character encoding of input data (field_tool_char_encoding):Unicode (UTF-8)
Relevant project(s) (field_tool_relevant_project):BulTreeBank (www.bultreebank.org) |
branding |
LRT + Open Submissions |
dc.coverage.placeName |
Bulgaria |
files.size |
0 |
files.count |
0 |
Zobrazit minimální záznam