dc.contributor.author | Grella, Matteo |
dc.date.accessioned | 2017-10-06T07:05:57Z |
dc.date.available | 2017-10-06T07:05:57Z |
dc.date.issued | 2011 |
dc.identifier.uri | http://hdl.handle.net/11372/LRT-2476 |
dc.description | This resource is an Italian morphological dictionary for content words, encoded in a JSON Lines format text file. It contains correspondences between surface form and lexical forms of words followed by grammatical features. The surface word forms have been generated algorithmically by using stable phonological and morphological rules of the Italian language. Particular attention has been given to the generation of verbs for which rules have been extracted from the famous A.L e G. Lepschy, La lingua italiana. The dictionary with its remarkable coverage is particularly useful used together with the Italian Function Words (http://hdl.handle.net/11372/LRT-2288) for tasks such as POS-Tagging or Syntactic Parsing. |
dc.language.iso | ita |
dc.publisher | Matteo Grella |
dc.relation.isreplacedby | http://hdl.handle.net/11372/LRT-2630 |
dc.rights | Creative Commons - Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0) |
dc.rights.uri | http://creativecommons.org/licenses/by-nc-sa/4.0/ |
dc.subject | morphological dictionary |
dc.title | Italian Content Words |
dc.type | lexicalConceptualResource |
metashare.ResourceInfo#ContentInfo.mediaType | text |
metashare.ResourceInfo#ContentInfo.detailedType | machineReadableDictionary |
dc.rights.label | PUB |
has.files | yes |
branding | LRT + Open Submissions |
contact.person | Matteo Grella matteogrella@gmail.com Matteo Grella |
size.info | 2342120 items |
files.size | 16392115 |
files.count | 1 |
Soubory tohoto záznamu
Licenční kategorie:
Licence: Creative Commons - Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0)
Publicly Available
Licence: Creative Commons - Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0)
- Název
- italian_content_words.rar
- Velikost
- 15.63 MB
- Formát
- application/x-rar-compressed
- Popis
- List of Italian Content Words in JSONL Format
- MD5
- 80c31d0f9a7cc541e8b36419cf045ccb