Show simple item record
dc.contributor.other |
Boleda, Gemma |
dc.date.accessioned |
2014-07-30T21:26:58Z |
dc.date.available |
2014-07-30T21:26:58Z |
dc.date.issued |
2014-07-30 |
dc.identifier.uri |
http://hdl.handle.net/11372/LRT-1105 |
dc.description |
Trilingual corpus (Catalan, Spanish, English) that contains large portions of Wikipedia (based on a 2006 dump) and has been automatically enriched with linguistic information. In its present version, it contains over 750 million words. |
dc.language.iso |
cat |
dc.language.iso |
eng |
dc.language.iso |
spa |
dc.publisher |
Centro de Tecnologías y Aplicaciones del Lenguaje y del Habla (TALP) |
dc.source.uri |
http://www.lsi.upc.edu/~nlp/wikicorpus/ |
dc.subject |
trilingual corpus |
dc.title |
Wikicorpus |
dc.type |
corpus |
has.files |
no |
additional.metadata |
Nid:3379
Readily Available (field_resource_available):Yes |
branding |
LRT + Open Submissions |
dc.coverage.placeName |
Spain |
files.size |
0 |
files.count |
0 |