dc.contributor.author | Cotgrove, Louis Alexander |
dc.date.accessioned | 2022-06-23T09:39:52Z |
dc.date.available | 2022-06-23T09:39:52Z |
dc.date.issued | 2018 |
dc.identifier.uri | http://hdl.handle.net/11372/LRT-4779 |
dc.description | The NottDeuYTSch corpus contains over 33 million words taken from approximately 3 million YouTube comments from videos published between 2008 to 2018 targeted at a young, German-speaking demographic and represents an authentic language snapshot of young German speakers. The corpus was proportionally sampled based on video category and year from a database of 112 popular German-speaking YouTube channels in the DACH region for optimal representativeness and balance and contains a considerable amount of associated metadata for each comment that enable further longitudinal cross-sectional analyses. |
dc.language.iso | deu |
dc.language.iso | eng |
dc.language.iso | rus |
dc.language.iso | tur |
dc.language.iso | hbs |
dc.publisher | University of Nottingham |
dc.relation.isreplacedby | http://hdl.handle.net/11372/LRT-4806 |
dc.rights | Creative Commons - Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0) |
dc.rights.uri | http://creativecommons.org/licenses/by-nc-sa/4.0/ |
dc.subject | youth language |
dc.subject | Computer-Mediated Communication |
dc.subject | Digitally-Mediated Communication |
dc.subject | CMC |
dc.subject | DMC |
dc.subject | online |
dc.subject | YouTube |
dc.subject | digital |
dc.subject | emoji |
dc.subject | translanguaging |
dc.subject | multilingualism |
dc.subject | social media |
dc.title | Nottinghamer Korpus Deutscher YouTube-Sprache (The NottDeuYTSch Corpus) |
dc.type | corpus |
metashare.ResourceInfo#ContentInfo.mediaType | text |
dc.rights.label | PUB |
has.files | yes |
branding | LRT + Open Submissions |
contact.person | Louis Cotgrove cotgrove@ids-mannheim.de Leibniz-Institut für Deutsche Sprache |
size.info | 33760494 tokens |
size.info | 32549462 words |
files.size | 738297343 |
files.count | 2 |
Soubory tohoto záznamu
Stáhnout všechny soubory záznamu (704.1 MB)Licenční kategorie:
Licence: Creative Commons - Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0)
Publicly Available
Licence: Creative Commons - Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0)
- Název
- NottDeuYTSch_Corpus.rda
- Velikost
- 280.29 MB
- Formát
- Neznámý
- Popis
- NottDeuYTSch Corpus R Data Object
- MD5
- e66260b11688917660e5ca511de4d066
- Název
- ndy296.i5.zip
- Velikost
- 423.81 MB
- Formát
- application/zip
- Popis
- NottDeuYTSch Corpus TEI i5 XML
- MD5
- d96a1a7f5a95b866dbc2bbbc7164900d