Harvested from: LINDAT/CLARIAH-CZ repository / Language: English / Original context has metadata only: true

21. Corpus CLUVI

Publisher:: TALG Research Group (University of Vigo)
Type:: corpus
Language:: Basque, Catalan, English, French, Galician, German, Portuguese, and Spanish
Description:: Parallel corpus, 22 million words
Rights:: Not specified

22. Corpus d’extractes de gravacions d’Internet en temps aparent (TA) i temps real (TR) amb finalitats forenses

Publisher:: Institut Universitari de Lingüística Aplicada, Universitat Pompeu Fabra
Type:: corpus
Subject:: corpus
Language:: English
Rights:: Not specified

23. Corpus de narratives d’angloparlants immigrats a Espanya en temps aparent (TA)

Publisher:: Institut Universitari de Lingüística Aplicada, Universitat Pompeu Fabra
Type:: corpus
Language:: English
Description:: Oral corpus containing 166 narratives in English elicited by means of Labovian techniques. Participants come from the UK (England, Wales, Scotland), Ireland, the USA, Australia, and South Africa.
Rights:: Not specified

24. Corpus of Early English Correspondence Sampler (CEECS)

Publisher:: University of Helsinki
Format:: text/plain
Type:: corpus
Language:: English
Description:: Personal correspondence from England written between 1418 and 1680. Compiled as a tool for historical sociolinguistics.
Rights:: Not specified

25. Corpus Tècnic de l'IULA

Publisher:: Institut Universitari de Lingüística Aplicada, Universitat Pompeu Fabra
Type:: corpus
Language:: Catalan, English, and Spanish
Description:: Domain-specific corpus (Law, Economy, Computing, Medicine, and Environment, plus a contrastive corpus from the press); EN 3.3 M tokens, SP 33 M tokens, CAT 19 M tokens; EAGLES POS tagset
Rights:: Not specified

26. CorpusExplorer

Creator:: Rüdiger, Jan Oliver
Publisher:: Jan Oliver Rüdiger
Type:: tool and toolService
Subject:: Corpus Linguistics, NLP, conll, tei, XML, nlp, Natural Language Processing, linguistics, Linguistics, Computational Linguistics, corpus processing, tagger, POS tagger, lemmatization, text cleaning, CommonCrawl, epub, JSON, Twitter, Pandoc, Wikipedia, digital data, DTA, DSpin, MySQL, ElasticSearch, TextGrid, text corpora, TigerXML, and WeblichtXML
Language:: German, English, French, Italian, Dutch, Spanish, Polish, Arabic, Chinese, and Portuguese
Description:: Software for corpus linguists and text/data mining enthusiasts. The CorpusExplorer combines over 45 interactive visualizations under a user-friendly interface. Routine tasks such as text acquisition, cleaning, or tagging are completely automated. The simple interface supports use in university teaching and leads users and students to fast and substantial results. The CorpusExplorer supports many standards (XML, CSV, JSON, R, etc.) and also offers its own software development kit (SDK). Source code available at https://github.com/notesjor/corpusexplorer2.0
Rights:: Not specified

27. Croatian-English Parallel Corpus

Publisher:: University of Zagreb, Faculty of Humanities and Social Sciences
Type:: corpus
Language:: Croatian and English
Description:: written; domain-specific (newspaper); synchronic; bilingual; parallel; unidirectional; XML; S-alignment
Rights:: Not specified

28. CST's lemmatiser

Publisher:: Center for Sprogteknologi, University of Copenhagen
Type:: toolService
Language:: Danish, Dutch, English, German, Modern Greek (1453-), Icelandic, Norwegian, Russian, Slovenian, and Swedish
Description:: 1) Fully automatic rule-based lemmatization of inflected languages; 2) fully automatic training of lemmatization rules from a full form-lemma list
Rights:: Not specified

29. Dependency Grammars

Publisher:: Universitat de Barcelona
Type:: languageDescription
Subject:: dependency grammar
Language:: Catalan, English, and Spanish
Description:: Dependency grammars
Rights:: Not specified

30. Diachronic Corpus of Present-Day Spoken English (DCPSE)

Publisher:: Survey of English Usage, University College London
Type:: corpus
Language:: English
Description:: A parsed corpus of spoken English. About 400,000 words from ICE-GB (early 1990s) and 400,000 words from the London-Lund Corpus (late 1960s to early 1980s). The orthographic transcriptions have been normalised and annotated.
Rights:: Not specified
