The database will contain an etymological lexicon of the Saami languages, complete with detailed source citations. It will open to the public in November 2006 and will be updated regularly.
The Audio Recordings Archive (Suomen kielen nauhoitearkisto) holds over 23,000 hours of recordings collected since 1959, providing authentic samples of Finnish dialects, languages related to Finnish, and other world languages. The collection also includes samples of Finnish dialects spoken in Sweden, Norway, Ingria, the United States and Australia. Digitisation of the archive began in 1999, and about 13,000 hours of recordings, over half of the content, have been digitised.
This corpus contains a variety of works written in Finnish published between 1809 and 1899, such as newspapers, periodicals, almanacs, and decrees.
The corpus contains 8,976,561 words and is available for online browsing.
The classics of Finnish literature corpus contains works by established Finnish fiction writers from the 1880s to the 1930s. The corpus is part-of-speech tagged and available for online browsing via the concordancer Korp.
This is a linguistically unannotated corpus of various historical texts written between 1543 and 1809.
The corpus consists of 3,428,618 words and is available for online browsing.