Scottish Corpus of Texts and Speech

The Scottish Corpora project has created large electronic corpora of written and spoken texts for the languages of Scotland. The Scottish Corpus of Texts & Speech (SCOTS) has been online since November 2004, and, after a number of updates and additions, has reached a total of nearly 4.6 million words of text, with audio recordings to accompany many of the spoken texts. A sister resource, the Corpus of Modern Scottish Writing, was launched in 2010, and now comprises 5.4 million words of written text with accompanying images.

 http://www.scottishcorpus.ac.uk/search/

The Corpus of Modern Scottish Writing (CMSW) is an electronic corpus of written and printed texts from the period 1700-1945, complementing the Helsinki Corpus of Older Scots (1450-1700) and the Scottish Corpus of Texts & Speech (1945-present day). CMSW contains over 350 documents, containing approximately 5.5 million words of text overall.