SCOTS: Scottish Corpus Of Texts and Speech

The Scottish Corpora project has created large electronic corpora of written and spoken texts for the languages of Scotland, featuring nearly 4.6 million words of text, with audio recordings to accompany many of the spoken texts.

An advanced search for 'gonnae'
An advanced search for 'gonnae'

Recent years have brought significant changes to the political situation in Scotland. This new political situation has been accompanied by a resurgence of interest in the languages and culture of Scotland.

Advances in computer technology have made it possible to store and analyse large quantities of information in ways which were previously unthinkable. As a result, in recent years much research in the Humanities has focused on the building of large text archives and corpora. Such resources offer exciting opportunities to study language on a broad scale and with a precision which would otherwise be impossible.

The Scottish Corpora project has created large electronic corpora of written and spoken texts for the languages of Scotland. The Scottish Corpus of Texts & Speech (SCOTS) has been online since November 2004, and, after a number of updates and additions, has reached a total of nearly 4.6 million words of text, with audio recordings to accompany many of the spoken texts.

The corpus allow those interested in Scotland’s linguistic diversity, and in Scottish culture and identity, to investigate the languages of Scotland in new ways, and to address the gap which presently exists in our knowledge of these.

Project website: https://www.scottishcorpus.ac.uk/


Main contact: Wendy Anderson

Developers: Brian AitkenDavid Beavan

Start year: 2002

End year: 2004

Funded by: AHRC

Subject area: English Language & Linguistics

Keywords: Corpus LinguisticsMappingScotsVisualisation

Record last updated 2020-01-13