Building and using the Siarad Corpus

This book is a research monograph divided into two parts. The first part describes the methods used to build the first sizeable corpus of informal conversational data collected from bilingual speakers of Welsh and English: Siarad. The second part describes the linguistic analysis of data from this corpus (available at bangortalk.org.uk). The information in Part One will be useful as a ‘how to’ manual on building a bilingual spoken corpus, including methods of data collection, transcription, glossing and analysis. The findings reported in Part Two throw new light on the debate regarding code-switching vs. borrowing, the application of the Matrix Language Framework (MLF) to the grammar of Welsh-English code-switching, the extralinguistic factors influencing variation in quantity of code-switching, and the extent to which the grammar of Welsh is changing in contact with English. Additional findings by other researchers using the corpus are also reported, and possible future directions are discussed.

Keywords

  • Bilingualism, Celtic languages, Corpus linguistics, English linguistics, Sociolinguistics and dialectology
Original language: English
Publisher: John Benjamins Publishing Company
Number of pages: 199
ISBN (Electronic): 9789027264589
ISBN (Print): 9789027200112
State: Published - 15 Jan 2018

Publication series

Name: Studies in Corpus Linguistics
Publisher: John Benjamins Publishing Company
No.: 81