
SUBTLEX-CY: A new word frequency database for Welsh

  • University of Nottingham
  • Glyndwr University

Research output: Contribution to journal › Article › peer-review

161 Downloads (Pure)

Abstract

We present SUBTLEX-CY, a new word frequency database created from a 32-million-word corpus of Welsh television subtitles. An experiment comprising a lexical decision task examined SUBTLEX-CY frequency estimates against words with inconsistent frequencies in a much smaller Welsh corpus that is often used by researchers, the Cronfa Electroneg o’r Gymraeg (CEG), and three other Welsh word frequency databases. Words were selected that were classified as low frequency (LF) in SUBTLEX-CY and high frequency (HF) in CEG and compared with words that were classified as medium frequency (MF) in both SUBTLEX-CY and CEG. Reaction time analyses showed that the words classified as HF in CEG (and LF in SUBTLEX-CY) were responded to more slowly than the MF words, suggesting that the SUBTLEX-CY corpus provides a more reliable estimate of Welsh word frequencies. The new Welsh word frequency database, which also includes part-of-speech, contextual diversity, and other lexical information, is freely available for research purposes on the Open Science Framework repository at https://osf.io/9gkqm/.
Original language: English
Pages (from-to): 1052–1067
Number of pages: 16
Journal: Quarterly Journal of Experimental Psychology
Volume: 77
Issue number: 5
Early online date: 30 August 2023
Digital Object Identifiers (DOIs)
Status: Published - May 2024
