SUBTLEX-CY: A new word frequency database for Welsh

Research output: Contribution to journal › Article › peer-review

Standard

SUBTLEX-CY: A new word frequency database for Welsh. / van Heuven, Walter J.B.; Payne, Joshua S.; Jones, Manon.
In: Quarterly Journal of Experimental Psychology, 30.08.2023.

Harvard

van Heuven, WJB, Payne, JS & Jones, M 2023, 'SUBTLEX-CY: A new word frequency database for Welsh', Quarterly Journal of Experimental Psychology. https://doi.org/10.1177/17470218231190315

APA

van Heuven, W. J. B., Payne, J. S., & Jones, M. (2023). SUBTLEX-CY: A new word frequency database for Welsh. Quarterly Journal of Experimental Psychology. Advance online publication. https://doi.org/10.1177/17470218231190315

CBE

van Heuven WJB, Payne JS, Jones M. 2023. SUBTLEX-CY: A new word frequency database for Welsh. Quarterly Journal of Experimental Psychology. https://doi.org/10.1177/17470218231190315

MLA

van Heuven, Walter J.B., Joshua S. Payne, and Manon Jones. "SUBTLEX-CY: A new word frequency database for Welsh". Quarterly Journal of Experimental Psychology. 2023. https://doi.org/10.1177/17470218231190315

Vancouver

van Heuven WJB, Payne JS, Jones M. SUBTLEX-CY: A new word frequency database for Welsh. Quarterly Journal of Experimental Psychology. 2023 Aug 30. Epub 2023 Aug 30. doi: https://doi.org/10.1177/17470218231190315

Author

van Heuven, Walter J.B. ; Payne, Joshua S. ; Jones, Manon. / SUBTLEX-CY: A new word frequency database for Welsh. In: Quarterly Journal of Experimental Psychology. 2023.

RIS

TY - JOUR

T1 - SUBTLEX-CY: A new word frequency database for Welsh

AU - van Heuven, Walter J.B.

AU - Payne, Joshua S.

AU - Jones, Manon

PY - 2023/8/30

Y1 - 2023/8/30

N2 - We present SUBTLEX-CY, a new word frequency database created from a 32-million-word corpus of Welsh television subtitles. An experiment comprising a lexical decision task examined SUBTLEX-CY frequency estimates against words with inconsistent frequencies in a much smaller Welsh corpus that is often used by researchers, the Cronfa Electroneg o’r Gymraeg (CEG), and three other Welsh word frequency databases. Words were selected that were classified as low frequency (LF) in SUBTLEX-CY and high frequency (HF) in CEG and compared with words that were classified as medium frequency (MF) in both SUBTLEX-CY and CEG. Reaction time analyses showed that HF words in CEG were responded to more slowly compared to MF words, suggesting that SUBTLEX-CY corpus provides a more reliable estimate of Welsh word frequencies. The new Welsh word frequency database that also includes part-of-speech, contextual diversity, and other lexical information is freely available for research purposes on the Open Science Framework repository at https://osf.io/9gkqm/.

AB - We present SUBTLEX-CY, a new word frequency database created from a 32-million-word corpus of Welsh television subtitles. An experiment comprising a lexical decision task examined SUBTLEX-CY frequency estimates against words with inconsistent frequencies in a much smaller Welsh corpus that is often used by researchers, the Cronfa Electroneg o’r Gymraeg (CEG), and three other Welsh word frequency databases. Words were selected that were classified as low frequency (LF) in SUBTLEX-CY and high frequency (HF) in CEG and compared with words that were classified as medium frequency (MF) in both SUBTLEX-CY and CEG. Reaction time analyses showed that HF words in CEG were responded to more slowly compared to MF words, suggesting that SUBTLEX-CY corpus provides a more reliable estimate of Welsh word frequencies. The new Welsh word frequency database that also includes part-of-speech, contextual diversity, and other lexical information is freely available for research purposes on the Open Science Framework repository at https://osf.io/9gkqm/.

U2 - https://doi.org/10.1177/17470218231190315

DO - https://doi.org/10.1177/17470218231190315

M3 - Article

JO - Quarterly Journal of Experimental Psychology

JF - Quarterly Journal of Experimental Psychology

SN - 1747-0218

ER -