Macsen: A Voice Assistant for Speakers of a Lesser Resourced Language

Dewi Jones

Macsen: A Voice Assistant for Speakers of a Lesser Resourced Language

Research output: Contribution to conference › Paper › peer-review

Standard Standard

Macsen: A Voice Assistant for Speakers of a Lesser Resourced Language. / Jones, Dewi.
2020. 194-201 Paper presented at Language Resources and Evaluation Conference , Marseille, Spain.

Research output: Contribution to conference › Paper › peer-review

HarvardHarvard

Jones, D 2020, 'Macsen: A Voice Assistant for Speakers of a Lesser Resourced Language', Paper presented at Language Resources and Evaluation Conference , Marseille, Spain, 11/05/20 - 16/05/20 pp. 194-201. <https://aclanthology.org/2020.sltu-1.27/>

APA

Jones, D. (2020). Macsen: A Voice Assistant for Speakers of a Lesser Resourced Language. 194-201. Paper presented at Language Resources and Evaluation Conference , Marseille, Spain. https://aclanthology.org/2020.sltu-1.27/

CBE

Jones D. 2020. Macsen: A Voice Assistant for Speakers of a Lesser Resourced Language. Paper presented at Language Resources and Evaluation Conference , Marseille, Spain.

MLA

Jones, Dewi Macsen: A Voice Assistant for Speakers of a Lesser Resourced Language. Language Resources and Evaluation Conference , 11 May 2020, Marseille, Spain, Paper, 2020.

VancouverVancouver

Jones D. Macsen: A Voice Assistant for Speakers of a Lesser Resourced Language. 2020. Paper presented at Language Resources and Evaluation Conference , Marseille, Spain.

Author

Jones, Dewi. / Macsen: A Voice Assistant for Speakers of a Lesser Resourced Language. Paper presented at Language Resources and Evaluation Conference , Marseille, Spain.

RIS

TY - CONF

T1 - Macsen: A Voice Assistant for Speakers of a Lesser Resourced Language

AU - Jones, Dewi

PY - 2020/5

Y1 - 2020/5

N2 - This paper reports on the development of a voice assistant mobile app for speakers of a lesser resourced language – Welsh. An assistant with a smaller set of effective but useful skills is both desirable and urgent for the wider Welsh speaking community. Descriptions of the app’s skills, architecture, design decisions and user interface is provided before elaborating on the most recent research and activities in open source speech technology for Welsh. The paper reports on the progress to date on crowdsourcing Welsh speech data in Mozilla Common Voice and of its suitability for training Mozilla’s DeepSpeech speech recognition for a voice assistant application according to conventional and transfer learning methods. We demonstrate that with smaller datasets of speech data, transfer learning and a domain specific language model, acceptable speech recognition is achievable that facilitates, as confirmed by beta users, a practical and useful voice assistant for Welsh speakers. We hope that this work informs and serves as a model to researchers and developers in other lesser-resourced linguistic communities and helps bring into being voice assistant apps for their languages.

AB - This paper reports on the development of a voice assistant mobile app for speakers of a lesser resourced language – Welsh. An assistant with a smaller set of effective but useful skills is both desirable and urgent for the wider Welsh speaking community. Descriptions of the app’s skills, architecture, design decisions and user interface is provided before elaborating on the most recent research and activities in open source speech technology for Welsh. The paper reports on the progress to date on crowdsourcing Welsh speech data in Mozilla Common Voice and of its suitability for training Mozilla’s DeepSpeech speech recognition for a voice assistant application according to conventional and transfer learning methods. We demonstrate that with smaller datasets of speech data, transfer learning and a domain specific language model, acceptable speech recognition is achievable that facilitates, as confirmed by beta users, a practical and useful voice assistant for Welsh speakers. We hope that this work informs and serves as a model to researchers and developers in other lesser-resourced linguistic communities and helps bring into being voice assistant apps for their languages.

KW - Voice Assistants

KW - Welsh Language

KW - speech recognition

KW - transfer learning

KW - Mozilla Common voice

M3 - Paper

SP - 194

EP - 201

T2 - Language Resources and Evaluation Conference

Y2 - 11 May 2020 through 16 May 2020

ER -

Research Portal

Macsen: A Voice Assistant for Speakers of a Lesser Resourced Language

Standard Standard

HarvardHarvard

APA

CBE

MLA

VancouverVancouver

Author

RIS