TY - JOUR
T1 - An evaluation of measures to dissociate language and communication disorders from healthy controls using machine learning techniques
AU - Gaspers, J.
AU - Thiele, K.
AU - Cimiano, P.
AU - Foltz, A.
AU - Stenneken, P.
AU - Tscherepanow, M.
PY - 2012/1/28
Y1 - 2012/1/28
N2 - Reliably distinguishing patients with verbal impairment due to brain damage, e.g. aphasia, cognitive communication disorder (CCD), from healthy subjects is an important challenge
in clinical practice. A widely-used method is the application of word generation tasks, using the number of correct responses as a performance measure. Though clinically well-established, its analytical and explanatory power is limited.
In this paper, we explore whether additional features extracted from task performance can be used to distinguish healthy subjects from aphasics or CCD patients. We considered temporal, lexical, and sublexical features and used machine learning techniques to obtain a model that minimizes the empirical risk of classifying participants incorrectly. Depending
on the type of word generation task considered, the exploitation of features with state-of-the-art machine learning techniques outperformed the predictive accuracy of the
clinical standard method (number of correct responses). Our analyses confirmed that number of correct responses is an adequate measure for distinguishing aphasics from healthy
subjects. However, our additional features outperformed the traditional clinical measure in distinguishing patients with CCD from healthy subjects: The best classifcation
performance was achieved by excluding number of correct responses. Overall, our work contributes to the challenging goal of distinguishing patients with verbal impairments from
healthy subjects.
AB - Reliably distinguishing patients with verbal impairment due to brain damage, e.g. aphasia, cognitive communication disorder (CCD), from healthy subjects is an important challenge
in clinical practice. A widely-used method is the application of word generation tasks, using the number of correct responses as a performance measure. Though clinically well-established, its analytical and explanatory power is limited.
In this paper, we explore whether additional features extracted from task performance can be used to distinguish healthy subjects from aphasics or CCD patients. We considered temporal, lexical, and sublexical features and used machine learning techniques to obtain a model that minimizes the empirical risk of classifying participants incorrectly. Depending
on the type of word generation task considered, the exploitation of features with state-of-the-art machine learning techniques outperformed the predictive accuracy of the
clinical standard method (number of correct responses). Our analyses confirmed that number of correct responses is an adequate measure for distinguishing aphasics from healthy
subjects. However, our additional features outperformed the traditional clinical measure in distinguishing patients with CCD from healthy subjects: The best classifcation
performance was achieved by excluding number of correct responses. Overall, our work contributes to the challenging goal of distinguishing patients with verbal impairments from
healthy subjects.
U2 - 10.1145/2110363.2110389
DO - 10.1145/2110363.2110389
M3 - Article
SP - 209
EP - 218
JO - Proceedings of the 2nd ACM SIGHIT International Health Informatics Symposium
JF - Proceedings of the 2nd ACM SIGHIT International Health Informatics Symposium
ER -