Where’s Wally? A machine learning approach
Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review
Standard Standard
Computer Graphics & Visual Computing (CGVC) 2021. ed. / K. Xu; M. Turner. The Eurographics Association, 2021. p. 28-31.
Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review
HarvardHarvard
APA
CBE
MLA
VancouverVancouver
Author
RIS
TY - GEN
T1 - Where’s Wally? A machine learning approach
AU - Barthelmes, Tobias
AU - Vidal, Franck
N1 - Conference code: 39
PY - 2021/9
Y1 - 2021/9
N2 - Object detection has been implemented in all sorts of real-life scenarios such as facial recognition, traffic monitoring and medical imaging but the research that has gone into object detection in drawings and cartoons is not nearly as extensive. The Where's Wally puzzle books give a good opportunity to implement some of these real-life methods into the fictional world. The Wally detection framework proposed is composed of two stages: i) a Haar-cascade classifier based on the Viola-Jones framework, which detects possible candidates from a scenario from the Where's Wally books, and ii) a lightweight convolutional neural network (CNN) that re-labels the objects detected by the cascade classifier. The cascade classifier was trained on 85 positive images and 172 negative images. It was then applied to 12 test images, which produced over 400 false positives. To increase the accuracy of the models, hard negative mining was implemented. The framework achieved a recall score of 84.61% and an F1 score of 78.54%. Improvements could be made to the training data or the CNN to further increase these scores.
AB - Object detection has been implemented in all sorts of real-life scenarios such as facial recognition, traffic monitoring and medical imaging but the research that has gone into object detection in drawings and cartoons is not nearly as extensive. The Where's Wally puzzle books give a good opportunity to implement some of these real-life methods into the fictional world. The Wally detection framework proposed is composed of two stages: i) a Haar-cascade classifier based on the Viola-Jones framework, which detects possible candidates from a scenario from the Where's Wally books, and ii) a lightweight convolutional neural network (CNN) that re-labels the objects detected by the cascade classifier. The cascade classifier was trained on 85 positive images and 172 negative images. It was then applied to 12 test images, which produced over 400 false positives. To increase the accuracy of the models, hard negative mining was implemented. The framework achieved a recall score of 84.61% and an F1 score of 78.54%. Improvements could be made to the training data or the CNN to further increase these scores.
U2 - 10.2312/cgvc.20211313
DO - 10.2312/cgvc.20211313
M3 - Conference contribution
SP - 28
EP - 31
BT - Computer Graphics & Visual Computing (CGVC) 2021
A2 - Xu, K.
A2 - Turner, M.
PB - The Eurographics Association
T2 - EG UK Computer Graphics & Visual Computing (CGVC)
Y2 - 8 September 2021 through 10 September 2021
ER -