Where’s Wally? A machine learning approach

Allbwn ymchwil: Pennod mewn Llyfr/Adroddiad/Trafodion CynhadleddCyfraniad i Gynhadleddadolygiad gan gymheiriaid

StandardStandard

Where’s Wally? A machine learning approach. / Barthelmes, Tobias; Vidal, Franck.
Computer Graphics & Visual Computing (CGVC) 2021. gol. / K. Xu; M. Turner. The Eurographics Association, 2021. t. 28-31.

Allbwn ymchwil: Pennod mewn Llyfr/Adroddiad/Trafodion CynhadleddCyfraniad i Gynhadleddadolygiad gan gymheiriaid

HarvardHarvard

Barthelmes, T & Vidal, F 2021, Where’s Wally? A machine learning approach. yn K Xu & M Turner (gol.), Computer Graphics & Visual Computing (CGVC) 2021. The Eurographics Association, tt. 28-31, EG UK Computer Graphics & Visual Computing (CGVC), Lincoln, Y Deyrnas Unedig, 8/09/21. https://doi.org/10.2312/cgvc.20211313

APA

Barthelmes, T., & Vidal, F. (2021). Where’s Wally? A machine learning approach. Yn K. Xu, & M. Turner (Gol.), Computer Graphics & Visual Computing (CGVC) 2021 (tt. 28-31). The Eurographics Association. https://doi.org/10.2312/cgvc.20211313

CBE

Barthelmes T, Vidal F. 2021. Where’s Wally? A machine learning approach. Xu K, Turner M, golygyddion. Yn Computer Graphics & Visual Computing (CGVC) 2021. The Eurographics Association. tt. 28-31. https://doi.org/10.2312/cgvc.20211313

MLA

Barthelmes, Tobias a Franck Vidal "Where’s Wally? A machine learning approach". a Xu, K. Turner, M. (golygyddion). Computer Graphics & Visual Computing (CGVC) 2021. The Eurographics Association. 2021, 28-31. https://doi.org/10.2312/cgvc.20211313

VancouverVancouver

Barthelmes T, Vidal F. Where’s Wally? A machine learning approach. Yn Xu K, Turner M, golygyddion, Computer Graphics & Visual Computing (CGVC) 2021. The Eurographics Association. 2021. t. 28-31 doi: https://doi.org/10.2312/cgvc.20211313

Author

Barthelmes, Tobias ; Vidal, Franck. / Where’s Wally? A machine learning approach. Computer Graphics & Visual Computing (CGVC) 2021. Gol. / K. Xu ; M. Turner. The Eurographics Association, 2021. tt. 28-31

RIS

TY - GEN

T1 - Where’s Wally? A machine learning approach

AU - Barthelmes, Tobias

AU - Vidal, Franck

N1 - Conference code: 39

PY - 2021/9

Y1 - 2021/9

N2 - Object detection has been implemented in all sorts of real-life scenarios such as facial recognition, traffic monitoring and medical imaging but the research that has gone into object detection in drawings and cartoons is not nearly as extensive. The Where's Wally puzzle books give a good opportunity to implement some of these real-life methods into the fictional world. The Wally detection framework proposed is composed of two stages: i) a Haar-cascade classifier based on the Viola-Jones framework, which detects possible candidates from a scenario from the Where's Wally books, and ii) a lightweight convolutional neural network (CNN) that re-labels the objects detected by the cascade classifier. The cascade classifier was trained on 85 positive images and 172 negative images. It was then applied to 12 test images, which produced over 400 false positives. To increase the accuracy of the models, hard negative mining was implemented. The framework achieved a recall score of 84.61% and an F1 score of 78.54%. Improvements could be made to the training data or the CNN to further increase these scores.

AB - Object detection has been implemented in all sorts of real-life scenarios such as facial recognition, traffic monitoring and medical imaging but the research that has gone into object detection in drawings and cartoons is not nearly as extensive. The Where's Wally puzzle books give a good opportunity to implement some of these real-life methods into the fictional world. The Wally detection framework proposed is composed of two stages: i) a Haar-cascade classifier based on the Viola-Jones framework, which detects possible candidates from a scenario from the Where's Wally books, and ii) a lightweight convolutional neural network (CNN) that re-labels the objects detected by the cascade classifier. The cascade classifier was trained on 85 positive images and 172 negative images. It was then applied to 12 test images, which produced over 400 false positives. To increase the accuracy of the models, hard negative mining was implemented. The framework achieved a recall score of 84.61% and an F1 score of 78.54%. Improvements could be made to the training data or the CNN to further increase these scores.

U2 - https://doi.org/10.2312/cgvc.20211313

DO - https://doi.org/10.2312/cgvc.20211313

M3 - Conference contribution

SP - 28

EP - 31

BT - Computer Graphics & Visual Computing (CGVC) 2021

A2 - Xu, K.

A2 - Turner, M.

PB - The Eurographics Association

T2 - EG UK Computer Graphics & Visual Computing (CGVC)

Y2 - 8 September 2021 through 10 September 2021

ER -