Using Machine Vision to Estimate Fish Length from Images using Regional Convolutional Neural Networks

Research output: Contribution to journalArticlepeer-review

Standard Standard

Using Machine Vision to Estimate Fish Length from Images using Regional Convolutional Neural Networks. / Monkman, Graham G.; Hyder, Kieran; Kaiser, Michel J. et al.
In: Methods in Ecology and Evolution, Vol. 10, No. 12, 12.2019, p. 2045-2056.

Research output: Contribution to journalArticlepeer-review

HarvardHarvard

Monkman, GG, Hyder, K, Kaiser, MJ & Vidal, FP 2019, 'Using Machine Vision to Estimate Fish Length from Images using Regional Convolutional Neural Networks', Methods in Ecology and Evolution, vol. 10, no. 12, pp. 2045-2056. https://doi.org/10.1111/2041-210X.13282

APA

Monkman, G. G., Hyder, K., Kaiser, M. J., & Vidal, F. P. (2019). Using Machine Vision to Estimate Fish Length from Images using Regional Convolutional Neural Networks. Methods in Ecology and Evolution, 10(12), 2045-2056. https://doi.org/10.1111/2041-210X.13282

CBE

MLA

VancouverVancouver

Monkman GG, Hyder K, Kaiser MJ, Vidal FP. Using Machine Vision to Estimate Fish Length from Images using Regional Convolutional Neural Networks. Methods in Ecology and Evolution. 2019 Dec;10(12):2045-2056. Epub 2019 Aug 10. doi: 10.1111/2041-210X.13282

Author

Monkman, Graham G. ; Hyder, Kieran ; Kaiser, Michel J. et al. / Using Machine Vision to Estimate Fish Length from Images using Regional Convolutional Neural Networks. In: Methods in Ecology and Evolution. 2019 ; Vol. 10, No. 12. pp. 2045-2056.

RIS

TY - JOUR

T1 - Using Machine Vision to Estimate Fish Length from Images using Regional Convolutional Neural Networks

AU - Monkman, Graham G.

AU - Hyder, Kieran

AU - Kaiser, Michel J.

AU - Vidal, Franck P.

PY - 2019/12

Y1 - 2019/12

N2 - 1. An image can encode date time, location and camera information as metadata and implicitly encodes species information and data on human activity, e.g. the size distribution of fish removals. Accurate length estimates can be made from images using a fiducial marker however, their manual extraction is time consuming and estimates are inaccurate without control over the imaging system. This article presents a methodology which uses machine vision to estimate the total length (TL) of a fusiform fish (European sea bass).2. Three regional convolutional neural networks (R CNN) were trained from public images. Images of European sea bass were captured with a fiducial marker with 3 non specialist cameras. Images were undistorted using the intrinsic lens properties calculated for the camera in OpenCV, then TL was estimated using machine vision (MV) to detect both marker and subject. MV performance was evaluated for the three R CNNs under downsampling and rotation of the captured images.3. Each R CNN accurately predicted the location of fish in test images (mean intersection over union, 93%) and estimates of TL were accurate, with percent mean bias error (%MBE [95% CIs]) = 2.2% [2.0, 2.4]). Detections were robust to horizontal flipping and downsampling. TL estimates at absolute image rotations > 20̊ became increasingly inaccurate but %MBE [95% CIs] was reduced to -0.1% [-0.2, 0.1] using machine learning to remove outliers and model bias.4. Machine vision can classify and derive measurements of species from images without specialist equipment. It is anticipated that ecological researchers and managers will make increasing use of MV where image data is collected (e.g. in remote electronic monitoring, virtual observations, wildlife surveys and morphometrics) and MV will be of particular utility where large volumes of image data will be gathered.

AB - 1. An image can encode date time, location and camera information as metadata and implicitly encodes species information and data on human activity, e.g. the size distribution of fish removals. Accurate length estimates can be made from images using a fiducial marker however, their manual extraction is time consuming and estimates are inaccurate without control over the imaging system. This article presents a methodology which uses machine vision to estimate the total length (TL) of a fusiform fish (European sea bass).2. Three regional convolutional neural networks (R CNN) were trained from public images. Images of European sea bass were captured with a fiducial marker with 3 non specialist cameras. Images were undistorted using the intrinsic lens properties calculated for the camera in OpenCV, then TL was estimated using machine vision (MV) to detect both marker and subject. MV performance was evaluated for the three R CNNs under downsampling and rotation of the captured images.3. Each R CNN accurately predicted the location of fish in test images (mean intersection over union, 93%) and estimates of TL were accurate, with percent mean bias error (%MBE [95% CIs]) = 2.2% [2.0, 2.4]). Detections were robust to horizontal flipping and downsampling. TL estimates at absolute image rotations > 20̊ became increasingly inaccurate but %MBE [95% CIs] was reduced to -0.1% [-0.2, 0.1] using machine learning to remove outliers and model bias.4. Machine vision can classify and derive measurements of species from images without specialist equipment. It is anticipated that ecological researchers and managers will make increasing use of MV where image data is collected (e.g. in remote electronic monitoring, virtual observations, wildlife surveys and morphometrics) and MV will be of particular utility where large volumes of image data will be gathered.

KW - European sea bass

KW - convolutional neural networks

KW - fiducial marker

KW - fish length

KW - machine vision

KW - photogrammetry

KW - regional convolutional neural network

KW - videogrammetry

U2 - 10.1111/2041-210X.13282

DO - 10.1111/2041-210X.13282

M3 - Article

VL - 10

SP - 2045

EP - 2056

JO - Methods in Ecology and Evolution

JF - Methods in Ecology and Evolution

SN - 2041-210X

IS - 12

ER -