Adversarial Image Caption Generator Network

Allbwn ymchwil: Cyfraniad at gyfnodolynErthygladolygiad gan gymheiriaid

StandardStandard

Adversarial Image Caption Generator Network. / Mollaahmadi Dehaqi, Ali ; Seydi, Vahid; Madadi, Yeganeh .
Yn: SN Computer Science, Cyfrol 2, Rhif 3, 182, 05.2021.

Allbwn ymchwil: Cyfraniad at gyfnodolynErthygladolygiad gan gymheiriaid

HarvardHarvard

Mollaahmadi Dehaqi, A, Seydi, V & Madadi, Y 2021, 'Adversarial Image Caption Generator Network', SN Computer Science, cyfrol. 2, rhif 3, 182. https://doi.org/10.1007/s42979-021-00486-y

APA

Mollaahmadi Dehaqi, A., Seydi, V., & Madadi, Y. (2021). Adversarial Image Caption Generator Network. SN Computer Science, 2(3), Erthygl 182. https://doi.org/10.1007/s42979-021-00486-y

CBE

Mollaahmadi Dehaqi A, Seydi V, Madadi Y. 2021. Adversarial Image Caption Generator Network. SN Computer Science. 2(3):Article 182. https://doi.org/10.1007/s42979-021-00486-y

MLA

Mollaahmadi Dehaqi, Ali , Vahid Seydi a Yeganeh Madadi. "Adversarial Image Caption Generator Network". SN Computer Science. 2021. 2(3). https://doi.org/10.1007/s42979-021-00486-y

VancouverVancouver

Mollaahmadi Dehaqi A, Seydi V, Madadi Y. Adversarial Image Caption Generator Network. SN Computer Science. 2021 Mai;2(3):182. Epub 2021 Maw 31. doi: 10.1007/s42979-021-00486-y

Author

Mollaahmadi Dehaqi, Ali ; Seydi, Vahid ; Madadi, Yeganeh . / Adversarial Image Caption Generator Network. Yn: SN Computer Science. 2021 ; Cyfrol 2, Rhif 3.

RIS

TY - JOUR

T1 - Adversarial Image Caption Generator Network

AU - Mollaahmadi Dehaqi, Ali

AU - Seydi, Vahid

AU - Madadi, Yeganeh

PY - 2021/5

Y1 - 2021/5

N2 - Image captioning is a task to make an image description, which needs recognizing the important attributes and also their relationships in the image. This task requires to generate semantically and syntactically correct sentences. Most image captioning models are based on RNN and MLE methods, but we propose a novel model based on GAN networks where it generates the caption of the image through the representation of the image by utilizing the generator adversarial network and it does not need any secondary learning algorithm like policy gradient. Due to the complexity of benchmark datasets such as Flickr and Coco, in both volume and complexity, we introduce a new dataset and perform the experiments on it. The experimental results show the effectiveness of our model compared to the state-of-the-art image captioning methods.

AB - Image captioning is a task to make an image description, which needs recognizing the important attributes and also their relationships in the image. This task requires to generate semantically and syntactically correct sentences. Most image captioning models are based on RNN and MLE methods, but we propose a novel model based on GAN networks where it generates the caption of the image through the representation of the image by utilizing the generator adversarial network and it does not need any secondary learning algorithm like policy gradient. Due to the complexity of benchmark datasets such as Flickr and Coco, in both volume and complexity, we introduce a new dataset and perform the experiments on it. The experimental results show the effectiveness of our model compared to the state-of-the-art image captioning methods.

KW - image captioning

KW - Feature representation

KW - Deep neural network

KW - Generative adversarial network

KW - Novel dataset

U2 - 10.1007/s42979-021-00486-y

DO - 10.1007/s42979-021-00486-y

M3 - Article

VL - 2

JO - SN Computer Science

JF - SN Computer Science

IS - 3

M1 - 182

ER -