Adversarial Image Caption Generator Network
Research output: Contribution to journal › Article › peer-review
Standard Standard
In: SN Computer Science, Vol. 2, No. 3, 182, 05.2021.
Research output: Contribution to journal › Article › peer-review
HarvardHarvard
APA
CBE
MLA
VancouverVancouver
Author
RIS
TY - JOUR
T1 - Adversarial Image Caption Generator Network
AU - Mollaahmadi Dehaqi, Ali
AU - Seydi, Vahid
AU - Madadi, Yeganeh
PY - 2021/5
Y1 - 2021/5
N2 - Image captioning is a task to make an image description, which needs recognizing the important attributes and also their relationships in the image. This task requires to generate semantically and syntactically correct sentences. Most image captioning models are based on RNN and MLE methods, but we propose a novel model based on GAN networks where it generates the caption of the image through the representation of the image by utilizing the generator adversarial network and it does not need any secondary learning algorithm like policy gradient. Due to the complexity of benchmark datasets such as Flickr and Coco, in both volume and complexity, we introduce a new dataset and perform the experiments on it. The experimental results show the effectiveness of our model compared to the state-of-the-art image captioning methods.
AB - Image captioning is a task to make an image description, which needs recognizing the important attributes and also their relationships in the image. This task requires to generate semantically and syntactically correct sentences. Most image captioning models are based on RNN and MLE methods, but we propose a novel model based on GAN networks where it generates the caption of the image through the representation of the image by utilizing the generator adversarial network and it does not need any secondary learning algorithm like policy gradient. Due to the complexity of benchmark datasets such as Flickr and Coco, in both volume and complexity, we introduce a new dataset and perform the experiments on it. The experimental results show the effectiveness of our model compared to the state-of-the-art image captioning methods.
KW - image captioning
KW - Feature representation
KW - Deep neural network
KW - Generative adversarial network
KW - Novel dataset
U2 - 10.1007/s42979-021-00486-y
DO - 10.1007/s42979-021-00486-y
M3 - Article
VL - 2
JO - SN Computer Science
JF - SN Computer Science
IS - 3
M1 - 182
ER -