F4566049620 - International Journal of Innovative Technology and Exploring Engineering (IJITEE)

Real Time Image Captioning
Asha G¹, R. Hema Sumanth², A. China Venkat Chowdary³, A. Shashank⁴, T. Sravan⁵
¹Asha G*, Master in Computer Science and Engineering from VTU Belgaum, Karnataka.
²R. Hema Sumanth, Master in Computer Science and Engineering from VTU Belgaum, Karnataka.
³A. China Venkat Chowdary, Pursuing Final Year B. Tech Degree in Computer Science and Engineering from GITAM University, Bengaluru.
⁴A. Shashank, Pursuing Final Year B. Tech Degree in Computer Science and Engineering from GITAM University, Bengaluru.
⁵T. Sravan, Pursuing Final Year B. Tech Degree in Computer Science and Engineering from GITAM University, Bengaluru.
Manuscript received on March 15, 2020. | Revised Manuscript received on March 28, 2020. | Manuscript published on April 10, 2020. | PP: 1707-1709 | Volume-9 Issue-6, April 2020. | Retrieval Number: F4566049620/2020©BEIESP | DOI: 10.35940/ijitee.F4566.049620
© The Authors. Blue Eyes Intelligence Engineering and Sciences Publication (BEIESP). This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/)

Abstract: An image caption generator produces a natural-language description of an image by predicting what is happening in it. We build a hybrid CNN-RNN model: the CNN part uses an Inception model for transfer learning, and the RNN part is used mainly for language modeling. We train and test the model on the Flickr8k dataset. Within the RNN we use an LSTM to mitigate the vanishing and exploding gradient problems during training.
Keywords: CNN-RNN Architecture, LSTM, Softmax, Image Caption Generator.
Scope of the Article: Computer Architecture and VLSI
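
The motivation the abstract gives for choosing an LSTM can be illustrated numerically. The sketch below is not the authors' code. It assumes a scalar tanh RNN with a hypothetical recurrent weight, and an LSTM whose forget-gate logits are held at a hypothetical constant, and it follows only the direct cell-state path of the LSTM (the indirect dependence through the gates is ignored). It compares how much gradient survives after 30 time steps in each case.

```python
import math
from typing import Sequence


def sigmoid(x: float) -> float:
    return 1.0 / (1.0 + math.exp(-x))


def rnn_gradient_factor(w: float, pre_activations: Sequence[float]) -> float:
    """Product of dh_t/dh_{t-1} for a scalar RNN h_t = tanh(w * h_{t-1} + x_t).

    Each step contributes w * (1 - tanh(a_t)^2), where a_t is the
    pre-activation at that step.
    """
    factor = 1.0
    for a in pre_activations:
        factor *= w * (1.0 - math.tanh(a) ** 2)
    return factor


def lstm_cell_gradient_factor(forget_logits: Sequence[float]) -> float:
    """Product of dc_t/dc_{t-1} along the direct cell-state path.

    For c_t = f_t * c_{t-1} + i_t * g_t, the direct contribution of each
    step is the forget gate f_t = sigmoid(z_t).
    """
    factor = 1.0
    for z in forget_logits:
        factor *= sigmoid(z)
    return factor


# Hypothetical values chosen only for illustration.
steps = 30
rnn_factor = rnn_gradient_factor(w=0.9, pre_activations=[1.0] * steps)
lstm_factor = lstm_cell_gradient_factor(forget_logits=[4.0] * steps)

print(f"tanh RNN gradient factor after {steps} steps:  {rnn_factor:.3e}")
print(f"LSTM cell-path gradient factor after {steps} steps: {lstm_factor:.3e}")
```

With these assumed values the tanh RNN factor shrinks by many orders of magnitude, while the LSTM cell-state factor stays of order one because a forget gate near 1 passes the gradient through almost unchanged. This addresses the vanishing case; in practice exploding gradients are commonly also handled with gradient clipping, which the abstract does not mention.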
