Image Captioning in Tamil Language with Merge Architecture

Rajalingam, G; Wickramaarachchi, WU

dc.contributor.author	Rajalingam, G
dc.contributor.author	Wickramaarachchi, WU
dc.date.accessioned	2021-12-24T06:26:56Z
dc.date.available	2021-12-24T06:26:56Z
dc.date.issued	2021
dc.identifier.uri	http://ir.kdu.ac.lk/handle/345/5209
dc.description.abstract	Image captioning is the process of describing the content of an image in natural language. The task, which combines computer vision and natural language processing, has been attempted for English with enormous success, owing to the availability of massive image-caption paired corpora such as Flickr and Microsoft Common Objects in Context (MS-COCO). Comparable developments remain rare for non-English languages, with the exception of a few such as Chinese, Turkish, German and Arabic. For Tamil, the problem has barely been touched, due to the lack of a large paired corpus. In this work, a paired corpus inspired by the Flickr30K dataset has been created in Tamil for image captioning. The paper also reports experiments with an image captioning model that combines a Convolutional Neural Network (CNN) with a Long Short-Term Memory (LSTM) network, specifically the Merge model, for Tamil caption generation. In this approach, the image vectors are incorporated in a layer that follows the LSTM layer. The results proved satisfactory in evaluation, with a Bilingual Evaluation Understudy (BLEU) score of 0.37, which suggests that further improvement is possible with a more refined and improved dataset.	en_US
dc.language.iso	en	en_US
dc.subject	Tamil caption generation	en_US
dc.subject	convolutional neural network	en_US
dc.subject	long short-term memory	en_US
dc.subject	natural language processing	en_US
dc.title	Image Captioning in Tamil Language with Merge Architecture	en_US
dc.type	Article Full Text	en_US
dc.identifier.journal	KDU IRC, 2021	en_US
dc.identifier.issue	Faculty of Computing	en_US
dc.identifier.pgnos	114-121	en_US
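
The abstract reports a BLEU score of 0.37 without stating which BLEU variant was used. As an illustration only, the following is a minimal sketch of the standard sentence-level BLEU calculation (clipped n-gram precision combined with a brevity penalty), assuming whitespace-tokenized captions, uniform n-gram weights, and no smoothing. The function names and the toy captions are hypothetical and are not taken from the paper.

```python
import math
from collections import Counter


def ngrams(tokens, n):
    """Return a Counter of all n-grams (as tuples) in a token list."""
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))


def modified_precision(candidate, references, n):
    """Clipped n-gram precision: a candidate n-gram is credited at most as
    many times as it occurs in any single reference."""
    cand_counts = ngrams(candidate, n)
    if not cand_counts:
        return 0.0
    max_ref = Counter()
    for ref in references:
        for gram, count in ngrams(ref, n).items():
            max_ref[gram] = max(max_ref[gram], count)
    clipped = sum(min(count, max_ref[gram]) for gram, count in cand_counts.items())
    return clipped / sum(cand_counts.values())


def sentence_bleu(candidate, references, max_n=4):
    """Geometric mean of clipped precisions for n = 1..max_n, multiplied by
    the brevity penalty. Returns 0.0 if any precision is zero (no smoothing)."""
    precisions = [modified_precision(candidate, references, n)
                  for n in range(1, max_n + 1)]
    if min(precisions) == 0.0:
        return 0.0
    log_mean = sum(math.log(p) for p in precisions) / max_n
    # Use the reference length closest to the candidate length
    # (ties are resolved in favour of the shorter reference).
    c = len(candidate)
    r = min((len(ref) for ref in references),
            key=lambda length: (abs(length - c), length))
    brevity_penalty = 1.0 if c > r else math.exp(1 - r / c)
    return brevity_penalty * math.exp(log_mean)


# Toy example with invented English tokens standing in for Tamil captions.
generated = "a dog runs on the grass".split()
human = ["a dog runs on the green grass".split()]
score = sentence_bleu(generated, human)
```

A corpus-level score, which is what papers usually report, aggregates the clipped counts over all captions before taking the geometric mean, so it is not simply the average of per-sentence scores.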

Files in this item

Name: 11.pdf
Size: 614.5 KB
Format: PDF
