Kumar, AthulAthulKumarAgrawal, AarchiAarchiAgrawalAshin Shanly, K. S.K. S.Ashin ShanlyDas, SudipSudipDasHarilal, NidhinNidhinHarilal2025-08-312025-08-312022-01-08[9781450385824]10.1145/3493700.34937542-s2.0-85122698210https://d8.irins.org/handle/IITG2025/26203Image captions are those crisp descriptions that you see under images. They generally provide the viewer with a brief idea about the image context. To generate an accurate description of the scene, the model requires a semantic and spatial understanding of the contents in the scene. This paper proposes a novel approach using Siamese Graph Convolutional Network (S-GCN), making use of a non-parametric Kernel Activation function (KAF) followed by an LSTM with attention to generate natural language captions for the input image. Siamese-GCN captures deep semantic relations and makes the model more robust to class imbalances. We use an extended kernel activation function and regularize with standard lp-norm techniques, improving the overall model performance by a significant margin. The model is trained and tested on the Flickr30K data set and evaluated on BLEU-4 scores.falseImage Caption Generator using Siamese Graph Convolutional Networks and LSTMConference Paper306-3078 January 20220cpConference Proceeding0