Rao, Ailneni RakshithaAilneni RakshithaRaoRao, ArjunArjunRao2025-08-312025-08-312022-01-01[9781955917803]10.18653/v1/2022.semeval-1.822-s2.0-85137584064https://d8.irins.org/handle/IITG2025/26280Women are frequently targeted online with hate speech and misogyny using tweets, memes, and other forms of communication. This paper describes our system for Task 5 of SemEval-2022: Multimedia Automatic Misogyny Identification (MAMI). We participated in both the sub-tasks, where we used transformer-based architecture to combine features of images and text. We explore models with multi-modal pre-training (VisualBERT) and text-based pretraining (MMBT) while drawing comparative results. We also show how additional training with task-related external data can improve the model performance. We achieved sizable improvements over baseline models and the official evaluation ranked our system 3<sup>rd</sup> out of 83 teams on the binary classification task (Sub-task A) with an F1 score of 0.761, and 7<sup>th</sup> out of 48 teams on the multi-label classification task (Sub-task B) with an F1 score of 0.705.trueASRtrans at SemEval-2022 Task 5: Transformer-based Models for Meme ClassificationConference Paperhttps://aclanthology.org/2022.semeval-1.82.pdf597-60420223cpConference Proceeding3