Image Classification, Image Captioning and LLM inference with LiteRT
-
Updated
Dec 21, 2025 - Kotlin
Image Classification, Image Captioning and LLM inference with LiteRT
𝗜𝗺𝗮𝗴𝗲 𝗖𝗮𝗽𝘁𝗶𝗼𝗻𝗶𝗻𝗴 𝘄𝗶𝘁𝗵 𝗢𝗰𝗰𝗹𝘂𝘀𝗶𝗼𝗻 𝗔𝗻𝗮𝗹𝘆𝘀𝗶𝘀 | 𝗩𝗶𝗧-𝗚𝗣𝗧𝟮 | 𝗦𝗺𝗼𝗹𝗩𝗟𝗠 | 𝗕𝗘𝗥𝗧
This project implements an Image Caption Generator, a deep learning model that automatically generates descriptive captions for images. It combines Convolutional Neural Networks (CNNs) for image feature extraction and Recurrent Neural Networks (LSTMs) for language modeling, trained on image–caption datasets.
Data and code for the paper "Exploiting Image-Text Synergy for Contextual Image Captioning" published at EACL Workshop LANTERN 2021.
Fully Connected Neural Networks, Multilayer Neural Networks, MAdaline, CNNs, Segmentation, Detection, RNNs, CNN-LSTM, LSTM, Bi-LSTM, GRU, Transformers, Huber Loss, ViT, DGMs, Triplet VAE, AdvGAN, Image Caption Generation, attention, LLM Fine-Tuning, Soft Prompting, LoRA, Layer Freezing, SlimOrca
Blip 2 Captioning, Mass Captioning, Question Answering, and other tools.
Image Captioning Using CLIP & GPT Models
BLIP-ImageCaption
Add a description, image, and links to the image-caption-generation topic page so that developers can more easily learn about it.
To associate your repository with the image-caption-generation topic, visit your repo's landing page and select "manage topics."