Back to Portfolio
AI Image Caption Generator
Machine Learning Application

AI Image Caption Generator

Upload an image and let AI generate a descriptive caption using CNN and RNN architectures.

Project Overview

The AI Image Caption Generator is a web-based application that uses deep learning to automatically generate meaningful captions for uploaded images.

It combines Convolutional Neural Networks (CNNs) for visual feature extraction with Recurrent Neural Networks (RNNs) for natural language generation.

The project leverages the BLIP (Bootstrapped Language Image Pretraining) model, integrating computer vision and natural language processing to describe images accurately and fluently.

Key Features

  • Upload any image (JPG/PNG) and get an AI-generated caption instantly
  • Built using the BLIP model (from Salesforce) for high-quality caption generation
  • Streamlit interface for seamless user interaction
  • Efficient GPU inference with PyTorch backend
  • Supports various image formats and sizes
  • Lightweight and easily deployable to web platforms like Streamlit Cloud

Technologies Used

PythonStreamlitPyTorchTransformers (Hugging Face)CNNRNNBLIP ModelPillow

Project Gallery

Image upload interface

Project Details

Client

Personal Project

Timeline

2 weeks

Role

AI Developer & ML Engineer

© 2026 Kheireddine Derbali AKA KDB. All rights reserved.

0%