
Machine Learning Application
AI Image Caption Generator
Upload an image and let AI generate a descriptive caption using CNN and RNN architectures.
Project Overview
The AI Image Caption Generator is a web-based application that uses deep learning to automatically generate meaningful captions for uploaded images.
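It follows the encoder-decoder approach to captioning that early systems built from Convolutional Neural Networks (CNNs) for visual feature extraction and Recurrent Neural Networks (RNNs) for natural language generation.
The project leverages the BLIP (Bootstrapping Language-Image Pre-training) model, which fills both roles with transformers (a Vision Transformer image encoder and a transformer text decoder), integrating computer vision and natural language processing to describe images accurately and fluently.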
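As a rough illustration of the captioning step described above, the sketch below loads a BLIP checkpoint through Hugging Face Transformers and generates one caption. It is not the project's actual code: the checkpoint name `Salesforce/blip-image-captioning-base`, the helper names `load_blip` and `generate_caption`, and the `max_new_tokens` default are all assumptions made for the example.

```python
"""Minimal sketch: caption a single image with BLIP (assumed setup)."""

# Assumed checkpoint; the project may use a different BLIP variant.
MODEL_ID = "Salesforce/blip-image-captioning-base"


def load_blip(model_id=MODEL_ID):
    """Load the BLIP processor and model, using the GPU when one is available."""
    # Heavy dependencies are imported lazily so this module stays importable
    # on machines without PyTorch or Transformers installed.
    import torch
    from transformers import BlipForConditionalGeneration, BlipProcessor

    device = "cuda" if torch.cuda.is_available() else "cpu"
    processor = BlipProcessor.from_pretrained(model_id)
    model = BlipForConditionalGeneration.from_pretrained(model_id).to(device)
    model.eval()  # inference mode: disables dropout
    return processor, model, device


def generate_caption(image, processor, model, device="cpu", max_new_tokens=30):
    """Return a caption string for one RGB PIL image."""
    # The processor resizes and normalizes the image into a pixel tensor.
    inputs = processor(images=image, return_tensors="pt").to(device)
    # The text decoder generates caption token ids from the image features.
    output_ids = model.generate(**inputs, max_new_tokens=max_new_tokens)
    # Convert token ids back to text, dropping special tokens.
    return processor.decode(output_ids[0], skip_special_tokens=True).strip()
```

A typical call would open a file with Pillow, for example `Image.open(path).convert("RGB")`, and pass the result to `generate_caption` together with the three values returned by `load_blip()`. Loading the model once and reusing it across requests avoids repeating the download and initialization cost.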
Key Features
- Upload any image (JPG/PNG) and get an AI-generated caption instantly
- Built using the BLIP model (from Salesforce) for high-quality caption generation
- Streamlit interface for seamless user interaction
- Efficient GPU inference with PyTorch backend
- Supports various image formats and sizes
- Lightweight and easily deployable to web platforms like Streamlit Cloud
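The upload flow in the feature list could look roughly like the following Streamlit sketch. It is an assumption about how such a page might be wired, not the project's source: `run_app`, `is_supported`, and the `caption_fn` callable (any function that takes a PIL image and returns a string) are hypothetical names introduced for the example.

```python
"""Minimal sketch: Streamlit upload page for an image captioner (assumed layout)."""
import os

# File extensions accepted by the uploader, matching the JPG/PNG feature.
ALLOWED_TYPES = ["jpg", "jpeg", "png"]


def is_supported(filename):
    """Return True if the file name ends in one of the allowed extensions."""
    extension = os.path.splitext(filename)[1].lower().lstrip(".")
    return extension in ALLOWED_TYPES


def run_app(caption_fn):
    """Render the page; caption_fn is a hypothetical image-to-text callable."""
    # Imported lazily so the helper above can be used without Streamlit.
    import streamlit as st
    from PIL import Image

    st.title("AI Image Caption Generator")
    uploaded = st.file_uploader("Upload an image", type=ALLOWED_TYPES)
    if uploaded is None:
        return  # nothing uploaded yet
    if not is_supported(uploaded.name):
        st.error("Please upload a JPG or PNG image.")
        return

    image = Image.open(uploaded).convert("RGB")
    st.image(image, caption="Uploaded image")
    with st.spinner("Generating caption..."):
        caption = caption_fn(image)
    st.success(caption)
```

In an app script started with `streamlit run`, `run_app(...)` would be called at module level, since Streamlit re-executes the script from top to bottom on each interaction. Wrapping the model loader in `st.cache_resource` is one way to keep the model in memory between reruns.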
Technologies Used
Python, Streamlit, PyTorch, Transformers (Hugging Face), CNN, RNN, BLIP Model, Pillow
Project Details
Client
Personal Project
Timeline
2 weeks
Role
AI Developer & ML Engineer
© 2026 Kheireddine Derbali AKA KDB. All rights reserved.