AI Image Caption Generator

Overview

The AI Image Caption Generator is a Python application that uses OpenAI's CLIP model to caption images. It provides a Tkinter graphical interface (GUI) for selecting an image and viewing its caption, along with text-to-speech, translation, clipboard copy, and caption history.

Features

AI-Powered Captions – Uses the CLIP model to generate accurate image captions.
Graphical User Interface (GUI) – Simple and user-friendly UI using Tkinter.
Image Preview – Displays the selected image before generating a caption.
Copy to Clipboard – Easily copy captions for further use.
Text-to-Speech – Listen to the generated captions using a speech engine.
Translation Support – Translate captions into multiple languages.
Caption History – Stores generated captions for reference.
Colorful Theme – An elegant and visually appealing UI.
Cross-Platform Support – Runs on both Windows and macOS.
Lightweight & Fast – Works efficiently without high system requirements.
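
CLIP scores how well a piece of text matches an image; it does not write text on its own. This README does not show how the project turns those scores into a caption, so the following is only a minimal sketch of one common approach: rank a set of candidate captions by cosine similarity to the image embedding and pick the best one. The toy 3-dimensional vectors, the candidate captions, the `rank_captions` helper, and the scale factor of 100 are all assumptions made for illustration; real embeddings would come from the CLIP image and text encoders.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


def softmax(scores):
    """Turn raw scores into probabilities that sum to 1."""
    peak = max(scores)  # subtract the maximum for numerical stability
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def rank_captions(image_embedding, caption_embeddings, scale=100.0):
    """Return (caption, probability) pairs sorted best-first.

    `scale` is an assumed temperature that sharpens the distribution.
    """
    captions = list(caption_embeddings)
    sims = [cosine_similarity(image_embedding, caption_embeddings[c])
            for c in captions]
    probs = softmax([scale * s for s in sims])
    return sorted(zip(captions, probs), key=lambda pair: pair[1], reverse=True)


# Toy vectors standing in for real CLIP embeddings (hypothetical values).
image_vec = [0.9, 0.1, 0.2]
candidates = {
    "a photo of a dog": [0.8, 0.2, 0.1],
    "a photo of a cat": [0.1, 0.9, 0.2],
    "a photo of a car": [0.2, 0.1, 0.9],
}

ranking = rank_captions(image_vec, candidates)
best_caption, best_prob = ranking[0]
print(best_caption)
```

With this approach the quality of the caption depends heavily on the candidate list, since the model can only choose among the captions it is given.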

Installation

Prerequisites

Ensure you have Python 3.8+ installed on your system. Tkinter ships with the standard Python installers for Windows and macOS, so it is not installed through pip. Install the remaining dependencies using the command below:

pip install torch torchvision torchaudio clip-by-openai pillow pyperclip pyttsx3 googletrans==4.0.0-rc1 opencv-python-headless
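
After installing, it can help to confirm that every dependency is importable before launching the GUI. The sketch below is not part of the project; it assumes the usual import names for these packages (`PIL` for pillow, `cv2` for opencv-python-headless, `clip` for clip-by-openai), and `missing_modules` is a hypothetical helper. It only looks the modules up and does not import them, so it runs quickly even with torch installed.

```python
import importlib.util

# Import names assumed for the packages listed in the install command.
REQUIRED_MODULES = [
    "torch", "clip", "PIL", "pyperclip",
    "pyttsx3", "googletrans", "cv2", "tkinter",
]


def missing_modules(names):
    """Return the module names that cannot be found on this interpreter."""
    return [name for name in names if importlib.util.find_spec(name) is None]


missing = missing_modules(REQUIRED_MODULES)
if missing:
    print("Missing modules:", ", ".join(missing))
else:
    print("All required modules were found.")
```

If `tkinter` is reported missing, it usually has to be added through the Python installer or the operating system's package manager rather than through pip.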

Running the Application

Clone the repository and navigate to the project directory:

git clone https://github.com/Burhanali2211/Offline_Caption_Generator.git
cd Offline_Caption_Generator

Run the Python script:

python Offline_Caption_Generator.py

Usage

Click "Choose Image" and select an image.
The AI generates a caption based on the image content.
You can copy the caption, listen to it, or translate it.
View history of previously generated captions.
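
The history step above could be backed by something as simple as a JSON file. The following is a minimal sketch under that assumption; the README does not say how the project actually stores history, and the `CaptionHistory` class, its field names, and the file name are hypothetical.

```python
import json
import tempfile
from datetime import datetime
from pathlib import Path


class CaptionHistory:
    """Keeps generated captions in a JSON file so they survive restarts."""

    def __init__(self, path):
        self.path = Path(path)
        self.entries = []
        if self.path.exists():
            self.entries = json.loads(self.path.read_text(encoding="utf-8"))

    def add(self, image_path, caption):
        """Append one caption and write the whole history back to disk."""
        entry = {
            "image": str(image_path),
            "caption": caption,
            "created": datetime.now().isoformat(timespec="seconds"),
        }
        self.entries.append(entry)
        self.path.write_text(json.dumps(self.entries, indent=2),
                             encoding="utf-8")
        return entry

    def recent(self, limit=10):
        """Return up to `limit` entries, newest first."""
        if limit <= 0:
            return []
        return list(reversed(self.entries[-limit:]))


# Demonstration in a temporary directory so no files are left behind.
with tempfile.TemporaryDirectory() as folder:
    history = CaptionHistory(Path(folder) / "caption_history.json")
    history.add("dog.png", "a photo of a dog")
    history.add("cat.png", "a photo of a cat")
    for item in history.recent():
        print(item["created"], item["image"], item["caption"])
```

Rewriting the full file on every `add` keeps the sketch short; for a long history, an append-only format or a small SQLite database would scale better.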

Technologies Used

Python – Backend processing
OpenAI CLIP Model – AI-powered captioning
Tkinter – Graphical user interface
pyttsx3 – Text-to-speech conversion
googletrans – Language translation through an unofficial Google Translate client (requires an internet connection)
Pyperclip – Clipboard management

License

This project is open-source and available under the MIT License.

Contributions

Feel free to fork the repository and submit pull requests with improvements!
