Vakta 2.0 is a lightweight and powerful Text-to-Audio Converter built with Python.
It can take any PDF, image, or text file and turn it into an audio file β so you can listen to books, notes, or documents hands-free.
The app features a clean graphical interface, automatic text extraction (even from scanned pages), and smooth background processing.
- Resolving the problem for scanned books pdf by using OCR.
- π§ OCR Support: Reads text from scanned PDFs and images using
keras_ocr - π¨ Modern GUI: Built with
tkinterin a dark theme - β‘ Threaded Processing: Uses Pythonβs
threadingandqueueto keep the GUI responsive while converting files - π Improved Audio Engine: Uses
pyttsx3for clear and natural text-to-speech output
| Library | Purpose |
|---|---|
| pdfplumber | Extracts text from text-based PDFs |
| keras_ocr | Detects and reads text from scanned PDFs or images |
| pyttsx3 | Converts text into an audio file |
| tkinter | Creates the graphical interface |
| threading & queue | Handles background tasks without freezing the GUI |
You can set up Vakta 2.0 easily:
git clone https://github.com/yourusername/Vakta-2.0.git
cd Vakta-2.0
pip install pdfplumber pyttsx3 keras-ocr Pillow numpyπ§© Note: The first time you run Vakta,
keras_ocrmay take a few minutes to download its models.
Vakta 2.0 uses the threading module to handle conversions in a separate thread.
This ensures that the GUI never freezes while reading large PDFs or converting audio.
When you click βCreate AudioBookβ, a new thread is started:
threading.Thread(
target=converter,
args=(book_entry.get(), start_entry.get(), end_entry.get(), audio_entry.get(), task_queue),
daemon=True
).start()The Queue object (task_queue) is used to safely send status updates back to the main thread, which then updates the GUI labels using:
root.after(200, poll_queue)This pattern keeps GUI feel active like every thing is happening at frontend.
- Run the app:
python vakta2.py
- Choose a file (PDF, image, or text).
- For PDFs, enter start and end page numbers.
- Type a name for your audio file.
- Click Create AudioBook.
- The conversion runs in a background thread.
- Youβll see live status messages (e.g., Processing page 3/10...).
Your audio (.mp3) file will be saved in the same folder.
Opening PDF...
Processing page 1/4...
Extracted text successfully, converting to audio...
β
Conversion Finished: saved as output.mp3
- π£οΈ Selectable voices (male/female)
- ποΈ Adjustable speech speed and pitch
- π Output folder selection
- π Batch file conversion
Vakta 2.0 was developed by Karlex to make reading easier and more accessible.
Itβs free, open-source, and designed to turn your reading material into an audiobook effortlessly.
Version: 2.0
License: MIT