This project is a lightweight local chatbot built using llama.cpp with a Gradio web interface.
It supports multiple open-source language models, allows switching between them instantly, and includes full chat-history management — all running fully offline on your own machine.
- TinyLlama-1.1B-Chat (Q8_0) — ultra-fast and lightweight
- Mistral-7B-Instruct (Q2_K) — stronger reasoning with low memory usage
- DeepSeek-R1-Qwen3-8B (Q4_K_XL) — deeper answers with slower speed
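All three are quantized GGUF files run locally through llama.cpp. As a rough illustration (assuming the llama-cpp-python bindings; the model path and parameters are placeholders, not the project's actual configuration), loading one of them looks like this:

```python
# Minimal sketch of loading a quantized GGUF model with llama-cpp-python.
# Path and parameters are placeholders; download.sh decides the real layout.
from llama_cpp import Llama

llm = Llama(
    model_path="models/tinyllama-1.1b-chat-v1.0.Q8_0.gguf",
    n_ctx=2048,    # context window size
    n_threads=4,   # CPU threads to use
)

reply = llm.create_chat_completion(
    messages=[{"role": "user", "content": "Hello!"}],
    max_tokens=64,
)
print(reply["choices"][0]["message"]["content"])
```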
- Clean, responsive browser interface
- One-click model switching
- Smooth message display
- Download current conversation as JSON
- Upload & load past chat histories
- All files are saved inside the history/ directory (see the sketch after this list)
- No API calls, no network dependency
- Ideal for private, offline, or on-device use
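For reference, saving and loading a conversation under history/ can be as simple as dumping the message list to JSON; the function and file names below are illustrative assumptions, not the project's actual helpers:

```python
# Illustrative sketch of chat-history persistence; names are assumptions,
# not the actual helpers in chatbot.py.
import json
from pathlib import Path

HISTORY_DIR = Path("history")
HISTORY_DIR.mkdir(exist_ok=True)

def save_history(messages, name="chat.json"):
    """Write a list of {"role": ..., "content": ...} messages to history/."""
    (HISTORY_DIR / name).write_text(json.dumps(messages, indent=2))

def load_history(name="chat.json"):
    """Load a previously saved conversation back into memory."""
    return json.loads((HISTORY_DIR / name).read_text())
```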
mini_chatbot/
├── app.py # Main launcher for the Gradio UI
├── chatbot.py # Backend logic + llama.cpp wrapper
├── download.sh # Downloads all model files
├── requirements.txt # Python dependencies
├── history/ # Stored chat histories (JSON)
└── ui/ # UI helper components
- Clone the repository
git clone https://github.com/your-username/mini_chatbot.git
cd mini_chatbot
- (Optional) Create a virtual environment
python -m venv .venv
source .venv/bin/activate
- Install dependencies
pip install -r requirements.txt
- Download the models using the included script:
chmod +x download.sh
./download.sh
This fetches all required GGUF model files into the proper folders.
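If you prefer to fetch a single model manually instead, huggingface_hub can download individual GGUF files; the repo and file names below are placeholders, since download.sh defines the actual sources:

```python
# Manual alternative to download.sh; repo_id and filename are placeholders.
from huggingface_hub import hf_hub_download

hf_hub_download(
    repo_id="TheBloke/TinyLlama-1.1B-Chat-v1.0-GGUF",
    filename="tinyllama-1.1b-chat-v1.0.Q8_0.gguf",
    local_dir="models",
)
```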
Start the Gradio UI:
python app.py
Then open:
http://127.0.0.1:7860
You can now:
✔ Select a model
✔ Chat normally
✔ Save / load chat history
✔ Switch models mid-session
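For context, the sketch below shows one way a Gradio chat function can wrap a llama.cpp model; it is illustrative only, and app.py's real wiring, function names, and model paths may differ:

```python
# Illustrative Gradio + llama.cpp wiring (assumes a recent Gradio version
# and the llama-cpp-python bindings); not the project's actual app.py.
import gradio as gr
from llama_cpp import Llama

llm = Llama(model_path="models/mistral-7b-instruct.Q2_K.gguf", n_ctx=2048)  # placeholder path

def respond(message, history):
    # With type="messages", history is a list of {"role", "content"} dicts.
    messages = [{"role": m["role"], "content": m["content"]} for m in history]
    messages.append({"role": "user", "content": message})
    out = llm.create_chat_completion(messages=messages, max_tokens=256)
    return out["choices"][0]["message"]["content"]

gr.ChatInterface(respond, type="messages").launch(server_name="127.0.0.1", server_port=7860)
```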
- TinyLlama is best for speed and quick replies.
- Mistral and DeepSeek-R1-Qwen3 produce better quality and depth, but take more processing time, especially on CPU.
- History files are standard JSON and can be edited directly.
- All computation is local, making the app suitable for private or offline applications.



