GPU Computing: kNN and MLP Acceleration

Overview

This project investigates the impact of GPU acceleration on machine learning algorithms.
We focused on:

  • k-Nearest Neighbors (kNN)
  • Multilayer Perceptron (MLP)

Both models were implemented and benchmarked on both CPU and GPU to measure runtime differences. The work was carried out as part of the GPU Computing course at LUT University.


Objectives

  • Implement kNN and MLP for classification.
  • Compare runtime and accuracy between CPU and GPU execution.
  • Evaluate how dataset size and algorithm type affect GPU speedup.

Dataset

  • File: MLoGPU_data3_train.csv – dataset provided by the course.
  • Samples: 4,000
  • Features: 7 numerical values per sample
  • Classes: 7 (multi-class classification)
  • Preprocessing (see the sketch after this list):
    • Min–Max normalization
    • Labels cast to integers
    • Train/test split (80/20, stratified)
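
The preprocessing steps above can be sketched roughly as follows. The column layout (features first, label last), the header row, and the random_state are assumptions for illustration, not taken from the notebooks:

import numpy as np
from sklearn.model_selection import train_test_split
from sklearn.preprocessing import MinMaxScaler

# skiprows=1 assumes a header row; adjust if the CSV has none
data = np.loadtxt("data/MLoGPU_data3_train.csv", delimiter=",", skiprows=1)
X = data[:, :7]             # 7 numerical features per sample (assumed layout)
y = data[:, 7].astype(int)  # labels cast to integers

# Stratified 80/20 split, as described above (random_state is arbitrary)
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, stratify=y, random_state=42
)

# Fit Min–Max scaling on the training split only, then apply to both splits
scaler = MinMaxScaler()
X_train = scaler.fit_transform(X_train)
X_test = scaler.transform(X_test)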

Technologies Used

  • Python 3.10
  • Google Colab (GPU runtime)
  • CuPy – GPU array operations & custom CUDA kernels
  • PyTorch – neural network implementation (MLP)
  • NumPy – CPU-based array operations
  • Matplotlib – visualizations
  • Scikit-learn – preprocessing (train/test split, scaling)

Results

k-Nearest Neighbors (kNN)

  • Best accuracy: ~52% (k = 1)
  • Runtime comparison:
    • CPU avg: ~0.50s
    • GPU avg: ~0.006s
  • Speedup: up to 170x for small k, ~30–40x for larger k
  • Observation: GPU acceleration was highly effective for the distance calculations (sketched below), but accuracy was limited by class imbalance.
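
As an illustration of the GPU distance step, here is a minimal brute-force kNN sketch using plain CuPy array operations. It is a stand-in, not the project's implementation (which also used a custom CUDA kernel), and knn_predict is a hypothetical name:

import cupy as cp

def knn_predict(X_train, y_train, X_test, k=1):
    # Pairwise squared Euclidean distances, shape (n_test, n_train),
    # computed in one broadcast on the GPU
    diff = X_test[:, None, :] - X_train[None, :, :]
    dist = cp.sum(diff ** 2, axis=2)
    # Integer labels of the k nearest training samples per test sample
    nearest = y_train[cp.argsort(dist, axis=1)[:, :k]]
    # Majority vote via one-hot counting over the classes
    n_classes = int(y_train.max()) + 1
    counts = cp.eye(n_classes)[nearest].sum(axis=1)
    return counts.argmax(axis=1)

# Arrays must live on the GPU first, e.g.:
# pred = knn_predict(cp.asarray(X_train), cp.asarray(y_train), cp.asarray(X_test))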

Multilayer Perceptron (MLP)

  • Architecture: 7 → 64 → 64 → 7 (ReLU + CrossEntropyLoss)
  • Training setup: 100 epochs, Adam optimizer (lr=0.001)
  • Accuracy:
    • CPU: 52.25%
    • GPU: 53.87%
  • Runtime:
    • Training: CPU ~9.95s, GPU ~10.38s
    • Inference: CPU ~0.01s, GPU ~0.012s
  • Observation: GPU overhead outweighed the benefits for this small dataset and simple model (a training sketch follows below).
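
A sketch of the stated architecture and training setup in PyTorch. Full-batch training and the tensor names X_train / y_train are assumptions; the notebook's exact loop may differ:

import torch
import torch.nn as nn

device = torch.device("cuda" if torch.cuda.is_available() else "cpu")

# 7 -> 64 -> 64 -> 7 with ReLU activations, as listed above
model = nn.Sequential(
    nn.Linear(7, 64), nn.ReLU(),
    nn.Linear(64, 64), nn.ReLU(),
    nn.Linear(64, 7),
).to(device)

criterion = nn.CrossEntropyLoss()
optimizer = torch.optim.Adam(model.parameters(), lr=0.001)

# X_train: (N, 7) features; y_train: integer class labels (assumed names)
X_train_t = torch.as_tensor(X_train, dtype=torch.float32, device=device)
y_train_t = torch.as_tensor(y_train, dtype=torch.long, device=device)

for epoch in range(100):  # 100 epochs, as stated above
    optimizer.zero_grad()
    loss = criterion(model(X_train_t), y_train_t)
    loss.backward()
    optimizer.step()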

Key Findings

  • GPU acceleration provides significant speedups for highly parallelizable methods like kNN.
  • For small networks (MLP) and limited data, GPU benefits are minimal.
  • Dataset imbalance and overlapping features limited accuracy more than compute power.
  • Writing a custom CUDA kernel highlighted the importance of memory management and thread-level parallelism (an illustrative kernel sketch follows).
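
The project's actual kernel lives in the notebooks; as an illustration of the thread-level parallelism involved, here is a hypothetical CuPy RawKernel that assigns one thread to each (test, train) pair. The names sqdist and gpu_sqdist are invented for this sketch, and float32 C-contiguous inputs are assumed:

import cupy as cp
import numpy as np

# One thread per (test, train) pair; each thread loops over the features
_sqdist_kernel = cp.RawKernel(r'''
extern "C" __global__
void sqdist(const float* test, const float* train, float* out,
            int n_test, int n_train, int n_feat) {
    int i = blockIdx.y * blockDim.y + threadIdx.y;  // test-sample index
    int j = blockIdx.x * blockDim.x + threadIdx.x;  // train-sample index
    if (i >= n_test || j >= n_train) return;
    float acc = 0.0f;
    for (int f = 0; f < n_feat; ++f) {
        float d = test[i * n_feat + f] - train[j * n_feat + f];
        acc += d * d;  // plain global-memory reads; no shared-memory tiling here
    }
    out[i * n_train + j] = acc;
}
''', 'sqdist')

def gpu_sqdist(test, train):
    # Squared Euclidean distance matrix via the raw kernel
    n_test, n_feat = test.shape
    n_train = train.shape[0]
    out = cp.empty((n_test, n_train), dtype=cp.float32)
    block = (16, 16)
    grid = ((n_train + 15) // 16, (n_test + 15) // 16)
    _sqdist_kernel(grid, block,
                   (test, train, out,
                    np.int32(n_test), np.int32(n_train), np.int32(n_feat)))
    return out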

Repository Structure

GPU-Computing/
│
├── notebooks/               # Jupyter notebooks
│   ├── 5_knn.ipynb
│   └── 5_mlp.ipynb
│
├── data/                    # Dataset
│   └── MLoGPU_data3_train.csv
│
├── docs/                    # Project report + description
│   ├── 5.pdf
│   └── MLoGPU_description.pdf
│
├── README.md                # Project documentation
├── LICENSE                  # MIT License
├── .gitignore               # Ignored files config
└── .gitattributes           # Text file normalization

Authors

This project was completed for the GPU Computing course at LUT University by:

  • Nada Rahali – MLP implementation, report writing
  • Tanjuma Haque – kNN implementation (CPU + GPU kernel), documentation

License

This project is licensed under the MIT License – see the LICENSE file for details.
