Skip to content

This project applies Machine Learning techniques to predict the survival of Titanic passengers. It explores various data preprocessing, visualization, and model-building techniques to enhance predictive accuracy.

Notifications You must be signed in to change notification settings

kumarritik24/Titanic-Survival-Prediction-using-Machine-Learning

Folders and files

NameName
Last commit message
Last commit date

Latest commit

Β 

History

8 Commits
Β 
Β 
Β 
Β 
Β 
Β 

Repository files navigation

🚒 Titanic Survival Prediction using Machine Learning

🎯 Built an end-to-end machine learning pipeline to predict passenger survival in the Titanic dataset. Applied classification models (SVM, Random Forest, Logistic Regression) and achieved strong results using feature engineering, visualization, and evaluation metrics.


πŸ“ Project Overview

This project applies classic ML techniques to predict the survival of Titanic passengers. It involves:

  • Cleaning and preprocessing data
  • Feature extraction and transformation
  • Exploratory data analysis (EDA)
  • Applying supervised & unsupervised models
  • Model evaluation using classification metrics

πŸ“Š Dataset

The dataset used is the Titanic dataset, which includes information like:

  • Passenger ID, Name
  • Age, Gender, Class
  • Fare, Embarked Port
  • Survival status

πŸ” Features & Workflow

πŸ“¦ Data Processing & Feature Engineering
  • Handled missing values and outliers
  • Extracted features like Title, FamilySize, IsAlone
  • One-hot encoded categorical variables
  • Correlation heatmap + feature importance
πŸ“ˆ Exploratory Data Analysis (EDA)
  • Distribution plots for Age, Fare, Class
  • Survival rate by Gender, Class, Embarked
  • Cross-tab visualizations
πŸ€– Model Training & Evaluation
  • Supervised Models:
    • Logistic Regression
    • Random Forest Classifier
    • SVM
    • K-Nearest Neighbors (KNN)
    • Naive Bayes
  • Unsupervised Models:
    • K-Means Clustering
    • DBSCAN Clustering
  • Model Metrics:
    • Accuracy, F1-score, Recall
    • Confusion Matrix
    • ROC-AUC Score

🧰 Tools & Libraries

  • pandas, numpy – Data manipulation
  • matplotlib, seaborn – Data visualization
  • scikit-learn – Modeling & preprocessing
  • jupyter notebook – Development interface

πŸ§ͺ Results

The models were evaluated on:

  • Accuracy
  • Precision
  • Recall
  • F1-score

πŸ’‘ Achieved high predictive performance using Random Forest and SVM models with properly tuned hyperparameters.


βš™οΈ Installation & Usage

# Clone the repo
git clone https://github.com/kumarritik24/Titanic-Survival-Prediction-using-Machine-Learning.git
cd Titanic-Survival-Prediction-using-Machine-Learning

# Install required packages
pip install -r requirements.txt

# Run the notebook
jupyter notebook titanic-ml.ipynb

About

This project applies Machine Learning techniques to predict the survival of Titanic passengers. It explores various data preprocessing, visualization, and model-building techniques to enhance predictive accuracy.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published