An MLOps workflow for training, inference, experiment tracking, model registry, and deployment.
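The repo's stack isn't specified; below is a minimal sketch of the tracking-and-registry loop it describes, using MLflow as an assumed backend. The experiment and model names are illustrative.

```python
# Minimal sketch: train, log metrics, and register a model.
# MLflow is an assumption here; the repo does not name its tracking backend.
import mlflow
import mlflow.sklearn
from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression

X, y = make_classification(n_samples=200, random_state=0)

mlflow.set_experiment("demo-experiment")  # hypothetical experiment name
with mlflow.start_run():
    model = LogisticRegression(max_iter=200).fit(X, y)
    mlflow.log_metric("train_accuracy", model.score(X, y))
    # Logging with registered_model_name adds the model to the registry.
    mlflow.sklearn.log_model(model, "model", registered_model_name="demo-model")
```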
A comprehensive .NET MAUI plugin for ML inference with ONNX Runtime, CoreML, and platform-native acceleration support
gRPC server for Machine Learning (ML) Model Inference in Rust.
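For context, a client for such a server could look like the Python sketch below. The service name, RPC, and message fields are hypothetical; it assumes stubs generated from a proto defining `Predict(PredictRequest) -> PredictResponse`.

```python
# Hypothetical Python client for a gRPC ML inference server.
# inference_pb2 / inference_pb2_grpc are assumed protoc-generated modules.
import grpc
import inference_pb2
import inference_pb2_grpc

def predict(features):
    with grpc.insecure_channel("localhost:50051") as channel:
        stub = inference_pb2_grpc.InferenceStub(channel)  # assumed service name
        request = inference_pb2.PredictRequest(features=features)
        response = stub.Predict(request, timeout=5.0)
        return response.predictions
```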
[TPDS 2025] EdgeAIBus: AI-driven Joint Container Management and Model Selection Framework for Heterogeneous Edge Computing
ML service for virtual cats that actually learn. PPO brains, personality drift, a mood system. Built in 10 hours.
PoC demonstrating distributed workload execution using Ray as the primary compute framework, with Prefect for workflow orchestration, supporting cloud-native (Kubernetes) deployments.
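A minimal sketch of that pattern, under the assumption that Prefect drives the workflow while Ray fans work out across the cluster; the task bodies and batch data are illustrative.

```python
# Prefect orchestrates; Ray executes in parallel.
import ray
from prefect import flow, task

@ray.remote
def score_batch(batch):
    # Placeholder for model inference on one shard of data.
    return [x * 2 for x in batch]

@task
def run_distributed(batches):
    # On a real cluster this would be ray.init(address="auto").
    ray.init(ignore_reinit_error=True)
    return ray.get([score_batch.remote(b) for b in batches])

@flow
def inference_pipeline():
    batches = [[1, 2], [3, 4], [5, 6]]
    return run_distributed(batches)

if __name__ == "__main__":
    inference_pipeline()
```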
Production-ready ML model serving with FastAPI, TensorFlow, Docker, Kubernetes, and Prometheus. Features CI/CD, health checks, and scalable inference.
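A minimal sketch of that serving pattern: a FastAPI app exposing a health check (suitable for Kubernetes probes) and a prediction endpoint. The model path and input schema are assumptions, not the repo's actual API.

```python
# FastAPI + TensorFlow serving sketch with a health endpoint.
import numpy as np
import tensorflow as tf
from fastapi import FastAPI
from pydantic import BaseModel

app = FastAPI()
model = tf.keras.models.load_model("model/")  # hypothetical SavedModel path

class PredictRequest(BaseModel):
    inputs: list[list[float]]  # assumed input shape: batch of feature vectors

@app.get("/health")
def health():
    # Used by Kubernetes liveness/readiness probes.
    return {"status": "ok"}

@app.post("/predict")
def predict(req: PredictRequest):
    preds = model.predict(np.array(req.inputs))
    return {"predictions": preds.tolist()}
```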
Project submission.
Enterprise Data Warehouse & ML Platform: a high-performance platform processing 24B records with sub-60s latency and 100K records/sec throughput, featuring 32 fact tables, 128 dimensions, and automated ML pipelines achieving 91.2% accuracy. Real-time ML inference serves 300K+ predictions/hour with ensemble models.
A lightweight, framework-agnostic middleware that dynamically batches inference requests in real time to maximize GPU/TPU utilization.
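The core idea behind such middleware is a micro-batching loop: requests queue up and are flushed either when the batch fills or when a timeout expires, trading a little latency for much better accelerator utilization. A self-contained asyncio sketch of the technique (all names illustrative):

```python
# Dynamic micro-batching: collect requests until the batch is full
# or a deadline passes, then run them through the model together.
import asyncio

class DynamicBatcher:
    def __init__(self, run_batch, max_batch_size=32, max_wait_ms=10):
        self.run_batch = run_batch            # callable: list[input] -> list[output]
        self.max_batch_size = max_batch_size
        self.max_wait = max_wait_ms / 1000
        self.queue: asyncio.Queue = asyncio.Queue()

    async def infer(self, item):
        # Callers await a future that the worker resolves per-item.
        fut = asyncio.get_running_loop().create_future()
        await self.queue.put((item, fut))
        return await fut

    async def worker(self):
        while True:
            # Block for the first request, then opportunistically fill the batch.
            batch = [await self.queue.get()]
            deadline = asyncio.get_running_loop().time() + self.max_wait
            while len(batch) < self.max_batch_size:
                timeout = deadline - asyncio.get_running_loop().time()
                if timeout <= 0:
                    break
                try:
                    batch.append(await asyncio.wait_for(self.queue.get(), timeout))
                except asyncio.TimeoutError:
                    break
            outputs = self.run_batch([item for item, _ in batch])
            for (_, fut), out in zip(batch, outputs):
                fut.set_result(out)
```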
Client-side React + Vite web app that records and processes voice audio and sends extracted features to an API for automated, speech-based stress detection.
Kickstart your MLOps initiative with a flexible, robust, and productive Python package.
Microservice to digitize a chess scoresheet
Scripts for benchmarking vLLM with Llama 8B on an NVIDIA RTX 4090 GPU
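A rough throughput benchmark in the spirit of those scripts; the checkpoint id is an assumption (the description only says "Llama 8B"), and batch/sampling settings are illustrative.

```python
# Offline vLLM throughput benchmark sketch.
import time
from vllm import LLM, SamplingParams

llm = LLM(model="meta-llama/Meta-Llama-3-8B-Instruct")  # assumed checkpoint
params = SamplingParams(max_tokens=128, temperature=0.8)
prompts = ["Explain dynamic batching in one paragraph."] * 64

start = time.perf_counter()
outputs = llm.generate(prompts, params)
elapsed = time.perf_counter() - start

generated = sum(len(o.outputs[0].token_ids) for o in outputs)
print(f"{generated / elapsed:.1f} generated tokens/sec over {len(prompts)} prompts")
```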