kekmodel

Follow

JD Kim kekmodel

Follow

AI research engineer

96 followers · 25 following

Achievements

Achievements

kekmodel/README.md

🔝 Top Contributed Repo

Pinned Loading

THUDM/slime THUDM/slime Public

slime is an LLM post-training framework for RL Scaling.

Python 3k 364
FixMatch-pytorch FixMatch-pytorch Public

Unofficial PyTorch implementation of "FixMatch: Simplifying Semi-Supervised Learning with Consistency and Confidence"

Python 794 181
MPL-pytorch MPL-pytorch Public

Unofficial PyTorch implementation of "Meta Pseudo Labels"

Python 390 70
rl_pytorch rl_pytorch Public

Deep Reinforcement Learning Algorithms Implementation in PyTorch

Jupyter Notebook 27 4
reinforcement-learning-kr/alpha_omok reinforcement-learning-kr/alpha_omok Public

Minimal version of DeepMind AlphaZero

Python 83 21
reinforcement-learning-kr/distributional_rl reinforcement-learning-kr/distributional_rl Public

Repository for studying distributional rl

Python 30 8