-
Notifications
You must be signed in to change notification settings - Fork 223
Pull requests: NVIDIA-NeMo/RL
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
fix: Unify custom model logits extraction across all inference methods
CI:L2
Run doctests, unit tests, functional tests, and convergence tests
#1815
opened Jan 23, 2026 by
zpqiu
Loading…
4 tasks
build(deps): bump vllm from 0.11.2 to 0.14.0
dependencies
Pull requests that update a dependency file
python:uv
Pull requests that update python:uv code
#1811
opened Jan 22, 2026 by
dependabot
bot
Loading…
chore: cuda13 support
CI:L2
Run doctests, unit tests, functional tests, and convergence tests
#1803
opened Jan 21, 2026 by
guyueh1
Loading…
4 tasks
feat: Timer for the data sharding and job submission
CI:L2
Run doctests, unit tests, functional tests, and convergence tests
#1802
opened Jan 21, 2026 by
guyueh1
Loading…
4 tasks
feat: Support lora in dtensor grpo workflow by merging weight
CI:L1
Run doctests, unit tests, and functional tests
feat: add speculative decoding during post-training
#1785
opened Jan 15, 2026 by
isomap
Loading…
2 of 4 tasks
feat: NeMo Gym GRPO on-policy fix params; Per-agent group-level rewards
CI:L1
Run doctests, unit tests, and functional tests
#1779
opened Jan 15, 2026 by
bxyu-nvidia
Loading…
4 tasks
[don't merge] split train and val dataset in preference dataset
CI:L1
Run doctests, unit tests, and functional tests
documentation
Improvements or additions to documentation
[docs] Document Gym + RL integration design
documentation
Improvements or additions to documentation
feat: refactor train utilities for dtensor policy v2
#1757
opened Jan 10, 2026 by
hemildesai
•
Draft
4 tasks
feat: Support lora in dtensor grpo workflow[3/3]: async vllm
CI:L1
Run doctests, unit tests, and functional tests
#1752
opened Jan 9, 2026 by
RayenTian
Loading…
7 tasks
feat: Support lora in dtensor grpo workflow[2/3]: sync and non-colocated setup
CI:L1
Run doctests, unit tests, and functional tests
#1751
opened Jan 9, 2026 by
RayenTian
Loading…
4 tasks
feat: Support lora in dtensor grpo workflow[1/3]: sync and colocated setup
CI:L1
Run doctests, unit tests, and functional tests
#1748
opened Jan 9, 2026 by
RayenTian
Loading…
4 of 9 tasks
feat: Add CUDA Graph configuration support to MegatronPolicyWorker
community-request
#1736
opened Jan 7, 2026 by
sahgerlad
Loading…
2 of 4 tasks
feat: refactor common data utilities of dtensor policy v2
CI:L0
Run doctests and unit tests
#1710
opened Jan 5, 2026 by
hemildesai
Loading…
4 tasks
[don't merge] support multiple datasets for response dataset
CI:L1
Run doctests, unit tests, and functional tests
documentation
Improvements or additions to documentation
Previous Next
ProTip!
Exclude everything labeled
bug with -label:bug.