Skip to content

fix: copy dict in trajectory_logging to avoid mutating original (pres…

8145580
Select commit
Loading
Failed to load commit list.
Sign in for the full log view
Draft

feat: enable GRPO training with logprobs from offline trajectory data #467

fix: copy dict in trajectory_logging to avoid mutating original (pres…
8145580
Select commit
Loading
Failed to load commit list.

Annotations

1 error
quality-checks
failed Dec 11, 2025 in 1m 49s