Pull requests: NVIDIA-NeMo/RL

Pull requests list

ci: Bump Megatron-Bridge to 6e8c6bb CI:L1 Run doctests, unit tests, and functional tests
#2860 opened Jun 17, 2026 by svcnvidia-nemo-ci Contributor
fix: Fix the default setting in Nemo Gym Nano v3 recipe config CI:Lfast Runs a fast test suite and re-use nightly `main` container (but sync dependencies to PRs version)
#2857 opened Jun 17, 2026 by snowmanwwg Contributor
4 tasks
fix(data): stabilize multi-turn chat chunking and tokenization CI:Lfast Runs a fast test suite and re-use nightly `main` container (but sync dependencies to PRs version)
#2856 opened Jun 17, 2026 by jinglinglingling Contributor
ci: Add super nightly tests Documentation Improvements or additions to documentation
#2855 opened Jun 16, 2026 by ashors1 Contributor Draft
4 tasks
docs(xtoken): X-Token distillation guide and README updates Documentation Improvements or additions to documentation
#2854 opened Jun 16, 2026 by avenkateshha Contributor
fix: missing validation logging in distillation CI:Lfast Runs a fast test suite and re-use nightly `main` container (but sync dependencies to PRs version) community-request
#2847 opened Jun 16, 2026 by odedovadia Contributor
2 of 4 tasks
test: add vLLM HTTP logprobs contract test for NeMo-Gym capture CI:Lfast Runs a fast test suite and re-use nightly `main` container (but sync dependencies to PRs version)
#2845 opened Jun 16, 2026 by ananthsub Contributor
test(data_plane): session-scope mooncake fixtures CI:Lfast Runs a fast test suite and re-use nightly `main` container (but sync dependencies to PRs version)
#2838 opened Jun 16, 2026 by ZhiyuLi-Nvidia Contributor
feat: Support for dtensor ppo CI:Lfast Runs a fast test suite and re-use nightly `main` container (but sync dependencies to PRs version)
#2837 opened Jun 16, 2026 by fujial-code Draft
feat: Support Linear CE Loss Fusion for GRPO community-request Documentation Improvements or additions to documentation waiting-on-customer Waiting on the original author to respond
#2833 opened Jun 16, 2026 by pengdurice Contributor
4 tasks done
ci: forward SANDBOX_CONTAINER/COMMAND/ENV_VARS to ray.sub
#2832 opened Jun 16, 2026 by kajalj22 Contributor Draft
3 tasks
Asyncrl/sc sync weights
#2831 opened Jun 16, 2026 by mehraakash
4 tasks
ci: List bundled codecs
#2830 opened Jun 15, 2026 by kajalj22 Contributor Draft
1 task
feat: super-v3 recipe and docs CI:L0 Run doctests and unit tests Documentation Improvements or additions to documentation super-v3
#2829 opened Jun 15, 2026 by macandro96 Contributor
4 tasks
feat: async checkpointing for Megatron policy workers CI:Lfast Runs a fast test suite and re-use nightly `main` container (but sync dependencies to PRs version)
#2828 opened Jun 15, 2026 by ananthsub Contributor
2 of 4 tasks
perf: reduce srun overhead in ray.sub and gate driver on sandbox readiness CI:Lfast Runs a fast test suite and re-use nightly `main` container (but sync dependencies to PRs version)
#2827 opened Jun 15, 2026 by ananthsub Contributor
4 tasks
feat(ppo): in-model value head for Megatron PPO CI:L1 Run doctests, unit tests, and functional tests
#2825 opened Jun 15, 2026 by bg51717 Contributor
3 of 4 tasks
feat: video + audio understanding GRPO training recipe CI:L1 Run doctests, unit tests, and functional tests Documentation Improvements or additions to documentation
#2823 opened Jun 15, 2026 by yuekaizhang Contributor
feat: single controller (w/o sync_weight)
#2819 opened Jun 15, 2026 by yuki-97 Contributor Draft