rlvr
Here are 39 public repositories matching this topic...
Awesome List for Agentic RL
-
Updated
Feb 27, 2026 - HTML
[EMNLP'25] s3 - ⚡ Efficient & Effective Search Agent Training via RL for RAG (RLVR for Search with Minimal Data)
-
Updated
Nov 5, 2025 - Python
Official repository for "RLVR-World: Training World Models with Reinforcement Learning" (NeurIPS 2025), https://arxiv.org/abs/2505.13934
-
Updated
Oct 28, 2025 - Python
[ICLR 2026] An official implementation of "CapRL: Stimulating Dense Image Caption Capabilities via Reinforcement Learning"
-
Updated
Feb 8, 2026 - Python
A curated list of awesome resources about reward construction for AI agents. This repository covers cutting-edge research, and practical guides on defining and collecting rewards to build more intelligent and aligned AI agents.
-
Updated
Sep 1, 2025
🐝 SwarmBench: Benchmarking LLMs' Swarm Intelligence
-
Updated
May 21, 2025 - Python
This is the official code of DeepSearch [ICLR 2026]
-
Updated
Oct 22, 2025 - Python
grpo to train long form QA and instructions with long-form reward model
-
Updated
Jul 17, 2025 - Python
MOTIF: Modular Thinking via Reinforcement Fine-tuning in LLMs
-
Updated
Jul 6, 2025 - Python
The official repository of the paper "Do Reasoning Models Enhance Embedding Models?"
-
Updated
Feb 20, 2026 - Python
[arXiv] "Not All Steps are Informative: On the Linearity of LLMs’ RLVR Training"
-
Updated
Feb 1, 2026 - Python
A self-distillation based training method for long context reasoning in a single LLM without reinforcement learning
-
Updated
Jan 29, 2026 - Python
Trinity-Mini-DrugProt-Think
-
Updated
Feb 23, 2026 - HTML
Improve this page
Add a description, image, and links to the rlvr topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with the rlvr topic, visit your repo's landing page and select "manage topics."