[ICLR 2025] IterComp: Iterative Composition-Aware Feedback Learning from Model Gallery for Text-to-Image Generation
Dataset collection and preprocessing framework for NLP extreme multitask learning
[CVPR 2026] Official Code for "ARM-Thinker: Reinforcing Multimodal Generative Reward Models with Agentic Tool Use and Visual Reasoning"
Official implementation of the ICLR 2025 paper: Rethinking Bradley-Terry Models in Preference-based Reward Modeling: Foundations, Theory, and Alternatives
A comprehensive collection of work on learning from rewards in the post-training and test-time scaling of LLMs, with a focus on both reward models and learning strategies across the training, inference, and post-inference stages.
This training offers an intensive exploration of frontier reinforcement learning techniques for large language models (LLMs). It covers advanced topics such as Reinforcement Learning from Human Feedback (RLHF), Reinforcement Learning from AI Feedback (RLAIF), and reasoning LLMs, and demonstrates practical applications such as fine-tuning.
[CVPR 2025] Science-T2I: Addressing Scientific Illusions in Image Synthesis
An easy Python package for running quick, basic QA evaluations. The package includes standardized QA evaluation metrics and semantic evaluation metrics: black-box and open-source large language model prompting and evaluation, exact match, F1 score, PEDANT semantic match, and transformer match. It also supports prompting the OpenAI and Anthropic APIs.
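As a point of reference for the metrics this package names, here is a minimal sketch of SQuAD-style exact match and token-level F1; the function names are illustrative, not the package's actual API:

```python
# Illustrative sketch of standard QA metrics (exact match, token-level F1);
# not taken from the package above -- its real API may differ.
from collections import Counter

def exact_match(prediction: str, reference: str) -> bool:
    """Strict string equality after lowercasing and trimming whitespace."""
    return prediction.strip().lower() == reference.strip().lower()

def token_f1(prediction: str, reference: str) -> float:
    """Harmonic mean of token-level precision and recall."""
    pred_tokens = prediction.lower().split()
    ref_tokens = reference.lower().split()
    common = Counter(pred_tokens) & Counter(ref_tokens)  # min counts per token
    overlap = sum(common.values())
    if overlap == 0:
        return 0.0
    precision = overlap / len(pred_tokens)
    recall = overlap / len(ref_tokens)
    return 2 * precision * recall / (precision + recall)

print(token_f1("the cat sat", "a cat sat"))  # 0.666...
```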
Official Repo for Paper: "Reward Auditor: Inference on Reward Modeling Suitability in Real-World Perturbed Scenarios"
Learning to route instances for Human vs AI Feedback (ACL Main '25)
Revealing and unlocking the context boundary of reward models
[ACL 2024 Findings] DMoERM: Recipes of Mixture-of-Experts for Effective Reward Modeling
ToolRM: Towards Agentic Tool-Use Reward Modeling
Code for SFT and RL
Implementation for our COLM paper "Off-Policy Corrected Reward Modeling for RLHF"
The code used in the paper "DogeRM: Equipping Reward Models with Domain Knowledge through Model Merging"
Source code of our paper "Transferring Textual Preferences to Vision-Language Understanding through Model Merging", ACL 2025
A Group Relative Reward Model (GRRM) framework that improves machine translation quality and reasoning by ranking candidates within a group through comparative analysis, rather than evaluating each candidate with an isolated metric.
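To illustrate the group-relative idea in the GRRM description above (this is a sketch in that spirit, not the authors' implementation), a candidate's reward can be standardized against the other candidates for the same source sentence instead of being scored in isolation:

```python
# Illustrative group-relative reward signal: standardize each candidate's
# raw score against its group. Not the GRRM authors' code.
import statistics

def group_relative_rewards(raw_scores: list[float]) -> list[float]:
    """Z-score each candidate's reward within its group."""
    mean = statistics.mean(raw_scores)
    std = statistics.pstdev(raw_scores) or 1.0  # guard against zero spread
    return [(s - mean) / std for s in raw_scores]

# Four translations of the same source sentence, scored by some base metric:
print(group_relative_rewards([0.71, 0.65, 0.80, 0.62]))
```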
Building an LLM with RLHF involves fine-tuning on human-labeled preferences. Following "Learning to Summarize from Human Feedback", it combines supervised learning, reward modeling, and PPO to improve response quality and alignment.
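Several of the repositories above train reward models on pairwise human preferences. As a reference point, here is a minimal sketch (not taken from any listed repo) of the Bradley-Terry pairwise loss that underlies this setup, assuming the reward model emits one scalar per response:

```python
# Minimal sketch of the pairwise (Bradley-Terry) reward-model loss used in
# RLHF pipelines such as "Learning to Summarize from Human Feedback":
# train the reward model so the human-preferred response scores higher.
import torch
import torch.nn.functional as F

def pairwise_reward_loss(r_chosen: torch.Tensor, r_rejected: torch.Tensor) -> torch.Tensor:
    """Negative log-likelihood: -log sigmoid(r_chosen - r_rejected)."""
    return -F.logsigmoid(r_chosen - r_rejected).mean()

# Example: scalar rewards for a batch of three preference pairs.
r_chosen = torch.tensor([1.2, 0.3, 2.1])
r_rejected = torch.tensor([0.4, 0.9, 1.0])
print(pairwise_reward_loss(r_chosen, r_rejected))  # lower when chosen > rejected
```

The fitted reward model then provides the scalar reward signal that PPO maximizes during the RL fine-tuning stage.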
RewardAnything: Generalizable Principle-Following Reward Models