Tag: Reinforcement Learning
All the papers with the tag "Reinforcement Learning".
Reinforced MLLM: A Survey on RL-Based Reasoning in Multimodal Large Language Models
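grok-3-latest | Score: 0.62 | Published: at 03:14
This paper systematically surveys the application of reinforcement learning (RL) to reasoning in multimodal large language models (MLLMs). It analyzes algorithm design, reward mechanisms, and application scenarios, identifies current limitations and future directions, and offers a structured guide for multimodal reasoning research.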