可验证过程奖励在提升大模型推理效率中的探索与实践

discobot 2025 年10 月 10 日 13:00 1

这是一个从 https://tech.meituan.com/2025/10/10/vsrm.html 下的原始话题分离的讨论话题