📄 论文解读

AI 自省会越学越错，得找别人来挑刺

信赖通道 ▲ 48 AI 自省陷阱多智能体协作经验学习共识验证

让 AI 自己执行任务、自己总结经验、自己记住教训——听起来很合理，但论文发现这其实是个陷阱：AI 会把自己犯的错也当成正确经验存下来，越学越偏。研究者提出一个三阶段框架：先让多个不同版本的 AI 并行执行同一任务，再让一个独立的第三方 AI 对比这些轨迹、提炼候选经验，最后让执行组投票验证，只有通过共识的经验才写入记忆。在三个长周期任务基准上，这套方法一致优于传统自省式学习。它不是你明天能用上的，但它揭示了一个反直觉的事实：AI 的自我进化需要引入外部视角，否则就是闭门造车。

📄 原文摘要(英文)

Experience-driven self-evolution is critical for large language model (LLM) agents to improve through open-world interaction. However, existing experience learning methods mostly rely on single-agent loops, where the same agent executes tasks, summarizes outcomes, and determines memory content. This setup makes agents vulnerable to the Self-Confirmation Trap: wrong-but-self-consistent trajectories are misidentified as successful experience, leading to cumulative errors during retrieval and reuse. To address this issue, we propose EDV, an Execute-Distill-Verify framework for reliable experience learning. In the Execute stage, multiple heterogeneous agents explore the same task space in parallel to generate diverse candidate trajectories. In the Distill stage, a dedicated third-party agent comparatively analyzes these trajectories to produce candidate experiences, reducing executor-centric summarization bias. In the Verify stage, the execution group validates candidates via a consensus mechanism, and only approved experiences are written into shared or private memory. By decoupling the three stages, EDV transforms experience learning from isolated self-reflection into collaborative construction, filtering erroneous and noisy content before memory insertion. We evaluate EDV on three challenging long-horizon benchmarks: tau2-bench, Mind2Web and MMTB. Results show EDV consistently outperforms strong baselines, validating that reliable experience construction is essential for robust agent self-evolution. Our code is available at https://github.com/shidingz/EDV.

arXiv 原文

📬 订阅 AI Pulse