Reward function compression facilitates goal-dependent reinforcement learning
이 뉴스, 어떠셨어요?
한 번의 탭으로 반응을 남겨요 · 로그인 불필요
Abstract
Humans can uniquely assign value to novel, abstract outcomes to support reinforcement learning.
However, this flexibility is cognitively costly and reduces learning efficiency.
We propose that goal-dependent learning initially relies on capacity-limited working memory.
With consistent experience, learners create a "compressed" reward function - a simplified goal rule -- that transfers to long-term memory for a more automatic evaluation upon receiving feedback.
This automaticity frees working memory resources, thereby boosting learning efficiency.
Across six experiments, we demonstrate that learning is impaired by the size of the goal space but improves when this space allows for compression.
Additionally, faster reward processing correlates with better learning.
Although the algorithmic details remain to be established, our behavioral results and computational models suggest that efficient goal-directed learning relies on compressing complex goal information into a stable reward function.
These findings illuminate the cognitive mechanisms of intrinsic motivation and can inform behavioral interventions supporting human goal achievement.