Reward function compression facilitates goal-dependent reinforcement learning

arXiv Q-Bio

이 뉴스, 어떠셨어요?

한 번의 탭으로 반응을 남겨요 · 로그인 불필요

CC BY

이 매체는 공공·자유 라이선스로 본문을 직접 표시합니다.

Abstract

Humans can uniquely assign value to novel, abstract outcomes to support reinforcement learning.

However, this flexibility is cognitively costly and reduces learning efficiency.

We propose that goal-dependent learning initially relies on capacity-limited working memory.

With consistent experience, learners create a "compressed" reward function - a simplified goal rule -- that transfers to long-term memory for a more automatic evaluation upon receiving feedback.

This automaticity frees working memory resources, thereby boosting learning efficiency.

Across six experiments, we demonstrate that learning is impaired by the size of the goal space but improves when this space allows for compression.

Additionally, faster reward processing correlates with better learning.

Although the algorithmic details remain to be established, our behavioral results and computational models suggest that efficient goal-directed learning relies on compressing complex goal information into a stable reward function.

These findings illuminate the cognitive mechanisms of intrinsic motivation and can inform behavioral interventions supporting human goal achievement.

전문 보기

Reward function compression facilitates goal-dependent reinforcement learning

이 뉴스, 어떠셨어요?

Abstract

관련 뉴스

'research' 카테고리 뉴스

Constructive Alignment: Governing Preference Dynamics in Human-AI Interaction

Bounded Morality: Defining the Space of Moral Computation

The MMM Data Model -- A Normative Specification for Knowledge Interoperability in a Decentralisable Knowledge Commons

arXiv의 다른 기사

RareDxR1: Autonomous Medical Reasoning for Rare Disease Diagnosis Beyond Human Annotation

A Contextual-Bandit Oversight Game with Two-Sided Informational Asymmetry

Constructing Epistemic AI Literacy: Detecting Epistemic Aims and Processes in Student-AI Co-Programming