Retention Profiles and KL Contraction Bounds in Finite Markov Chains

arXiv Math

이 뉴스, 어떠셨어요?

한 번의 탭으로 반응을 남겨요 · 로그인 불필요

CC BY

이 매체는 공공·자유 라이선스로 본문을 직접 표시합니다.

Abstract

We study Kullback-Leibler (KL) contraction in finite Markov chains through a row-wise perspective.

Evaluating the SDPI ratio at point masses yields a state-indexed retention profile $r(x)=D_{\mathrm{KL}}(P(x,\cdot)\|\pi)/\log(1/\pi(x))$ and a localization ratio $L(P)=\bar r_\pi/M\in[0,1]$ (with $M=\max_x r(x)$, $\bar r_\pi=\mathbb{E}_\pi r$) that distinguishes localized from global contraction obstructions.

Our main contributions are (i) a convexity-gap identity showing that the gap between the row-averaged divergence and $D_{\mathrm{KL}}(\mu P\|\pi)$ equals the mutual information $I_\mu(X;Y)$, and a derived decomposition of the contraction ratio into entropy inflation and a mutual-information penalty; (ii) a Cheeger-type lower bound on $M$, tying the bottleneck geometry of $P$ directly to the row-retention profile; (iii) an explicit construction proving that $L(P_n)\to 0$ does not force $\eta_{\mathrm{KL}}(P_n)/M_n\to 1$, identifying cardinality of high-retention states (not their $\pi$-mass) as the decisive quantity.

Alongside these, we record structural consequences, optimal Markov/reverse-Markov tail bounds for $r$, a Bhatia-Davis variance bound, two-sided spectral bounds with an explicit cubic correction, a KL/Pinsker mixing-time bound, and tensorization for product chains.

We further show that $L(P)$ is structurally decoupled from the spectral gap, the Cheeger constant, and the mixing time: every vertex-transitive chain satisfies $L(P)=1$ regardless of its mixing speed, and the empirical rank correlations between $L(P)$ and these classical invariants on a diverse but limited test suite are essentially zero.

The numerical experiments are exploratory and not used as evidence for a universal classification theorem.

전문 보기

Retention Profiles and KL Contraction Bounds in Finite Markov Chains

이 뉴스, 어떠셨어요?

Abstract

관련 뉴스

'research' 카테고리 뉴스

Detecting and Controlling Sycophancy with Cascading Linear Features

Life After Benchmark Saturation: A Case Study of CORE-Bench

Refusal Lives Downstream of Persona in Chat Models

arXiv의 다른 기사

Knowledge-augmented Agentic AI for Mental Health Medication Information Seeking

Accelerating Skill Assessment in Chess: A Drift-Diffusion-Enhanced Elo Rating System

Governing Actions, Not Agents: Institutional Attestation as a Governance Model for Autonomous AI Systems