Latent-Mark: An Audio Watermark Robust to Neural Codec Compression
이 뉴스, 어떠셨어요?
한 번의 탭으로 반응을 남겨요 · 로그인 불필요
Abstract
While existing audio watermarking techniques have achieved strong robustness against traditional digital signal processing (DSP) attacks, they remain vulnerable to neural compression.
This occurs because modern neural audio codecs act as noise filters and discard the imperceptible waveform variations used in prior watermarking methods.
To address this limitation, we propose Latent-Mark, the first zero-bit audio watermarking framework designed to survive neural codec compression.
Our key insight is that robustness to the encode-decode process requires embedding the watermark within the codec's invariant latent space.
We achieve this by optimizing the audio waveform to induce a detectable directional shift in its encoded latent representation, while constraining perturbations to align with the natural audio manifold to ensure imperceptibility.
To prevent overfitting to a single codec's quantization rules, we introduce Cross-Codec Optimization, jointly optimizing the waveform across multiple surrogate codecs to target shared latent invariants.
Extensive evaluations demonstrate robust zero-shot transferability to unseen neural codecs, achieving competitive resilience against traditional DSP attacks while preserving perceptual imperceptibility.
We hope our work will inspire future research into universal watermarking frameworks capable of maintaining integrity across increasingly complex and diverse generative distortions.