Interact3D: Compositional 3D Generation of Interactive Objects

arXiv CS.AI


CC BY

This outlet displays the full text directly under a public or free license.

Abstract

Recent breakthroughs in 3D generation have enabled the synthesis of high-fidelity individual assets.

However, generating 3D compositional objects from single images, particularly under occlusions, remains challenging.

Existing methods often degrade geometric details in hidden regions and fail to preserve the underlying object-object spatial relationships (OOR).

We present Interact3D, a novel framework designed to generate physically plausible interacting 3D compositional objects.

Our approach first leverages advanced generative priors to curate high-quality individual assets with a unified 3D guidance scene.

To physically compose these assets, we then introduce a robust two-stage composition pipeline.

Based on the 3D guidance scene, the primary object is anchored through precise global-to-local geometric alignment (registration), while subsequent geometries are integrated using a differentiable Signed Distance Field (SDF)-based optimization that explicitly penalizes geometry intersections.
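
The intersection penalty described above can be illustrated with a toy example. This is a minimal sketch under stated assumptions, not the paper's implementation: the primary object is a unit sphere with an analytic SDF, the secondary object is a handful of surface points, only a translation `t` is optimized, and a small quadratic term (our assumption) keeps `t` near the position suggested by the guidance scene. All function names and parameters here are hypothetical.

```python
import math

def sphere_sdf(p, radius=1.0):
    """Signed distance from point p to a sphere at the origin (negative inside)."""
    return math.sqrt(sum(c * c for c in p)) - radius

def sphere_sdf_grad(p):
    """Gradient of the sphere SDF: unit vector pointing away from the center."""
    n = math.sqrt(sum(c * c for c in p)) or 1e-12
    return [c / n for c in p]

def penetration_loss_and_grad(points, t, t_guide, weight=0.1):
    """Loss = sum of squared penetration depths + weight * ||t - t_guide||^2.

    Returns the loss and its gradient with respect to the translation t.
    """
    loss = weight * sum((a - b) ** 2 for a, b in zip(t, t_guide))
    grad = [2.0 * weight * (a - b) for a, b in zip(t, t_guide)]
    for p in points:
        q = [pc + tc for pc, tc in zip(p, t)]
        depth = max(0.0, -sphere_sdf(q))  # positive only when q is inside
        if depth > 0.0:
            loss += depth ** 2
            g = sphere_sdf_grad(q)
            # d(depth^2)/dt = -2 * depth * grad(sdf)
            for i in range(3):
                grad[i] -= 2.0 * depth * g[i]
    return loss, grad

def resolve_intersection(points, t_guide, steps=200, lr=0.1):
    """Plain gradient descent on the translation, starting at the guidance pose."""
    t = list(t_guide)
    for _ in range(steps):
        _, grad = penetration_loss_and_grad(points, t, t_guide)
        t = [tc - lr * gc for tc, gc in zip(t, grad)]
    return t

# Secondary object: corners of a small cube, initially overlapping the sphere.
cube = [[x, y, z] for x in (-0.2, 0.2) for y in (-0.2, 0.2) for z in (-0.2, 0.2)]
guide = [0.9, 0.0, 0.0]
loss_before, _ = penetration_loss_and_grad(cube, guide, guide)
t_final = resolve_intersection(cube, guide)
loss_after, _ = penetration_loss_and_grad(cube, t_final, guide)
print(loss_before, loss_after, t_final)
```

Because the penalty is soft, the optimizer settles where the push out of the sphere balances the pull toward the guidance pose, so a small residual overlap can remain. A real pipeline would presumably use a learned or grid-based SDF and optimize a full pose rather than a translation alone.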

To reduce challenging collisions, we further deploy a closed-loop, agentic refinement strategy.

A Vision-Language Model (VLM) autonomously analyzes multi-view renderings of the composed scene, formulates targeted corrective prompts, and guides an image editing module to iteratively self-correct the generation pipeline.
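
The control flow of such a closed loop can be sketched as follows. This is an illustrative skeleton only: the abstract does not specify any interfaces, so `compose`, `render`, `critique`, and `edit` are hypothetical callables standing in for the composition pipeline, the multi-view renderer, the VLM, and the image editing module, and the toy stand-ins at the bottom exist only to make the loop runnable.

```python
from dataclasses import dataclass
from typing import Optional

@dataclass
class Critique:
    acceptable: bool
    prompt: Optional[str] = None  # corrective edit instruction, if any

def refine(image, compose, render, critique, edit, max_rounds=3):
    """Compose, inspect, and re-edit until the critic accepts or rounds run out."""
    scene = compose(image)
    history = []
    for _ in range(max_rounds):
        verdict = critique(render(scene))
        if verdict.acceptable:
            break
        history.append(verdict.prompt)
        image = edit(image, verdict.prompt)  # apply the corrective prompt
        scene = compose(image)               # regenerate from the edited image
    return scene, history

# Toy stand-ins (assumptions): the "image" carries a single overlap score.
def toy_compose(image):
    return {"overlap": image["overlap"]}

def toy_render(scene):
    return [scene] * 4  # pretend these are four camera views

def toy_critique(views):
    if max(v["overlap"] for v in views) <= 0.05:
        return Critique(acceptable=True)
    return Critique(acceptable=False, prompt="separate the two objects")

def toy_edit(image, prompt):
    return {"overlap": image["overlap"] * 0.5}

scene, prompts = refine({"overlap": 0.3}, toy_compose, toy_render,
                        toy_critique, toy_edit, max_rounds=5)
print(scene, prompts)
```

Capping the number of rounds is a design choice of this sketch; it bounds cost when the critic never accepts the scene.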

Extensive experiments demonstrate that Interact3D produces promising collision-aware compositions with improved geometric fidelity and consistent spatial relationships.

