Papers
arxiv:2603.19598

FlowScene: Style-Consistent Indoor Scene Generation with Multimodal Graph Rectified Flow

Published on Mar 20
· Submitted by
Zhifei Yang
on Mar 23
Authors:
,
,
,
,
,
,
,

Abstract

FlowScene is a tri-branch generative model that combines multimodal graph conditioning with rectified flow modeling to produce realistic scenes with controlled geometry, appearance, and stylistic coherence.

AI-generated summary

Scene generation has extensive industrial applications, demanding both high realism and precise control over geometry and appearance. Language-driven retrieval methods compose plausible scenes from a large object database, but overlook object-level control and often fail to enforce scene-level style coherence. Graph-based formulations offer higher controllability over objects and inform holistic consistency by explicitly modeling relations, yet existing methods struggle to produce high-fidelity textured results, thereby limiting their practical utility. We present FlowScene, a tri-branch scene generative model conditioned on multimodal graphs that collaboratively generates scene layouts, object shapes, and object textures. At its core lies a tight-coupled rectified flow model that exchanges object information during generation, enabling collaborative reasoning across the graph. This enables fine-grained control of objects' shapes, textures, and relations while enforcing scene-level style coherence across structure and appearance. Extensive experiments show that FlowScene outperforms both language-conditioned and graph-conditioned baselines in terms of generation realism, style consistency, and alignment with human preferences.

Community

Paper author Paper submitter

scene generation

Sign up or log in to comment

Models citing this paper 0

No model linking this paper

Cite arxiv.org/abs/2603.19598 in a model README.md to link it from this page.

Datasets citing this paper 0

No dataset linking this paper

Cite arxiv.org/abs/2603.19598 in a dataset README.md to link it from this page.

Spaces citing this paper 0

No Space linking this paper

Cite arxiv.org/abs/2603.19598 in a Space README.md to link it from this page.

Collections including this paper 2