Abstract
Classifier-Free Guidance is reinterpreted as a control system for flow-based diffusion models, with a novel sliding mode control approach improving semantic alignment and stability across various guidance scales.
Classifier-Free Guidance (CFG) has emerged as a central approach for enhancing semantic alignment in flow-based diffusion models. In this paper, we explore a unified framework called CFG-Ctrl, which reinterprets CFG as a control applied to the first-order continuous-time generative flow, using the conditional-unconditional discrepancy as an error signal to adjust the velocity field. From this perspective, we summarize vanilla CFG as a proportional controller (P-control) with fixed gain, and typical follow-up variants develop extended control-law designs derived from it. However, existing methods mainly rely on linear control, inherently leading to instability, overshooting, and degraded semantic fidelity especially on large guidance scales. To address this, we introduce Sliding Mode Control CFG (SMC-CFG), which enforces the generative flow toward a rapidly convergent sliding manifold. Specifically, we define an exponential sliding mode surface over the semantic prediction error and introduce a switching control term to establish nonlinear feedback-guided correction. Moreover, we provide a Lyapunov stability analysis to theoretically support finite-time convergence. Experiments across text-to-image generation models including Stable Diffusion 3.5, Flux, and Qwen-Image demonstrate that SMC-CFG outperforms standard CFG in semantic alignment and enhances robustness across a wide range of guidance scales. Project Page: https://hanyang-21.github.io/CFG-Ctrl
Community
Project page: https://hanyang-21.github.io/CFG-Ctrl
GitHub repo: https://github.com/hanyang-21/CFG-Ctrl
This is an automated message from the Librarian Bot. I found the following papers similar to this paper.
The following papers were recommended by the Semantic Scholar API
- Momentum Guidance: Plug-and-Play Guidance for Flow Models (2026)
- Improving Classifier-Free Guidance of Flow Matching via Manifold Projection (2026)
- Bridging Diffusion Guidance and Anderson Acceleration via Hopfield Dynamics (2026)
- Training-Free Representation Guidance for Diffusion Models with a Representation Alignment Projector (2026)
- Free Lunch for Stabilizing Rectified Flow Inversion (2026)
- HyperAlign: Hypernetwork for Efficient Test-Time Alignment of Diffusion Models (2026)
- Rethinking Preference Alignment for Diffusion Models with Classifier-Free Guidance (2026)
Please give a thumbs up to this comment if you found it helpful!
If you want recommendations for any Paper on Hugging Face checkout this Space
You can directly ask Librarian Bot for paper recommendations by tagging it in a comment: @librarian-bot recommend
Models citing this paper 0
No model linking this paper
Datasets citing this paper 0
No dataset linking this paper
Spaces citing this paper 0
No Space linking this paper
Collections including this paper 0
No Collection including this paper