AI News HubLIVE
Original source2 min read

Local Mechanisms of Compositional Generalization in Conditional Diffusion

The paper investigates how conditional diffusion models achieve compositional generalization, specifically length generalization—generating images with more objects than seen during training. Experiments on CLEVR show success in some cases but not all, pointing to the importance of local conditional scores. The authors prove an equivalence between compositional structure and local conditional scores, and demonstrate that enforcing locality enables generalization in failing models. Analysis of SDXL reveals spatial locality but absent conditional locality in pixel space, yet evidence of local conditional scores in feature space.

Local Mechanisms of Compositional Generalization in Conditional Diffusion - Apple Machine Learning Research

Machine Learning Research

Open MenuClose Menu

Overview

Research Highlights

Publications

Events

Work with us

research area Methods and Algorithms

content type paperpublished April 2026

Local Mechanisms of Compositional Generalization in Conditional Diffusion

AuthorsArwen Bradley

View publication

Copy Bibtex

Conditional diffusion models appear capable of compositional generalization, i.e., generating convincing samples for out-of-distribution combinations of conditioners, but the mechanisms underlying this ability remain unclear. To make this concrete, we study length generalization, the ability to generate images with more objects than seen during training. In a controlled CLEVR setting (Johnson et al.,2017), we find that length generalization is achievable in some cases but not others, suggesting that models only sometimes learn the underlying compositional structure. We then investigate locality as a structural mechanism for compositional generalization. Prior works proposed score locality as a mechanism for creativity in unconditional diffusion models (Kamb & Ganguli, 2024; Niedoba et al., 2024), but did not address flexible conditioning or compositional generalization. In this paper, we prove an exact equivalence between a specific compositional structure (conditional projective composition) (Bradley et al., 2025) and scores with sparse dependencies on both pixels and conditioners (local conditional scores). This theory also extends to compositions of concepts (such as style+content) in feature-space. We validate our theory empirically: CLEVR models that succeed at length generalization exhibit local conditional scores, while those that fail do not. Furthermore, we show that a causal intervention explicitly enforcing local conditional scores enables length generalization in a previously failing model. Finally, we investigate SDXL and find that in pixel-space, spatial locality is present but conditional-locality is mostly absent; however, we find quantitative evidence of local conditional scores in the network’s learned feature-space.

Related readings and updates.

Local Mechanisms of Compositional Generalization in Conditional Diffusion

October 9, 2025research area Methods and Algorithms

Conditional diffusion models appear capable of compositional generalization, i.e., generating convincing samples for out-of-distribution combinations of conditioners, but the mechanisms underlying this ability remain unclear. To make this concrete, we study length generalization, the ability to generate images with more objects than seen during training. In a controlled CLEVR setting (Johnson et al., 2017), we find that length generalization is…

Read more

Mechanisms of Projective Composition of Diffusion Models

May 1, 2025research area Computer Vision, research area Methods and Algorithmsconference ICML

We study the theoretical foundations of composition in diffusion models, with a particular focus on out-of-distribution extrapolation and length-generalization. Prior work has shown that composing distributions via linear score combination can achieve promising results, including length-generalization in some cases (Du et al., 2023; Liu et al., 2022). However, our theoretical understanding of how and why such compositions work remains incomplete…

Read more

Discover opportunities in Machine Learning.

Our research in machine learning breaks new ground every day.

Work with us

Machine Learning Research

Publications

Local Mechanisms of Compositional Generalization in Conditional Diffusion

Privacy Policy

Terms of Use

Legal

Copyright © 2026 Apple Inc. All rights reserved.