ECD: Energy-based Compositional Diffusion Planning

ICML 2026

Tao Sun ¹, Utkarsh A. Mishra², Jiaxin Lu³, Danfei Xu², Iro Armeni¹

¹Stanford University, ²Georgia Institute of Technology, ³University of Texas at Austin

Left: The short trajectory fragments observed by the model during training. Middle: Heuristic composition (CompDiffuser) suffers from mode drift, causing paths to cross through walls. Right: ECD uses a global energy function to maintain consistency and correct mode drift in CompDiffuser for long-horizon planning.

✨ Features

Energy Formulation for Compositional Diffusion: Instead of stitching local chunk predictions heuristically, ECD defines a single global energy function over all chunks and uses its negative gradient to guide the denoising process. The resulting score field is conservative, which keeps the plan in a consistent global mode.
Inference-Only: Extends pretrained short-horizon diffusion models to long-horizon planning without retraining or fine-tuning.
Strong Performance: Outperforms prior stitching methods (CompDiffuser) on long-horizon OGBench maze tasks, particularly on the largest environments.

🧠 How It Works

Long-horizon planning of a trajectory $x = [x_0, \dots, x_L]$ is typically approached by dividing the trajectory into overlapping chunks and running a short-horizon diffusion model on each.
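As a minimal sketch of the chunking step, the snippet below splits the state indices of a trajectory into overlapping windows. The helper `chunk_indices` and its parameters `chunk_len` and `overlap` are hypothetical names chosen for illustration; they are not taken from the ECD codebase.

```python
def chunk_indices(num_states, chunk_len, overlap):
    """Return (start, end) index pairs (end exclusive) of overlapping chunks.

    Consecutive chunks share `overlap` states. Hypothetical helper for
    illustration; not part of the ECD codebase.
    """
    if not 0 <= overlap < chunk_len:
        raise ValueError("need 0 <= overlap < chunk_len")
    if num_states < chunk_len:
        raise ValueError("trajectory is shorter than one chunk")
    stride = chunk_len - overlap
    if (num_states - chunk_len) % stride != 0:
        raise ValueError("chunks do not tile the trajectory exactly")
    return [(s, s + chunk_len) for s in range(0, num_states - chunk_len + 1, stride)]


# A trajectory x_0..x_9 (10 states) split into chunks of 4 states sharing 1 state.
chunks = chunk_indices(num_states=10, chunk_len=4, overlap=1)
print(chunks)  # [(0, 4), (3, 7), (6, 10)]
```

Each shared index (3 and 6 here) belongs to two chunks, which is where the consistency problem discussed next comes from.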

Standard stitching methods (such as CompDiffuser) enforce consistency by averaging or overwriting overlapping segments at each denoising step. However, this creates a non-conservative score field with non-zero curl, meaning it does not correspond to the gradient of a valid global density function. This mismatch often leads to mode drift and physical inconsistencies over long horizons.
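The link between "not a gradient" and "non-zero curl" can be checked numerically on a two-variable toy problem. The sketch below is an illustration of that general fact only and is not a model of CompDiffuser: `o` plays the role of a boundary coordinate, `p` an interior coordinate, and the linear map `mu(o) = A * o` is an assumed stand-in for a learned predictor. A field that updates `p` toward `mu(o)` while ignoring how `mu` depends on `o` has non-zero curl; the negative gradient of the energy `0.5 * (p - mu(o))**2` does not.

```python
A = 0.8  # slope of the toy linear predictor mu(o) = A * o (assumed for illustration)


def residual(x):
    o, p = x
    return p - A * o


def interior_only_field(x):
    """Moves only p toward mu(o); treats mu(o) as a constant."""
    return (0.0, -residual(x))


def energy_field(x):
    """Negative gradient of E(o, p) = 0.5 * (p - A * o) ** 2."""
    r = residual(x)
    return (A * r, -r)


def curl_2d(field, x, eps=1e-5):
    """Central-difference estimate of d f_p / d o - d f_o / d p at x = (o, p)."""
    o, p = x
    dfp_do = (field((o + eps, p))[1] - field((o - eps, p))[1]) / (2 * eps)
    dfo_dp = (field((o, p + eps))[0] - field((o, p - eps))[0]) / (2 * eps)
    return dfp_do - dfo_dp


point = (0.3, -1.2)
curl_interior = curl_2d(interior_only_field, point)
curl_energy = curl_2d(energy_field, point)
print(f"interior-only curl: {curl_interior:.4f}")  # close to A = 0.8
print(f"energy-field curl:  {curl_energy:.4f}")    # close to 0
```

The extra `A * r` component in `energy_field` is the toy counterpart of a term that feeds the residual back through the predictor's dependence on its conditioning input.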

ECD addresses this by defining a single global, scalar-valued energy function over all chunks, using its negative gradient to guide the denoising process:

$$\text{score}(x,t) = -\nabla_x \sum_k E_k(x,t) = \sum_k \left[ \underbrace{-\frac{1}{\sigma_t^2} S_k^T P_k^T r_k(t)}_{\text{Interior Update}} + \underbrace{\frac{1}{\sigma_t^2} S_k^T O_k^T J_{O,k}(t)^T r_k(t)}_{\text{Boundary Reaction}} \right]$$

where $r_k(t)$ is the residual error of the local coordinates:

$$r_k(t) = W_k(P_k x_k(t) - \mu^k(O_k x_k(t), t))$$
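The two terms can be recovered with a short derivation under assumptions that are not stated here and should be checked against the paper: each chunk energy is a weighted quadratic in the unweighted residual $e_k = P_k x_k - \mu^k(O_k x_k, t)$, with $x_k = S_k x$ read as the chunk selected from the full trajectory, $W_k$ symmetric and independent of $x$ (so that $r_k = W_k e_k$), and $J_{O,k}(t)$ the Jacobian of $\mu^k$ with respect to its conditioning input $O_k x_k$.

$$E_k(x,t) = \frac{1}{2\sigma_t^2}\, e_k^T W_k\, e_k, \qquad \frac{\partial e_k}{\partial x} = \left(P_k - J_{O,k}(t)\, O_k\right) S_k$$

$$-\nabla_x E_k(x,t) = -\frac{1}{\sigma_t^2} S_k^T \left(P_k - J_{O,k}(t)\, O_k\right)^T W_k\, e_k = -\frac{1}{\sigma_t^2} S_k^T P_k^T r_k(t) + \frac{1}{\sigma_t^2} S_k^T O_k^T J_{O,k}(t)^T r_k(t)$$

Under these assumptions the first term comes from the direct dependence of the residual on the interior coordinates, and the second from the dependence of the prediction $\mu^k$ on the boundary coordinates.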

Because the update is derived from the gradient of an energy function, it yields three key properties:

Interior Update: Pulls local coordinates toward each chunk's predicted local mean, preserving kinematic validity and obstacle avoidance.
Boundary Reaction: Uses a vector-Jacobian product ($J_{O,k}(t)^T r_k(t)$) to propagate boundary inconsistencies back into the chunk interiors as feedback.
Chunk Consensus: Sums local gradients across overlapping nodes to satisfy neighboring constraints simultaneously, resolving the trajectory into a single global mode.
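The consensus effect at a shared node can be seen in a deliberately simplified toy, sketched below. Two chunks over a 5-state 1-D trajectory share state index 2 and disagree about its value. The per-chunk targets are fixed constants standing in for the chunk denoisers' predicted means (an assumption made for brevity; in ECD the predictions depend on the current sample and noise level, and the boundary-reaction term is also present). Descending the summed quadratic energy settles the shared state at a single compromise value rather than letting one chunk overwrite the other.

```python
# Two chunks over a 5-state 1-D trajectory; they share state index 2.
# Targets are illustrative constants, not outputs of a real denoiser.
chunks = {
    "A": {"idx": [0, 1, 2], "target": [0.0, 0.5, 1.0]},
    "B": {"idx": [2, 3, 4], "target": [3.0, 3.5, 4.0]},
}


def summed_energy_grad(x):
    """Gradient of sum_k 0.5 * ||x[idx_k] - target_k||^2 with respect to x."""
    grad = [0.0] * len(x)
    for chunk in chunks.values():
        for i, target in zip(chunk["idx"], chunk["target"]):
            grad[i] += x[i] - target  # contributions add up at shared indices
    return grad


x = [0.0] * 5
step = 0.25
for _ in range(200):
    grad = summed_energy_grad(x)
    x = [xi - step * gi for xi, gi in zip(x, grad)]

print([round(v, 3) for v in x])  # [0.0, 0.5, 2.0, 3.5, 4.0]
```

State 2 ends at 2.0, the minimizer of the two chunks' combined energy, while the non-shared states reach their own chunk's target.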

Below is a method comparison figure illustrating the difference between the reverse diffusion processes of CompDiffuser and ECD. For complete details, please see our paper.

🚀 Quick Start

Installation

conda create -n ecd python=3.9 -y
conda activate ecd

If you want to use GPU acceleration, please install a CUDA build of PyTorch before the other requirements:

# Install PyTorch with CUDA (please adjust the cuda version to match your system if needed):
pip install "torch>=2.8.0" --index-url https://download.pytorch.org/whl/cu128

# Then install the remaining dependencies
pip install -r requirements.txt
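After installation, a quick way to confirm that the CUDA build of PyTorch is the one being picked up is sketched below. The helper name `cuda_status` is hypothetical; `torch.cuda.is_available()` is the standard PyTorch call.

```python
import importlib.util


def cuda_status():
    """Return True/False for CUDA availability, or None if torch is not installed."""
    if importlib.util.find_spec("torch") is None:
        return None
    import torch

    return torch.cuda.is_available()


print("CUDA available:", cuda_status())
```

A result of `False` with a GPU present usually points to a CPU-only wheel or a CUDA version mismatch.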

Datasets

The OGBench training and evaluation datasets can be downloaded using the provided script (default location: ~/.ogbench/data).

python tools/download_ogbench_data.py

Pretrained Checkpoints

Trained checkpoints for the environments evaluated in the paper are available on Hugging Face. Place them in the repository root directory following this structure: ./logs/<env>/{planner,invdyn,ecd_prior}/....
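A small sanity check for the layout above might look like the following sketch. The helper `missing_checkpoint_dirs` is hypothetical, and two things are assumptions: that `<env>` is the full environment ID (for example `antmaze-large-stitch-v0`), and that `pointmaze*` environments have no `invdyn` directory because no inverse-dynamics model is trained for them.

```python
from pathlib import Path


def missing_checkpoint_dirs(env, root="."):
    """List expected sub-directories of <root>/logs/<env>/ that do not exist.

    Hypothetical helper; the expected names follow the layout described above.
    """
    expected = ["planner", "invdyn", "ecd_prior"]
    if env.startswith("pointmaze"):
        expected.remove("invdyn")  # assumption: no inverse-dynamics model here
    base = Path(root) / "logs" / env
    return [name for name in expected if not (base / name).is_dir()]


# Run from the repository root; an empty list means every directory was found.
print(missing_checkpoint_dirs("antmaze-large-stitch-v0"))
```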

Demo Notebook

We provide a complete step-by-step walkthrough in demo.ipynb. It loads the antmaze-giant checkpoint, configures the baseline and ECD policies, and generates the comparison animation shown above.

💻 Training and Evaluation on OGBench

Training

Training involves two independent learned components, plus one quick data-only fit:

Component	Command	Output
Short-horizon diffusion planner	`python -m ecd.train`	`logs/<env>/planner/<planner_name>/`
Inverse-dynamics model (except `pointmaze*`)	`python -m ecd.invdyn`	`logs/<env>/invdyn/<invdyn_name>/`
Gaussian–Markov prior	`python -m ecd.fit_ecd_prior`	`logs/<env>/ecd_prior/gaussian_markov.pt`

Each environment's training and evaluation scripts are shipped under scripts/ with the paper's hyper-parameters. For example, to train on the antmaze-large environment, run:

ENV=antmaze-large-stitch-v0

bash scripts/$ENV/train.sh         # trains planner and inverse dynamics (if applicable)
bash scripts/fit_prior.sh $ENV     # fits Gaussian–Markov prior for the approximation

Evaluation

Each method is evaluated with the per-environment script bash scripts/<env>/eval.sh <method> <seed>, where <method> can be cd, ecd, or cdgs. All three share the same checkpoints, candidate budget, trajectory blending, and replanning; they differ only in the inference rule. For example, on the antmaze-large environment, run:

ENV=antmaze-large-stitch-v0
SEED=0
bash scripts/$ENV/eval.sh cd   $SEED   # CD: CompDiffuser baseline (interleave)
bash scripts/$ENV/eval.sh cdgs $SEED   # CDGS: Compositional Diffusion with Guided Search
bash scripts/$ENV/eval.sh ecd  $SEED   # ECD: Energy-based Compositional Diffusion (ours)
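To sweep all three methods over several seeds, the commands above can be generated programmatically. The sketch below only builds and prints the command lines. The seed values `0, 1, 2` are an assumption: the results are reported over 3 seeds, but the seeds themselves are not listed here.

```python
import shlex

ENV = "antmaze-large-stitch-v0"
METHODS = ("cd", "cdgs", "ecd")
SEEDS = (0, 1, 2)  # assumption: three seeds; the actual values are not stated

commands = [
    ["bash", f"scripts/{ENV}/eval.sh", method, str(seed)]
    for method in METHODS
    for seed in SEEDS
]

for cmd in commands:
    print(shlex.join(cmd))
    # To actually launch a run from the repository root, add `import subprocess`
    # at the top and call: subprocess.run(cmd, check=True)
```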

🔌 Using ECD as a Plug-in

Because ECD operates entirely at inference time, it can wrap any pretrained short-horizon diffusion denoiser. To use it, instantiate the CompositionalPolicy and set the inference type to ecd_chunk:

from ecd.policy import CompositionalPolicy

policy = CompositionalPolicy(
    diffusion_model=denoiser,                 # Your short-horizon chunk denoiser (see ecd/planner.py)
    normalizer=normalizer, 
    ev_n_comp=N,
    ev_cp_infer_t_type="ecd_chunk",           # Change to "interleave" for the standard CD baseline
    ecd_config=dict(
        rank_type="overlap",                  # Map-free candidate ranker
        base_scale=0.15, 
        react_scale=0.10,                     # Interior update / boundary-reaction strength
        markov_type="laplacian", 
        chunk_react_type="markov",
    ),
)

# Generate a long-horizon plan
plan = policy.plan(start_xy, goal_xy, b_s=40)

📊 Results

The table below shows the success rates (%, mean ± standard deviation across 3 random seeds) on various OGBench stitch tasks. ECD matches or exceeds the CompDiffuser baseline (CD) on every environment, with the largest gains on the large and giant mazes.

Environment	CD	CDGS (4×)	CDGS (8×)	ECD (Ours)	Scripts	Ckpt	Logs
pointmaze-medium	100 ± 0	100 ± 0	100 ± 0	100 ± 0	train / eval	link	logs
pointmaze-large	100 ± 0	97 ± 2	98 ± 2	100 ± 0	train / eval	link	logs
pointmaze-giant	77 ± 3	69 ± 4	68 ± 6	84 ± 2	train / eval	link	logs
antmaze-medium	95 ± 4	96 ± 1	97 ± 2	97 ± 1	train / eval	link	logs
antmaze-large	74 ± 4	71 ± 3	72 ± 6	82 ± 1	train / eval	link	logs
antmaze-giant	72 ± 9	78 ± 7	82 ± 6	82 ± 6	train / eval	link	logs
antmaze-large-o15d	83 ± 3	84 ± 6	87 ± 1	89 ± 1	train / eval	link	logs
humanoid-medium	90 ± 4	91 ± 1	89 ± 3	92 ± 2	train / eval	link	logs
humanoid-large	59 ± 2	50 ± 3	46 ± 4	64 ± 4	train / eval	link	logs
humanoid-giant	42 ± 1	27 ± 1	--	49 ± 1	train / eval	link	logs

Note

CD and CDGS are re-run by us under the same evaluation protocol as ECD (same checkpoints for planner and inverse-dynamics, same adaptive replanning and trajectory blending strategy).

CDGS (4×) / CDGS (8×) denote CDGS with 4 / 8 inference-time resampling rounds per denoising step. We did not run CDGS with higher resampling counts due to its significantly higher inference runtime (≈ 4× / 8× that of CD) and diminishing returns on success rate.
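The size of the gains can be read off the table with a few lines of arithmetic. The snippet below copies the CD and ECD mean success rates from the table above and computes the per-environment difference; it compares means only and ignores the reported standard deviations, so small gaps should not be over-interpreted.

```python
# Mean success rates (%) copied from the results table: (CD, ECD).
results = {
    "pointmaze-medium": (100, 100),
    "pointmaze-large": (100, 100),
    "pointmaze-giant": (77, 84),
    "antmaze-medium": (95, 97),
    "antmaze-large": (74, 82),
    "antmaze-giant": (72, 82),
    "antmaze-large-o15d": (83, 89),
    "humanoid-medium": (90, 92),
    "humanoid-large": (59, 64),
    "humanoid-giant": (42, 49),
}

# Absolute gain of ECD over CD in percentage points.
gains = {env: ecd - cd for env, (cd, ecd) in results.items()}
best_env = max(gains, key=gains.get)
mean_gain = sum(gains.values()) / len(gains)

print(best_env, gains[best_env])  # antmaze-giant 10
print(round(mean_gain, 1))        # 4.7
```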

Todo

Release code and pretrained checkpoints for the pointmaze and locomotion environments evaluated in the paper.
Add a demo notebook with animation comparing ECD and CompDiffuser on a long-horizon maze task.
Release code and pretrained checkpoints for additional OGBench environments.

📝 Citation

@inproceedings{sun2026ecd,
  title     = {Energy-based Compositional Diffusion Planning},
  author    = {Sun, Tao and Mishra, Utkarsh A. and Lu, Jiaxin and Xu, Danfei and Armeni, Iro},
  booktitle = {Proceedings of the 43rd International Conference on Machine Learning (ICML)},
  series    = {PMLR},
  volume    = {306},
  year      = {2026}
}

🙏 Acknowledgments

We thank the authors of the following open-source codebases for their components used in this project:

OGBench (Park et al., ICLR 2025).
CompDiffuser (Luo et al., NeurIPS 2025).
CDGS (Mishra et al., ICLR 2026).
GSC (Mishra et al., CoRL 2023).

📄 License

This project is released under the Apache License 2.0 (see LICENSE). This code builds upon OGBench and CompDiffuser, both of which are MIT-licensed; their original copyright and permission notices are retained in NOTICE.
