KSampler for Wan 2.2 MoE for ComfyUI

This repo is a fork from the original, due to that not being maintained since it's creation.

KSampler for Wan 2.2 MoE for ComfyUI

These nodes are made to support "Mixture of Expert" Flow models with the architecture of Wan2.2 A14B (With a high noise expert and low noise expert). Instead of guessing the denoising step at which to swap from the high noise model to the low noise model, this node automatically chanage to the low noise model when we reach the diffusion timestep at which the signal to noise ratio is supposed to be 1:1. There is additionally an option to adjust the CFG throught the denoising process, in both the high and low denoise stages.

Installation

To install this node, follow these steps:

Clone this repository into your ComfyUI custom nodes directory.
Restart ComfyUI to load the new node.

git clone https://github.com/GalaxyTimeMachine/ComfyUI-WanMoeKSampler.git /path-to-ComfyUI/custom_nodes/WanMoeKSampler

Usage

See workflows included in this repository for basic usage.

About the `boundary` parameter:

This correspond to the diffusion timestep around which the model used is supposed to start using the low noise expert. For Wan 2.2 T2V, this value should be 0.875, For Wan 2.2 I2V, the value should be 0.900. Using other values might still work.

Note that diffusion timesteps is NOT the same thing as denoising steps at all. You could think of the diffusion timesetp roughly as how much noise is added in the image (during training). At timestep 0, the image is clean, with no noise added. At a timestep of 1, the image/video is pure noise. And for Wan2.2 a14B T2V model, around timestep 0.875(0.9 for I2V), the video should be half noise, half useful data. The timestep is realated to the corresponding denoising step with a non-linear relationship that depends on the total number of steps, the sampling method used, and the noise scheduler (and sigma shift).

A good way to control where the high/low split occurs is by increasing or decreasing the sigma_shift. The higher you set that to, the later the switch will occur. (I've found that 12.00 is a good split for most things, but play with it)

Why Higher `sigma_shift` Delays the Switch from High to Low noise

The switching logic is based on converting the $\mathbf{\sigma}$ (noise magnitude) at each step into a normalized $\mathbf{t}$ (timestep) value, and checking if $t$ is less than the boundary (e.g., 0.875).

The core formula used by ComfyUI's discrete flow sampling to calculate the normalized timestep $t$ from $\sigma$ is conceptually:

$$\text{Normalized } t \approx \frac{\sigma}{\sigma + \text{shift}}$$

Where $\text{shift}$ is your sigma_shift value.

1. The Impact of Increasing `sigma_shift`

Increase the Denominator: When you increase the $\text{shift}$ (e.g., from 0.0 to 8.0), you are increasing the denominator ($\sigma + \text{shift}$).
Decrease the $t$ value: Increasing the denominator makes the entire fraction smaller. Therefore, for any given $\sigma$, the resulting normalized $\mathbf{t}$ value is smaller.

2. The Logic of the Switch

The model runs the high-noise sampler as long as the normalized timestep $t$ remains above the boundary:

$$\text{High Noise Runs} \iff t \ge \text{boundary}$$

Since increasing the sigma_shift reduces $t$ for every step, it takes more steps (i.e., $\sigma$ must drop to a much lower level) before $t$ finally falls below the 0.875 boundary.

In short: The sigma_shift effectively stretches the noise curve, making the switch point appear "later" in the total number of steps, thus keeping the high-noise expert active for a longer duration.

License

This project mostly contains code copy-pasted from ComfyUI, which is licenced under GPL3.0. Therefore it is also licenced under GPL 3.0. (see LICENCE file for more details)

Name		Name	Last commit message	Last commit date
Latest commit History 40 Commits
workflows		workflows
LICENSE		LICENSE
README.md		README.md
__init__.py		__init__.py
nodes.py		nodes.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

KSampler for Wan 2.2 MoE for ComfyUI

Installation

Usage

About the `boundary` parameter:

Why Higher `sigma_shift` Delays the Switch from High to Low noise

1. The Impact of Increasing `sigma_shift`

2. The Logic of the Switch

License

About

Uh oh!

Releases

Packages

Languages

License

GalaxyTimeMachine/ComfyUI-WanMoeKSampler

Folders and files

Latest commit

History

Repository files navigation

KSampler for Wan 2.2 MoE for ComfyUI

Installation

Usage

About the boundary parameter:

Why Higher sigma_shift Delays the Switch from High to Low noise

1. The Impact of Increasing sigma_shift

2. The Logic of the Switch

License

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

About the `boundary` parameter:

Why Higher `sigma_shift` Delays the Switch from High to Low noise

1. The Impact of Increasing `sigma_shift`

Packages