Skip to content

Conversation

grimoire
Copy link
Collaborator

Qwen3-next require kernels from:

https://github.com/Dao-AILab/causal-conv1d
https://github.com/fla-org/flash-linear-attention

We need env check for different model-device combinations.

@lvhan028 lvhan028 requested a review from windreamer October 15, 2025 07:47
@lvhan028 lvhan028 added the enhancement New feature or request label Oct 15, 2025
Copy link
Collaborator

@windreamer windreamer left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Do we need to consider to integrate ssm cache pool in PD migration request?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

enhancement New feature or request

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants