
Conversation

Collaborator

@936187425 936187425 commented Apr 16, 2024

Adds support for Mixtral-8x7B-v0.1.

// dimension of each attention head
const int64_t head_dim = args.head_dim();
// number of KV heads; falls back to n_heads (MHA) when not set, fewer for MQA/GQA
const int64_t n_kv_heads = args.n_kv_heads().value_or(n_heads);
// query heads and KV heads owned by this tensor-parallel rank
const int64_t n_local_heads = n_heads / world_size;
const int64_t n_local_kv_heads = n_kv_heads / world_size;
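
For context, these lines shard the attention heads across tensor-parallel ranks. Below is a minimal, self-contained sketch of how the per-rank counts work out; the values are hypothetical (a Mixtral-8x7B-style config with 32 query heads, 8 KV heads, and 4 ranks is assumed, not taken from this PR):

#include <cstdint>
#include <iostream>

int main() {
  // Hypothetical Mixtral-8x7B-style attention config (assumed, not from the PR).
  const int64_t n_heads = 32;    // query heads
  const int64_t n_kv_heads = 8;  // KV heads (GQA)
  const int64_t world_size = 4;  // tensor-parallel ranks

  // Each rank owns an equal slice of the query heads and of the KV heads.
  const int64_t n_local_heads = n_heads / world_size;        // 8 query heads per rank
  const int64_t n_local_kv_heads = n_kv_heads / world_size;  // 2 KV heads per rank

  std::cout << "local heads: " << n_local_heads
            << ", local kv heads: " << n_local_kv_heads << std::endl;
  return 0;
}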
Collaborator

Just a heads up: I added support for MQA and GQA; please also include that support in your change. FYI dff774e

You can learn about MQA and GQA from this blog: https://iamshobhitagarwal.medium.com/navigating-the-attention-landscape-mha-mqa-and-gqa-decoded-288217d0a7d1
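
To make the MQA/GQA distinction concrete, here is a minimal, hypothetical sketch (not code from dff774e) showing that the three attention variants differ only in the ratio of query heads to KV heads:

#include <cassert>
#include <cstdint>
#include <iostream>

// Number of query heads that share each KV head:
//   MHA: n_kv_heads == n_heads     -> ratio 1
//   GQA: 1 < n_kv_heads < n_heads  -> ratio n_heads / n_kv_heads
//   MQA: n_kv_heads == 1           -> ratio n_heads
int64_t queries_per_kv_head(int64_t n_heads, int64_t n_kv_heads) {
  assert(n_kv_heads > 0 && n_heads % n_kv_heads == 0);
  return n_heads / n_kv_heads;
}

int main() {
  std::cout << queries_per_kv_head(32, 32) << std::endl;  // MHA: 1
  std::cout << queries_per_kv_head(32, 8) << std::endl;   // GQA: 4
  std::cout << queries_per_kv_head(32, 1) << std::endl;   // MQA: 32
  return 0;
}

A smaller n_kv_heads shrinks the KV cache proportionally, which is the main motivation for MQA and GQA.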

@936187425 changed the title from "[model] added support for mixtral moe model" to "[model] add support for mixtral moe model" on May 16, 2024