ggml-cpu : deduplicate scalar implementations #14897

xctan · 2025-07-27T07:01:40Z

This PR cleans up redundant fallback implementations in each architecture, which were introduced in the previous refactor PR #13892.

ggml/src/ggml-cpu/arch/x86/quants.c

Copilot

Pull Request Overview

This PR removes redundant scalar fallback implementations from architecture-specific code that were added in a previous refactor, replacing them with calls to generic implementations. The cleanup eliminates duplicated scalar computation code across different CPU architectures while maintaining the same functionality through centralized generic fallback functions.

Key changes:

Replaced architecture-specific scalar implementations with calls to generic functions
Added UNUSED macros to silence compiler warnings for variables no longer used in fallback paths
Maintained existing optimized vectorized code paths while cleaning up fallback implementations

Reviewed Changes

Copilot reviewed 10 out of 10 changed files in this pull request and generated no comments.

Show a summary per file

File	Description
ggml/src/ggml-cpu/arch/x86/repack.cpp	Replaced scalar implementations with generic function calls for quantization and matrix operations
ggml/src/ggml-cpu/arch/x86/quants.c	Removed duplicated scalar fallback code for vector dot product operations
ggml/src/ggml-cpu/arch/wasm/quants.c	Cleaned up scalar implementations in WebAssembly-specific code
ggml/src/ggml-cpu/arch/s390/quants.c	Removed redundant scalar fallbacks in s390 architecture code
ggml/src/ggml-cpu/arch/riscv/repack.cpp	Simplified RISC-V repack operations by using generic implementations
ggml/src/ggml-cpu/arch/riscv/quants.c	Restructured conditional compilation and removed scalar duplications
ggml/src/ggml-cpu/arch/powerpc/quants.c	Eliminated redundant scalar code in PowerPC architecture implementation

xctan added 8 commits July 25, 2025 01:18

remove redundant code in riscv

ece2987

remove redundant code in arm

bddebe6

remove redundant code in loongarch

2f1c43b

remove redundant code in ppc

14a661c

remove redundant code in s390

4441970

remove redundant code in wasm

8b81b5a

remove redundant code in x86

6985619

remove fallback headers

686d658

github-actions bot added the ggml changes relating to the ggml tensor library for machine learning label Jul 27, 2025

slaren reviewed Jul 27, 2025

View reviewed changes

ggml/src/ggml-cpu/arch/x86/quants.c Show resolved Hide resolved

fix x86 ggml_vec_dot_q8_0_q8_0

7b068bb

ggerganov approved these changes Jul 28, 2025

View reviewed changes

ggerganov requested a review from slaren July 28, 2025 06:02

slaren approved these changes Jul 28, 2025

View reviewed changes

slaren requested a review from Copilot July 28, 2025 15:39

Copilot AI reviewed Jul 28, 2025

View reviewed changes

slaren merged commit db16e28 into ggml-org:master Jul 28, 2025
47 checks passed

ggerganov mentioned this pull request Jul 30, 2025

Q2k interleaving implementation - x86/x64 SIMD #14373

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

ggml-cpu : deduplicate scalar implementations #14897

ggml-cpu : deduplicate scalar implementations #14897

Uh oh!

xctan commented Jul 27, 2025

Uh oh!

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

Uh oh!

ggml-cpu : deduplicate scalar implementations #14897

ggml-cpu : deduplicate scalar implementations #14897

Uh oh!

Conversation

xctan commented Jul 27, 2025

Uh oh!

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull Request Overview

Reviewed Changes

Uh oh!

Uh oh!

Uh oh!