test-backend-ops: enables perf/eval testing of composite ops #14833

Open · wants to merge 7 commits into master
Conversation

etasnadi (Contributor)
This patch adds support for testing computation graphs ("composite ops") in test-backend-ops.

This is useful

  • when measuring the performance gains of fused ops compared to indirect implementations,
  • during op development, to check correctness when the op under development has no CPU implementation yet but an equivalent computation graph can be built by combining existing ops into a composite op.

Currently, out-of-tree code is used to test correctness (#14316), and non-standardized, vibe-coded standalone gists have been used to test performance in #14388 (comment).

In particular, this PR enables

  • comparing the output of computation graphs when executed on backend1 vs backend2,
  • comparing the output of a computation graph to the output of a regular op, or even comparing two composite ops where that makes sense,
  • performance testing of computation graphs.

An example is comparing the output of CONV_2D (direct conv implementation) with that of ggml_conv_2d (indirect conv implementation, as the latter expands to im2col followed by a mul_mat in the resulting graph).

To test the output of an op against a graph, the user adds a test case for the graph and one for the actual op, then subclasses test_case_compare : public test_case, which accepts the two test cases in its constructor. The tensor name assignment should be defined in test_case_compare so that the eval() function knows how to copy the inputs between the two graphs before execution. The output nodes are then compared after execution.
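The comparison pattern described above can be sketched as follows. This is a simplified, self-contained stand-in, not the real test-backend-ops classes: a "graph" is mocked as named input tensors plus a run() function, and the name-matching input copy plays the role of the tensor name assignment in the real test_case_compare.

```cpp
#include <cassert>
#include <map>
#include <string>
#include <vector>

using tensor_map = std::map<std::string, std::vector<float>>;

// Mock of a test case: named inputs plus a function producing the output node.
struct test_case {
    tensor_map inputs;
    virtual std::vector<float> run() const = 0;
    virtual ~test_case() = default;
};

// Mock of test_case_compare: takes the two cases in its constructor,
// copies inputs from one graph to the other by matching tensor names
// before execution, then compares the output nodes.
struct test_case_compare {
    test_case &ref, &cand;
    test_case_compare(test_case &r, test_case &c) : ref(r), cand(c) {}
    bool eval() {
        for (const auto &kv : ref.inputs)  // name-based input copy
            cand.inputs[kv.first] = kv.second;
        return ref.run() == cand.run();
    }
};

// Example: a "direct op" doubling its input vs. an equivalent "composite
// graph" adding the input to itself.
struct op_case : test_case {
    std::vector<float> run() const override {
        std::vector<float> out;
        for (float v : inputs.at("a")) out.push_back(2.0f * v);
        return out;
    }
};
struct graph_case : test_case {
    std::vector<float> run() const override {
        std::vector<float> out;
        for (float v : inputs.at("a")) out.push_back(v + v);
        return out;
    }
};
```

Usage mirrors the described flow: build both cases, hand them to the comparator, and eval() reports whether the outputs agree.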

When testing the perf of a graph, get_input_names of test_case should be overridden to return the names of the input tensors; eval_perf uses these to determine which nodes should be duplicated. The default implementation returns an empty list, in which case eval_perf assumes the graph tests a regular op (input nodes connected to a single output node doing the actual calculation), so only the output is duplicated.
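The override described above can be sketched like this. Again an illustrative mock rather than the real API; the tensor names "input" and "kernel" are hypothetical.

```cpp
#include <cassert>
#include <string>
#include <vector>

struct test_case {
    // Default: empty list, meaning a regular single-op graph, so the
    // perf runner duplicates only the output node.
    virtual std::vector<std::string> get_input_names() const { return {}; }
    virtual ~test_case() = default;
};

// A composite-graph case declares its input tensors so the perf runner
// knows which leaf nodes feed the graph and must be duplicated per run.
struct test_conv_graph : test_case {
    std::vector<std::string> get_input_names() const override {
        return {"input", "kernel"};  // hypothetical tensor names
    }
};

// In eval_perf, the duplication decision then reduces to something like:
static bool has_declared_inputs(const test_case &tc) {
    return !tc.get_input_names().empty();
}
```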

@github-actions github-actions bot added testing Everything test related ggml changes relating to the ggml tensor library for machine learning labels Jul 23, 2025
@etasnadi etasnadi changed the title Enables perf/eval testing of composite ops in test-backend-ops. test-backend-ops: enables perf/eval testing of composite ops Jul 23, 2025
@CISC CISC requested a review from Copilot July 24, 2025 10:07
@Copilot Copilot AI left a comment


Pull Request Overview

This PR adds support for testing computation graphs ("composite ops") in test-backend-ops, enabling performance and correctness evaluation of fused operations compared to indirect implementations. This is useful for op development when the direct implementation doesn't exist on CPU but can be tested via equivalent computation graphs.

  • Introduces test_case_compare class for comparing outputs between different operation implementations
  • Adds support for composite operation performance testing with proper node duplication logic
  • Implements example comparison between direct CONV_2D and im2col-based CONV_2D implementations

Reviewed Changes

Copilot reviewed 3 out of 3 changed files in this pull request and generated 5 comments.

File Description
tests/test-backend-ops.cpp Main implementation adding composite op testing infrastructure and CONV_2D comparison examples
ggml/src/ggml-backend.cpp Adds new function for comparing outputs between two different computation graphs
ggml/include/ggml-backend.h Declares the new graph comparison function in the public API

@ggml-org ggml-org deleted a comment from Copilot AI Jul 24, 2025
@ggml-org ggml-org deleted a comment from Copilot AI Jul 24, 2025