
Conversation

@pwilkin (Collaborator) commented Dec 10, 2025

I used my callback function from my Qwen3Next testing days; it seems to work more cleanly than the previous one, which was causing some problems with the scheduler / buffers.

@ngxson (Collaborator) left a comment

If you want to go a step further, I would suggest using ggml_backend_sched_set_eval_callback so it works the same way as libllama. That would be a cleaner solution.
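
For reference, a minimal sketch of what that could look like - the signature is the one ggml-backend.h declares for ggml_backend_sched_eval_callback, while the callback name and where it gets registered are just illustrative:

static bool debug_eval_cb(struct ggml_tensor * t, bool ask, void * user_data) {
    (void) user_data;
    if (ask) {
        // the scheduler asks whether we want to observe this node
        return true;
    }
    // the node has been computed at this point - print its name and shape
    fprintf(stderr, "%s: %s [%lld, %lld, %lld, %lld]\n", __func__, t->name,
            (long long) t->ne[0], (long long) t->ne[1], (long long) t->ne[2], (long long) t->ne[3]);
    return true; // returning false cancels the graph compute
}

// registered once, e.g. right after the scheduler is created:
// ggml_backend_sched_set_eval_callback(sched, debug_eval_cb, nullptr);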

std::string t_name = std::string(name) + "_" + std::to_string(il);
ggml_tensor * args[] = { t };
ggml_tensor * res = ggml_custom_4d(ctx0, t->type, t->ne[0], t->ne[1], t->ne[2], t->ne[3], args, 1, print_debug, 1, nullptr);
strcpy(res->name, t_name.c_str());
Collaborator

use ggml_set_name instead

Collaborator

Or even better, ggml_format_name
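
i.e., keeping the variables from the hunk above, the manual t_name + strcpy pair could collapse to something like:

ggml_format_name(res, "%s_%d", name, il);

(and unlike the raw strcpy, this will not overflow the fixed-size tensor name buffer)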

ggml_tensor * args[] = { t };
ggml_tensor * res = ggml_custom_4d(ctx0, t->type, t->ne[0], t->ne[1], t->ne[2], t->ne[3], args, 1, print_debug, 1, nullptr);
strcpy(res->name, t_name.c_str());
ggml_build_forward_expand(gf, res);
Collaborator

I think we should guard the whole thing under ctx->debug_graph. It seems like it got removed by mistake?
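
Roughly, i.e. wrapping the hunk quoted above (ctx->debug_graph being the existing clip flag referred to here, and using ggml_format_name as suggested earlier):

if (ctx->debug_graph) {
    ggml_tensor * args[] = { t };
    ggml_tensor * res = ggml_custom_4d(ctx0, t->type, t->ne[0], t->ne[1], t->ne[2], t->ne[3], args, 1, print_debug, 1, nullptr);
    ggml_format_name(res, "%s_%d", name, il);
    ggml_build_forward_expand(gf, res);
}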

Collaborator Author

Oh, yeah :>

#include "ggml-cpp.h"
#include "ggml-alloc.h"
#include "ggml-backend.h"
#include "ggml/src/ggml-impl.h"
Collaborator

This should be removed - we cannot include an internal header from ggml.

#include <cstring>
#include <fstream>
#include <map>
#include <memory>
Collaborator

some of these are already included by clip-impl.h - do we really need to include them again here?

@pwilkin (Collaborator, Author) commented Dec 10, 2025

All right, based on the convo with @ngxson I've decided to tackle this properly:

  • I moved the common debugging functions to llama-debug.cpp and added their declarations to llama.h or llama-cpp.h, depending on whether they use C or C++ APIs.
  • I plugged eval-callback into those new common functions.
  • I modified mtmd's cb() to do the same thing as llm_graph_builder's, which is basically to just set the tensor name; the actual tensor dump is handled via ggml_backend_sched_set_eval_callback (see the sketch after this list).
  • As an added bonus, I created a template version of the ggml_debug function, so you can now choose via the template parameter whether NaNs should abort execution or not (default: no).
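
A rough sketch of what the simplified cb() boils down to under this approach (the "name-il" formatting is assumed to mirror llm_graph_builder's convention; surrounding member access is omitted):

void cb(ggml_tensor * cur, const char * name, int il) const {
    // only name the tensor here; the actual dump happens in the eval callback
    // registered via ggml_backend_sched_set_eval_callback
    if (il >= 0) {
        ggml_format_name(cur, "%s-%d", name, il);
    } else {
        ggml_set_name(cur, name);
    }
}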

@pwilkin (Collaborator, Author) commented Dec 10, 2025

I would very much like to extend the callback mechanism further: (a) make it available in other clients (such as llama-cli), (b) make it configurable via args, and (c) add a couple of standard debug callbacks - for example, in addition to the printout, dumping selected tensors to a file, computing some diagnostic functions on the tensors, and so on (but of course not within this PR).

@pwilkin changed the title from "Restore clip's cb() to its rightful glory" to "Restore clip's cb() to its rightful glory - extract common debugging elements in llama" on Dec 10, 2025
Collaborator

I think this should be inside common/debug.h instead. There are no internal libllama components using these functions.

Collaborator Author

Oh, fair point.
