Version Insights Performance Analysis Summary: PR #588

Analysis Scope: Model conversion embedding verification script update

This PR modifies a bash script used for embedding verification in the model conversion workflow. The change corrects file path resolution to use the converted model name instead of the original model name. No compiled binaries were modified. All performance metrics (response time, throughput, power consumption) remain unchanged at 0% across all 16 analyzed binaries, including libllama.so, libggml-cpu.so, and llama-run. The script modification adds two variable assignments for path resolution with negligible execution overhead (sub-microsecond). No functions in performance-critical areas (llama_decode, llama_encode, ggml_compute_forward, ggml_mul_mat) were affected. Tokens per second for inference workloads remains unaffected.
Mirrored from ggml-org/llama.cpp#18079
This commit updates the embedding model verification script to use the CONVERTED_EMBEDDING_MODEL environment variable instead of using the EMBEDDING_MODEL_PATH (the original embedding model path) as the basis for the converted model file name.
The motivation for this is that, currently, if the converted embedding model file name differs from the original embedding model directory/name, the verification script will look for the wrong .bin files that were generated when running the models.
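The fix described above can be sketched as follows. This is a minimal, hypothetical illustration, not the actual script from the PR: only the CONVERTED_EMBEDDING_MODEL and EMBEDDING_MODEL_PATH environment variables are taken from the PR description; the variable names, default paths, .gguf extension handling, and .bin naming scheme are assumptions for illustration.

```shell
#!/usr/bin/env bash
set -euo pipefail

# Hypothetical defaults for illustration only.
EMBEDDING_MODEL_PATH="${EMBEDDING_MODEL_PATH:-models/bge-small}"
CONVERTED_EMBEDDING_MODEL="${CONVERTED_EMBEDDING_MODEL:-}"

if [ -n "$CONVERTED_EMBEDDING_MODEL" ]; then
    # New behavior: derive the base name from the converted model file itself,
    # so renamed converted files still resolve to the right .bin logits files.
    model_name=$(basename "$CONVERTED_EMBEDDING_MODEL")
    model_name="${model_name%.gguf}"
else
    # Old behavior: fall back to the original model path, which breaks when
    # the converted file name differs from the original directory/name.
    model_name=$(basename "$EMBEDDING_MODEL_PATH")
fi

# Assumed file-naming convention for the generated logits files.
echo "Looking for logits file: data/llamacpp-${model_name}-embeddings.bin"
```

With CONVERTED_EMBEDDING_MODEL unset, the script falls back to the original path, reproducing the old (broken) lookup; setting it to, say, `models/my-converted.gguf` makes the script look for `data/llamacpp-my-converted-embeddings.bin` instead.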