Added Reranker example by srv1n · Pull Request #598 · utilityai/llama-cpp-rs

srv1n · 2024-12-08T07:28:45Z

The parent llama.cpp repo recently added support for reranking.

llama : add reranking support ggml-org/llama.cpp#9510

This PR:

Adds a rerank example
Adds a new pooling type called rank (matching the llama.cpp implementation) in params
Makes initialized_logits public in LlamaBatch

Validated against all the examples in llama.cpp and with BGE-reranker-v2-m3; all rerank scores match.
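For readers unfamiliar with the pooling types mentioned above, here is a minimal sketch of how a rank variant might sit alongside the existing ones. The type name `PoolingType` and the helpers `to_raw` / `from_raw` are hypothetical and are not taken from this PR; the integer values follow the `llama_pooling_type` enum in llama.cpp's `llama.h` as I understand it, so check them against the vendored header before relying on them.

```rust
/// Hypothetical mirror of llama.cpp's `llama_pooling_type` enum.
/// The names here are illustrative; the crate's real type may differ.
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
pub enum PoolingType {
    /// Let the model decide (-1 in llama.cpp).
    Unspecified,
    /// No pooling: one embedding per token.
    None,
    /// Mean of the token embeddings.
    Mean,
    /// Embedding of the CLS token.
    Cls,
    /// Embedding of the last token.
    Last,
    /// Used by reranking models: yields a relevance score per sequence.
    Rank,
}

impl PoolingType {
    /// Convert to the raw integer assumed to be used on the C side.
    pub fn to_raw(self) -> i32 {
        match self {
            PoolingType::Unspecified => -1,
            PoolingType::None => 0,
            PoolingType::Mean => 1,
            PoolingType::Cls => 2,
            PoolingType::Last => 3,
            PoolingType::Rank => 4,
        }
    }

    /// Convert from a raw integer, falling back to `Unspecified`
    /// for values this sketch does not know about.
    pub fn from_raw(raw: i32) -> Self {
        match raw {
            0 => PoolingType::None,
            1 => PoolingType::Mean,
            2 => PoolingType::Cls,
            3 => PoolingType::Last,
            4 => PoolingType::Rank,
            _ => PoolingType::Unspecified,
        }
    }
}
```

Falling back to `Unspecified` on unknown values is a design choice of this sketch; it keeps the conversion total if the upstream enum grows.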

Please review, make edits and feel free to commit. Thank you for the awesome repo!

MarcusDunn

This looks great! Thanks for the PR.

My only question is, why did you make initialized_logits public?

MarcusDunn · 2024-12-08T16:14:14Z

~~also, is the submodule up to date enough to support this?~~ yes it is.

srv1n · 2024-12-09T03:18:32Z

I was trying to match the original llama.cpp repo as closely as possible. Specifically, the batch decode treats pooling type none and the other pooling types differently.

I initially started off trying to modify the embedding example to match the original repo (since reranking in llama.cpp is built into examples/embedding.cpp).

It's actually not being used in the reranking (since pooling is set to rank), but does it make sense to leave it public so we could use it with pooling type none if required?

MarcusDunn · 2024-12-14T04:59:14Z

sorry for the late reply!

I would prefer we keep it private unless it is required to have a feature work.
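As an illustration of the trade-off being discussed, here is a minimal sketch of keeping a field private while still exposing it read-only if a feature later needs it. The `Batch` struct, its `add` method, and the `logit_positions` accessor are hypothetical stand-ins and not the crate's actual API.

```rust
/// Hypothetical stand-in for a batch type; not the crate's real struct.
pub struct Batch {
    tokens: Vec<i32>,
    /// Indices of tokens for which logits were requested.
    /// Kept private so callers cannot put it out of sync with `tokens`.
    initialized_logits: Vec<i32>,
}

impl Batch {
    /// Create an empty batch.
    pub fn new() -> Self {
        Batch {
            tokens: Vec::new(),
            initialized_logits: Vec::new(),
        }
    }

    /// Append a token, recording its index if logits are wanted for it.
    pub fn add(&mut self, token: i32, wants_logits: bool) {
        let index = self.tokens.len() as i32;
        self.tokens.push(token);
        if wants_logits {
            self.initialized_logits.push(index);
        }
    }

    /// Read-only view of the recorded indices. Exposing a slice instead of
    /// the field itself lets the invariant stay under the type's control.
    pub fn logit_positions(&self) -> &[i32] {
        &self.initialized_logits
    }
}
```

With this shape, the field can stay private by default and an accessor can be added only once a concrete feature requires it.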

srv1n · 2025-02-06T12:46:50Z

My bad, finally got around to it. Updated now. initialized_logits is not public anymore.

MarcusDunn reviewed Dec 8, 2024

undid making initialized_logits public

d789cac

srv1n force-pushed the add-reranker-example branch from c72a971 to d789cac · February 6, 2025 12:45

MarcusDunn merged commit 73a346c into utilityai:main Feb 6, 2025
2 of 5 checks passed

MarcusDunn merged 1 commit into utilityai:main from srv1n:add-reranker-example
