Replies: 1 comment
Hey there, I’ve taken a look at your sample in the
That said, I’d caution that

minimal_result = {
    "score": evaluation_result.score,
    "passing": evaluation_result.passing,
    "timestamp": evaluation_result.timestamp,
}

Have you run into any quirks with
Hi,
We just shipped a small Assay-side sample that consumes a frozen artifact derived from LlamaIndex's EvaluationResult surface: https://github.com/Rul1an/assay/tree/main/examples/llamaindex-evalresult-evidence
The goal is pretty narrow: test the smallest LlamaIndex result surface an external consumer could ingest without pulling in traces, callbacks, prompts, or completions.
We kept it to:
passing, score, feedback, evaluator_name, timestamp
We are not treating that artifact as LlamaIndex truth, and we are not assuming the checked-in fixture shape is a stable wire contract.
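To make the five-field reduction concrete, here is a minimal sketch of the projection. It is an illustration, not the code in the linked sample: `StandInResult` and `to_minimal_artifact` are hypothetical names, the stand-in class is not the LlamaIndex class, and the sketch assumes `evaluator_name` and `timestamp` are supplied by the caller rather than read from the result object.

```python
from dataclasses import dataclass
from datetime import datetime, timezone
from typing import Any, Dict, Optional


@dataclass
class StandInResult:
    """Hypothetical stand-in for an evaluation result (not the LlamaIndex class)."""

    passing: Optional[bool] = None
    score: Optional[float] = None
    feedback: Optional[str] = None


def to_minimal_artifact(
    result: Any,
    evaluator_name: str,
    timestamp: Optional[str] = None,
) -> Dict[str, Any]:
    """Project a result object onto the five fields listed above.

    Assumption: evaluator_name and timestamp come from the caller.
    Missing attributes on the result become None instead of raising.
    """
    if timestamp is None:
        # Default to the current UTC time in ISO 8601 form.
        timestamp = datetime.now(timezone.utc).isoformat()
    return {
        "passing": getattr(result, "passing", None),
        "score": getattr(result, "score", None),
        "feedback": getattr(result, "feedback", None),
        "evaluator_name": evaluator_name,
        "timestamp": timestamp,
    }


artifact = to_minimal_artifact(
    StandInResult(passing=True, score=1.0, feedback="ok"),
    evaluator_name="faithfulness",
    timestamp="2024-01-01T00:00:00+00:00",
)
```

Using `getattr` with a default keeps the projection tolerant of result objects that omit a field, which fits the point that the fixture shape is not a stable contract.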
Is EvaluationResult roughly the right place to start for a minimal external-consumer surface, or is there a thinner official result surface you would rather point us at?