[bugfix] fix includes eval #178
Closed
Bug fix to the `basic.includes` eval:

If a ref in `sample["ideal"]` is a single character, `evals.elsuite.utils.get_answer` can return an empty string if the ref is found in the last character of the prompt. `any(...)` then treats the empty string as false and reports a no-match.

Fix: check explicitly for `None`, as that is what `get_answer` returns in case of failure.

Alternatively, one can return a bool from `get_answer`. Currently it's only used in the includes eval (as far as I can see), and that would work there.
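
A minimal sketch of the failure mode described above. `get_answer_sketch`, `includes_buggy`, and `includes_fixed` are hypothetical stand-ins written for illustration, not the actual code in `evals.elsuite.utils` or the includes eval. The sketch assumes the helper returns the text following the last occurrence of the ref, or `None` when the ref is absent.

```python
from typing import List, Optional


def get_answer_sketch(text: str, ref: str) -> Optional[str]:
    """Hypothetical stand-in: return the text after the last occurrence
    of ref, or None if ref does not occur in text (assumed behavior)."""
    idx = text.rfind(ref)
    if idx == -1:
        return None
    return text[idx + len(ref):]


def includes_buggy(sampled: str, ideal: List[str]) -> bool:
    # Relies on truthiness: an empty-string result counts as "no match".
    return any(get_answer_sketch(sampled, ref) for ref in ideal)


def includes_fixed(sampled: str, ideal: List[str]) -> bool:
    # Only None signals failure; an empty string is still a match.
    return any(get_answer_sketch(sampled, ref) is not None for ref in ideal)


sampled = "The answer is B"  # the ref is the final character
ideal = ["B"]

print(includes_buggy(sampled, ideal))  # False: "" is falsy
print(includes_fixed(sampled, ideal))  # True
```

With the explicit `is not None` check, a ref at the very end of the text is reported as a match, while a ref that is missing entirely still yields no match.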