[bugfix] fix includes eval #178

niklasnolte · 2023-03-15T18:23:07Z

Bug fix to the basic.includes eval:

If a ref in sample["ideal"] is a single character, evals.elsuite.utils.get_answer can return an empty string if the ref is found in the last character of the prompt. any(...) then treats the empty string as false and reports a no-match.

Fix: check explicitly for None, as that is what get_answer returns in case of failure.

Alternatively, one can return a bool from get_answer, currently its only used in the includes (as far as i can see) and that would work there.

elh · 2023-05-23T21:45:21Z

FYI I believe this has been fixed #972

jwang47

Thanks for the fix!

Edit: @elh is right, it's actually been fixed in #972

fix includes eval

f142db2

niklasnolte mentioned this pull request Mar 15, 2023

binary count #182

Closed

12 tasks

jwang47 approved these changes Jun 2, 2023

View reviewed changes

jwang47 closed this Jun 2, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[bugfix] fix includes eval #178

[bugfix] fix includes eval #178

Uh oh!

niklasnolte commented Mar 15, 2023

Uh oh!

elh commented May 23, 2023

Uh oh!

jwang47 left a comment •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

[bugfix] fix includes eval #178

[bugfix] fix includes eval #178

Uh oh!

Conversation

niklasnolte commented Mar 15, 2023

Uh oh!

elh commented May 23, 2023

Uh oh!

jwang47 left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

jwang47 left a comment •

edited

Loading