A Larger Deep Multi-Step Deductive Reasoning Dataset over Natural Language with Multi-Step Deductive Reasoning Instruction For OpenAI EVAL #651
Merged
andrew-openai merged 24 commits into openai:main from 14H034160212:qiming/multi-step-deductive-reasoning on May 27, 2023
Commits
ffa349c update complex logical reasoning evals (14H034160212)
856de37 add pararule-plus (14H034160212)
379ec90 update pararule plus dataset (14H034160212)
c8cffd6 remove the data
7a2676f use lfs to upload jsonl file
48833eb remove jsonl
78d2e93 reupload data using lfs
6fc12b2 remove redundant line (14H034160212)
1ac53df remove the largejsonl file
9b82821 add another new the largejsonl file
1823c24 use fuzzymatch replace match
4d76c0b remove redundant code
fe57ecc Update evals/registry/evals/pararule-plus-multi-step-deductive-reason… (14H034160212)
82160db Update evals/registry/evals/pararule-plus-multi-step-deductive-reason… (14H034160212)
43e5238 add description
1383dcd Merge remote-tracking branch 'upstream/main' (14H034160212)
e2d0702 resolve conflict (14H034160212)
45b8eda update description for logiqa plus (14H034160212)
d42b906 remove the changed gita file (14H034160212)
ae2d1a7 remove the change gita (14H034160212)
50efba9 Merge remote-tracking branch 'upstream/main' (14H034160212)
ed24b52 add origin gita back (14H034160212)
e4f3051 resolve conflict (14H034160212)
04c7f78 remove redundant code (14H034160212)
...le-plus-multi-step-deductive-reasoning/pararule-plus-multi-step-deductive-reasoning.jsonl
3 changes: 3 additions & 0 deletions
Git LFS file not shown
evals/registry/evals/pararule-plus-multi-step-deductive-reasoning.yaml
8 changes: 8 additions & 0 deletions
| Original file line number | Diff line number | Diff line change |
|---|---|---|
| | | @@ -0,0 +1,8 @@ |
| | | pararule-plus-multi-step-deductive-reasoning: |
| | | id: pararule-plus-multi-step-deductive-reasoning.dev.v0 |
| | | description: multi-step deductive reasoning instruction for the PARARULE-Plus dataset |
| | | metrics: [accuracy] |
| | | pararule-plus-multi-step-deductive-reasoning.dev.v0: |
| | | class: evals.elsuite.basic.fuzzy_match:FuzzyMatch |
| | | args: |
| | | samples_jsonl: pararule-plus-multi-step-deductive-reasoning/pararule-plus-multi-step-deductive-reasoning.jsonl |
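
The diff table drops leading whitespace, and indentation is significant in YAML. Below is a sketch of how the eight added lines probably nest, assuming the file follows the common evals registry layout: one top-level entry for the eval name and one for the versioned eval. The keys and values are copied from the diff; the indentation is a reconstruction, not something the page shows.

```yaml
# Indentation is reconstructed (an assumption); keys and values are verbatim from the diff.
pararule-plus-multi-step-deductive-reasoning:
  id: pararule-plus-multi-step-deductive-reasoning.dev.v0
  description: multi-step deductive reasoning instruction for the PARARULE-Plus dataset
  metrics: [accuracy]
pararule-plus-multi-step-deductive-reasoning.dev.v0:
  class: evals.elsuite.basic.fuzzy_match:FuzzyMatch
  args:
    samples_jsonl: pararule-plus-multi-step-deductive-reasoning/pararule-plus-multi-step-deductive-reasoning.jsonl
```

Read this way, the first entry points the eval name at a versioned id and declares the reported metric. The second binds that id to the `FuzzyMatch` eval class and gives it the path to the samples file tracked with Git LFS.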