Releases · allenai/OLMo-in-loop-evals

12 Aug 22:59

github-actions

v0.8.6

9ca8b34

v0.8.6 Latest

What's new

Pinned the versions of the datasets, tokenizers, and pyarrow dependencies.

Commits

92e17d1 Pinning dependencies to known good versions. (#15)
73e9db1 Add release process instructions

20 Jul 16:37

github-actions

v0.8.5

7802f83

v0.8.5

What's new

Removed 👋

Remove sklearn and numpy as dependencies. The F1 score is now implemented manually.

Commits

ed1b21b remove sklearn/numpy deps (#14)
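
As an illustration of what a dependency-free F1 computation can look like, here is a minimal sketch in plain Python. The function name `binary_f1` and its signature are hypothetical and are not taken from this repository; the sketch only covers the binary case and returns 0.0 when there are no positive labels or predictions.

```python
from typing import Sequence


def binary_f1(y_true: Sequence[int], y_pred: Sequence[int], positive: int = 1) -> float:
    """Binary F1 without sklearn or numpy (hypothetical helper, not the repo's code)."""
    if len(y_true) != len(y_pred):
        raise ValueError("y_true and y_pred must have the same length")

    # Count true positives, false positives, and false negatives.
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == positive and p == positive)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t != positive and p == positive)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == positive and p != positive)

    # F1 = 2*TP / (2*TP + FP + FN); define it as 0.0 when the denominator is zero.
    denom = 2 * tp + fp + fn
    return 2 * tp / denom if denom else 0.0
```

For example, `binary_f1([1, 0, 1, 1], [1, 1, 0, 1])` has two true positives, one false positive, and one false negative, so it evaluates to 4/6.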

05 Jun 21:09

davidheineman

v0.8.4

d6909c1

v0.8.4

What's new

Add a BOS token to the input when the tokenizer defines one.
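
The behavior described here can be sketched roughly as follows. This is an assumption about the general idea, not the repository's actual code: `maybe_prepend_bos` is a hypothetical helper, the tokenizer is represented only by an optional `bos_token_id`, and the check that avoids adding a second BOS token is an extra assumption.

```python
from typing import List, Optional


def maybe_prepend_bos(token_ids: List[int], bos_token_id: Optional[int]) -> List[int]:
    """Prepend a BOS token id only if the tokenizer defines one (hypothetical helper)."""
    # Tokenizer has no BOS token: leave the sequence unchanged.
    if bos_token_id is None:
        return list(token_ids)
    # Assumption: do not add a second BOS if one is already present.
    if token_ids and token_ids[0] == bos_token_id:
        return list(token_ids)
    return [bos_token_id] + list(token_ids)
```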

27 May 15:41

davidheineman

v0.8.3

fa83ee4

v0.8.3

What's new

Fix speed problem for BPB/RC tasks

19 May 21:12

davidheineman

v0.8.2

dc81627

v0.8.2

What's new

Add few-shot HumanEval
Fix prompting setup for MT MBPP. For more details, see allenai/oe-eval-internal#489

Commits

d3d2448 add few show he
696f7ae update fixed mbpp (#10)
8ead415 detach bpb tensor (prevents warning)

18 May 03:13

davidheineman

v0.8.1

c8c554c

v0.8.1

What's new

Add MT MBPP and Minerva Math 500

18 May 00:57

davidheineman

v0.8.0

bdd2d12

v0.8.0

What's new

Add fast MCQA

Commits

b635bb9 Add fast MCQA (#8)

16 May 20:37

davidheineman

v0.7.2

d386498

v0.7.2

What's new

Add basic skills evals

03 Apr 17:09

davidheineman

v0.7.1

8dd0b46

v0.7.1

What's new

Fix normalization to match the OLMES standard

Commits

7363946 fix bias in eval per-char and per-byte normalization (#6)
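
For readers unfamiliar with the quantities named in this commit, the sketch below shows the usual textbook definitions of per-character and per-byte normalization of a continuation's summed log-probability, plus bits per byte. The function names are hypothetical, natural-log inputs and UTF-8 byte counts are assumptions, and nothing here reproduces the specific bias that the commit fixed.

```python
import math


def per_char_score(logprob_sum: float, continuation: str) -> float:
    """Summed log-probability divided by the number of characters."""
    return logprob_sum / max(len(continuation), 1)


def per_byte_score(logprob_sum: float, continuation: str) -> float:
    """Summed log-probability divided by the number of UTF-8 bytes."""
    return logprob_sum / max(len(continuation.encode("utf-8")), 1)


def bits_per_byte(logprob_sum: float, continuation: str) -> float:
    """Negative log-likelihood in bits, per UTF-8 byte (assumes natural-log input)."""
    num_bytes = max(len(continuation.encode("utf-8")), 1)
    return -logprob_sum / (num_bytes * math.log(2))
```

The two normalizations differ only for text with multi-byte characters: `"é"` is one character but two UTF-8 bytes, so its per-byte score is half its per-character score.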

10 Mar 23:50

github-actions

v0.7.0

6ad85b8

v0.7.0

What's new

Add in-loop GSM, Minerva, MBPP, HumanEval

Commits

2f01ec8 Add in-loop gen tasks (#5)
