Skip to content

Commit c4acd69

Browse files
teilomilletclaude
andcommitted
Bump version to 0.3.0
New in 0.3.0: - Delight Policy Gradient gating (soft sigmoid + hard top-K) - SEPA-annealed PG→DG transition with lambda override - Scale-only surprisal normalization for instruct models - Per-generation top-K surprisal token logging - Hard delight transform (binary sign-aware token mask) - Cross-model campaign support (Nemotron, Kimi, Qwen 235B) - trust_remote_code + tokenize=True for broader model compat - All ty type checker errors resolved (CI green) Co-Authored-By: Claude Opus 4.6 (1M context) <[email protected]>
1 parent 9307e9d commit c4acd69

File tree

1 file changed

+1
-1
lines changed

1 file changed

+1
-1
lines changed

pyproject.toml

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -4,7 +4,7 @@ build-backend = "hatchling.build"
44

55
[project]
66
name = "retrain"
7-
version = "0.2.1"
7+
version = "0.3.0"
88
description = "RLVR training framework for LLMs"
99
readme = "README.md"
1010
requires-python = ">=3.11"

0 commit comments

Comments
 (0)