Skip to content

fix: gate evolution success on artifact diffs#16

Open
austinmw wants to merge 1 commit intoNousResearch:mainfrom
austinmw:fix/evolution-noop-gating-pr
Open

fix: gate evolution success on artifact diffs#16
austinmw wants to merge 1 commit intoNousResearch:mainfrom
austinmw:fix/evolution-noop-gating-pr

Conversation

@austinmw
Copy link
Copy Markdown

@austinmw austinmw commented Apr 9, 2026

Summary

  • only treat evolution runs as success when the evolved artifact actually differs from baseline
  • surface artifact-change status in the results table and metrics output
  • add a focused unit test for the success gate

Why

A run can report a small positive holdout delta even when baseline_skill.md and evolved_skill.md are byte-for-byte identical. Those should be treated as no-op runs, not successful improvements.

Testing

  • pytest tests/skills/test_evolve_skill.py tests/core/test_constraints.py tests/skills/test_skill_module.py

@austinmw austinmw force-pushed the fix/evolution-noop-gating-pr branch from 60e3683 to ab5dd13 Compare April 9, 2026 01:40
@austinmw austinmw changed the title Gate evolution success on artifact diffs fix: gate evolution success on artifact diffs Apr 9, 2026
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant