Closed
Description
Temp value cutoff (TVC) is a great param, but it is based on winrate, which scales very weirdly. Even at a setting of only 5, when she is winning easily or losing badly, she will blunder a queen or both rooks without batting an eyelash.
I propose a new parameter called temp value cutoff scaling: type float, min 0.00, max 1.00, default 1.00.
If this param is set to anything other than 1, it will decrease TVC when Lc0 is losing. The lower the value, the more TVC is reduced. At the default of 1, TVC is left unchanged.
For example, if TVC is set to 10 and TVC scaling is set to 0.5, Lc0 will have TVC of 5 when losing, TVC of 7.5 when the position is equal, and TVC of 10 when winning.
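The three numbers in that example fit a straight line between the losing and winning ends, so here is a rough sketch of how the scaling could work. This is only my assumption of a linear rule: the function name `scaled_tvc` and the `winrate` input (0 when losing, 0.5 when equal, 1 when winning) are made up for illustration and are not existing Lc0 code or options.

```python
def scaled_tvc(tvc: float, scaling: float, winrate: float) -> float:
    """Return the effective temp value cutoff for the current position.

    Hypothetical helper, not part of Lc0. Assumes a linear rule:
    effective = tvc * (scaling + (1 - scaling) * winrate)
    where winrate is 0.0 when losing, 0.5 when equal, 1.0 when winning.
    """
    if not 0.0 <= scaling <= 1.0:
        raise ValueError("scaling must be between 0.00 and 1.00")
    # Clamp winrate so out-of-range inputs cannot push TVC outside
    # the range [tvc * scaling, tvc].
    w = min(max(winrate, 0.0), 1.0)
    return tvc * (scaling + (1.0 - scaling) * w)


if __name__ == "__main__":
    # Reproduce the example: TVC = 10, scaling = 0.5.
    for label, w in (("losing", 0.0), ("equal", 0.5), ("winning", 1.0)):
        print(label, scaled_tvc(10.0, 0.5, w))
```

With scaling left at the default of 1.00, the function returns the plain TVC for every winrate, so existing behavior would not change.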
This is a simple change, but I believe it will help a lot with both training data generation and human play.