Temp value cutoff scaling #1618

@SquarishRectangle

Temp value cutoff (TVC) is a great parameter, but it is based on winrate, which scales very unevenly: near 0% or 100%, even a large material loss barely moves the winrate. Even with TVC set to only 5, when Lc0 is winning easily or losing badly, she will blunder a queen or both rooks without batting an eyelash.

I propose a new parameter called temp value cutoff scaling: type float, min 0.00, max 1.00, default 1.00.

If this parameter is set to anything below 1, it decreases the effective TVC the worse Lc0's position is. The lower the value, the further TVC is scaled down. At the default of 1.00, TVC is unchanged.

For example, if TVC is set to 10 and TVC scaling is set to 0.5, Lc0 will have an effective TVC of 5 when losing, 7.5 when the position is equal, and 10 when winning.
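
To make the example concrete, here is a minimal sketch of one formula that reproduces those three numbers: a linear interpolation between `tvc * scaling` (fully losing) and `tvc` (fully winning). The function name `scaled_tvc` and the convention that winrate is a fraction from 0.0 to 1.0 are my own assumptions for illustration, not existing Lc0 code, and other curves would fit the same three points.

```python
def scaled_tvc(tvc: float, scaling: float, winrate: float) -> float:
    """Return the effective temp value cutoff for the current position.

    Hypothetical helper, not part of Lc0.
    tvc:     the configured temp value cutoff (e.g. 10)
    scaling: the proposed scaling parameter, 0.0 to 1.0 (1.0 = no change)
    winrate: assumed to be the side-to-move winrate as a fraction,
             0.0 (lost) to 1.0 (won)
    """
    if not 0.0 <= scaling <= 1.0:
        raise ValueError("scaling must be between 0.0 and 1.0")
    # Clamp so out-of-range evaluations cannot push TVC outside its bounds.
    w = min(max(winrate, 0.0), 1.0)
    # Linear interpolation: tvc * scaling at w = 0, tvc at w = 1.
    return tvc * (scaling + (1.0 - scaling) * w)


if __name__ == "__main__":
    for label, w in (("losing", 0.0), ("equal", 0.5), ("winning", 1.0)):
        print(label, scaled_tvc(10.0, 0.5, w))
```

With scaling at its default of 1.0 the function returns `tvc` for every winrate, so existing configurations would behave as before.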

This is a simple change, but I believe it would help a lot with both training data generation and human play.
