
[Bug] The entropy coefficient remains constant throughout training when the model is loaded from a saved checkpoint #391

@Atlis

🐛 Bug

The entropy coefficient (ent_coef) stays constant for the whole training run when the model is loaded from a saved checkpoint.
This is unexpected: ent_coef is set to "auto", so it should keep being adjusted during training.

To Reproduce

Steps to reproduce the behavior:

  1. Train a model with the TQC algorithm, with ent_coef set to "auto".
  2. Observe in TensorBoard that the ent_coef value varies.
  3. Save the trained model to a file using the CheckpointCallback.
  4. Stop the training.
  5. Start a new training run from the saved checkpoint, making sure ent_coef is still set to "auto".
  6. Observe in TensorBoard that the ent_coef value now remains constant throughout training.
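
The steps above could be scripted roughly as follows. This is only a sketch, not the exact code that hit the bug: the environment (Pendulum-v1), the step counts, and the helper names `is_effectively_constant`, `current_ent_coef`, and `reproduce` are assumptions chosen for illustration. It also assumes that TQC stores the learned coefficient in `log_ent_coef` when ent_coef="auto", and it reads the value directly from the model instead of from TensorBoard.

```python
import math
import os
import tempfile


def is_effectively_constant(values, rel_tol=1e-6):
    """Return True if every value equals the first one within rel_tol."""
    values = list(values)
    if len(values) < 2:
        return True
    first = values[0]
    return all(math.isclose(v, first, rel_tol=rel_tol) for v in values[1:])


def current_ent_coef(model):
    """Read the current entropy coefficient from the model.

    Assumption: with ent_coef="auto" the model exposes `log_ent_coef`
    as a one-element tensor holding the log of the coefficient.
    """
    return float(model.log_ent_coef.exp().item())


def reproduce(env_id="Pendulum-v1", steps=2_000, chunks=5, chunk_steps=200):
    """Train, checkpoint, reload, train again, and sample ent_coef."""
    # Imported lazily so the helpers above work without these packages.
    from sb3_contrib import TQC
    from stable_baselines3.common.callbacks import CheckpointCallback

    with tempfile.TemporaryDirectory() as tmp:
        # Steps 1-4: train with ent_coef="auto" and save one checkpoint.
        model = TQC("MlpPolicy", env_id, ent_coef="auto", learning_starts=100, seed=0)
        checkpoint = CheckpointCallback(save_freq=steps, save_path=tmp, name_prefix="tqc")
        model.learn(total_timesteps=steps, callback=checkpoint)

        # Step 5: start a new run from the saved checkpoint.
        path = os.path.join(tmp, f"tqc_{steps}_steps.zip")
        loaded = TQC.load(path, env=model.get_env(), ent_coef="auto")

        # Step 6: sample ent_coef after each short stretch of training.
        samples = []
        for _ in range(chunks):
            loaded.learn(total_timesteps=chunk_steps, reset_num_timesteps=False)
            samples.append(current_ent_coef(loaded))
        return samples
```

Calling `is_effectively_constant(reproduce())` should return True if the behaviour described here occurs, and False if ent_coef keeps being updated after loading.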

Checklist

  • I have checked that there is no similar issue in the repo (required)
  • I have read the documentation (required)
  • I have provided a minimal working example to reproduce the bug (required) (described in the steps to reproduce above)

Labels: bug