🐛 Bug
The entropy coefficient (ent_coef) remains constant throughout training when the model is loaded from a saved checkpoint model.
This is not expected since ent_coef was set to "auto".
To Reproduce
Steps to reproduce the behavior:
- Train a model with the TQC algo with the ent_coef set to "auto".
- Observe that the ent_coef value varies (in the Tensorboard interface).
- Save the trained model to a file using the CheckpointCallback.
- Stop the training.
- Start a new training using a saved checkpoint making sure ent_coef is still set to "auto".
- Observe that the ent_coef value remain constant throughout training (in the Tensorboard interface).
Checklist