Offline Reinforcement Learning with OOD State Correction and OOD Action Suppression

Code for NeurIPS 2024 accepted paper: Offline Reinforcement Learning with OOD State Correction and OOD Action Suppression.

Environment

Paper results were collected with MuJoCo 210 (and mujoco-py 2.1.2.14) in OpenAI gym 0.23.1 with the D4RL datasets. Networks are trained using PyTorch 1.11.0 and Python 3.7.

Usage

Pretrained Models

We have uploaded pretrained dynamics models in SCAS_dynamics/ to facilitate experiment reproduction.

You can also pretrain dynamics models by running:

./run_pretrain.sh

Offline RL

The SCAS algorithm can be trained by running:

./run_experiments.sh

Logging

This codebase uses tensorboard. You can view saved runs with:

tensorboard --logdir <run_dir>

Citation

If you find this work useful, please consider citing:

@article{mao2024offline,
  title={Offline reinforcement learning with ood state correction and ood action suppression},
  author={Mao, Yixiu and Wang, Qi and Chen, Chen and Qu, Yun and Ji, Xiangyang},
  journal={Advances in Neural Information Processing Systems},
  volume={37},
  pages={93568--93601},
  year={2024}
}

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
SCAS_dynamics		SCAS_dynamics
.gitignore		.gitignore
README.md		README.md
SCAS.py		SCAS.py
main.py		main.py
model.py		model.py
pretrain.py		pretrain.py
run_experiments.sh		run_experiments.sh
run_pretrain.sh		run_pretrain.sh
utils.py		utils.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Offline Reinforcement Learning with OOD State Correction and OOD Action Suppression

Environment

Usage

Pretrained Models

Offline RL

Logging

Citation

About

Uh oh!

Releases

Packages

Languages

thu-rllab/SCAS

Folders and files

Latest commit

History

Repository files navigation

Offline Reinforcement Learning with OOD State Correction and OOD Action Suppression

Environment

Usage

Pretrained Models

Offline RL

Logging

Citation

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages