This repository implements various Reinforcement Learning (RL) algorithms to solve environments with continuous state spaces, such as CartPole and Acrobot. It explores both policy-based and value-based methods to tackle these problems effectively.
-
Policy-Based:
- Proximal Policy Optimization (PPO)
- REINFORCE with baseline
- Actor-Critic
-
Value-Based:
- Semi-Gradient N-Step SARSA
- CartPole
- Acrobot
ActorCritic.py: Implementation of the Actor-Critic algorithm.cartpole_ppo.py: PPO implementation for CartPole.acrobot_ppo.py: PPO implementation for Acrobot.semigradnstepSarsa.py: Semi-Gradient N-Step SARSA implementation.cm_MCTS.py: Monte Carlo Tree Search module over CatVSMonsters environment [Experiment]- Results and Logs:
- Contains visualizations of reward trends, mean and standard deviations, and hyperparameter analysis.
- Clone the repository:
git clone https://github.com/muktac5/RL_Algorithms_Continuous_State_Spaces.git cd RL_Algorithms_Continuous_State_Spaces - Run an algorithm:
python cartpole_ppo.py



