
RL Algorithms in Continuous State Spaces

This repository implements Reinforcement Learning (RL) algorithms for environments with continuous state spaces, such as CartPole and Acrobot. It covers both policy-based and value-based methods for these problems.

Implemented Algorithms

  • Policy-Based:

    • Proximal Policy Optimization (PPO)
    • REINFORCE with baseline
    • Actor-Critic
  • Value-Based:

    • Semi-Gradient N-Step SARSA
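The distinguishing ingredient of PPO among the methods above is its clipped surrogate objective. A minimal NumPy sketch for a single transition (function name and the epsilon default are illustrative, not taken from the repository code):

```python
import numpy as np

def clipped_surrogate(ratio, advantage, eps=0.2):
    """PPO clipped surrogate objective for one transition.

    ratio: pi_new(a|s) / pi_old(a|s); advantage: estimated A(s, a).
    Clipping the ratio to [1 - eps, 1 + eps] and taking the minimum
    keeps each policy update conservative.
    """
    unclipped = ratio * advantage
    clipped = np.clip(ratio, 1.0 - eps, 1.0 + eps) * advantage
    return np.minimum(unclipped, clipped)
```

In practice a PPO implementation maximizes the mean of this quantity over a batch of transitions (equivalently, minimizes its negative with gradient descent).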

Environments

  • CartPole
  • Acrobot
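Both environments expose a continuous observation vector (4-dimensional for CartPole, 6-dimensional for Acrobot) with a small discrete action set, so value-based methods must approximate Q rather than store it in a table. A minimal sketch of linear action-value estimation with epsilon-greedy selection (shapes and names are assumptions, not the repository's API):

```python
import numpy as np

def q_values(weights, state):
    # Linear action-value estimates: one weight row per discrete action.
    # weights: (num_actions, state_dim), state: (state_dim,)
    return weights @ state

def epsilon_greedy(weights, state, epsilon, rng):
    # With probability epsilon explore uniformly; otherwise act greedily.
    if rng.random() < epsilon:
        return int(rng.integers(weights.shape[0]))
    return int(np.argmax(q_values(weights, state)))
```

With `epsilon = 0` this reduces to the greedy policy over the current linear estimate; the exploration rate is typically annealed during training.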

Repository Structure

  • ActorCritic.py: Implementation of the Actor-Critic algorithm.
  • cartpole_ppo.py: PPO implementation for CartPole.
  • acrobot_ppo.py: PPO implementation for Acrobot.
  • semigradnstepSarsa.py: Semi-Gradient N-Step SARSA implementation.
  • cm_MCTS.py: Monte Carlo Tree Search module for the CatVSMonsters environment (experimental).
  • Results and Logs:
    • Visualizations of reward trends, means and standard deviations, and hyperparameter analysis.
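Of the files above, semigradnstepSarsa.py implements the value-based method, which bootstraps from an n-step return. A compact sketch of the target and the semi-gradient weight update for a linear approximator (all names are illustrative, not the script's actual functions):

```python
import numpy as np

def nstep_target(rewards, gamma, q_boot):
    # G = R_{t+1} + gamma*R_{t+2} + ... + gamma^{n-1}*R_{t+n}
    #     + gamma^n * q_hat(S_{t+n}, A_{t+n})
    g = sum(gamma**i * r for i, r in enumerate(rewards))
    return g + gamma**len(rewards) * q_boot

def semigrad_update(w, features, target, alpha):
    # Semi-gradient step: the bootstrap term inside the target is treated
    # as a constant, so the gradient of q_hat = w . features is just features.
    td_error = target - w @ features
    return w + alpha * td_error * features
```

The "semi" in semi-gradient refers to ignoring the target's dependence on `w` when differentiating, which is what makes the update cheap and, with on-policy sampling, stable.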

How to Run

  1. Clone the repository:
    git clone https://github.com/muktac5/RL_Algorithms_Continuous_State_Spaces.git
    cd RL_Algorithms_Continuous_State_Spaces
  2. Run an algorithm:
    python cartpole_ppo.py

Results

REINFORCE with Baseline:

  • CartPole
  • Acrobot

Proximal Policy Optimization (PPO):

  • CartPole
  • Acrobot

(The reward plots for each algorithm/environment pair are stored under Results and Logs.)
