Code Authors: Advait Desai and Gargi Gupta
Program: SRA VJTI's Eklavya 2025 Program
| Go2 Quadruped Walk | Lunar Lander |
|---|---|
![]() |
![]() |
| Blackjack | Frozen Lake |
|---|---|
![]() |
![]() |
We aim to train a PPO-based gait policy in MuJoCo and deploy it on a low-cost, tortoise-style quadruped robot.
- Learn Reinforcement Learning basics (Monte Carlo, Q-Learning)
- Learn Deep Learning (MNIST digit classifier)
- Combine Deep Learning with RL (DQN, DDQN, TRPO, PPO)
- Implement on environments (Brax, MuJoCo Menagerie, Bipedal, Go2)
- Monte Carlo Methods - Basic RL policy evaluation
- Q-Learning - Value-based reinforcement learning
- Deep Learning - MNIST digit classification for neural network fundamentals
- DQN/DDQN - Deep Q-Networks for discrete action spaces
- TRPO/PPO - Trust Region and Proximal Policy Optimization for continuous control
- Brax - High-performance physics simulation
- MuJoCo Menagerie - Diverse robotics environments
- Bipedal Walker - Humanoid locomotion
- Go2 Quadruped - Advanced quadruped robot simulation
quad_move_eklavya/
├── 1_brax_training_viewer/
├── 2_mujoco_menagerie/
│ └── flybody/
├── 3_monte_carlo/
├── 4_Q_learning/
├── 5_number_classifier/
├── 6_DQN/
├── 7_DDQN/
├── 8_PPO_bipedal/
└── 9_PPO_go2_stable_baselines/
git clone https://github.com/Advait2211/quad_move_eklavya.git
cd quad_move_eklavyapython3.10 -m venv venvmacOS/Linux:
source venv/bin/activateWindows (PowerShell):
.\venv\Scripts\activatepip install -r requirements.txt- Python 3.10+
- MuJoCo physics engine
- PyTorch/TensorFlow for deep learning
- Stable Baselines3 for RL algorithms
- Additional dependencies listed in
requirements.txt



