RL-flappy-bird

Reinforcement Learning on a playable version of Flappy Bird.

Demo

Training the AI agent from scratch: watch it progressively learn to navigate the pipes.

Installation

pip install -e .

On Linux, you also need tkinter as a backend for interactive matplotlib support:

sudo apt-get install python3-tk

To also install the tools for recording demos:

pip install -e ".[tools]"

Usage

Human player

rl-flappy-bird

A window will open. The score and commands are displayed on the right side of the window.

AI player

To let the AI agent learn from scratch:

rl-flappy-bird --agent ai

To load a pretrained agent:

rl-flappy-bird --agent ai --load_save

Press S during the simulation to save the agent's current state.

Record a training demo

python tools/record_demo.py

RL Algorithm

The state consists of the bird's horizontal and vertical distances to the next pipe opening.
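Because the agent works with a tabular model, these two continuous distances have to be mapped to a finite set of states. The following is a minimal sketch of such a discretization, not the project's actual code: the function name `discretize_state` and the `bin_size` value are assumptions made for illustration, and the real bin width is whatever the state discretization parameter in args.py specifies.

```python
import math
from typing import Tuple


def discretize_state(dx: float, dy: float, bin_size: float = 10.0) -> Tuple[int, int]:
    """Map continuous distances to an integer grid cell.

    dx: horizontal distance from the bird to the next pipe opening.
    dy: vertical distance from the bird to the next pipe opening
        (may be negative when the bird is on the other side of the gap).
    bin_size: hypothetical cell width, in the same units as dx and dy.
    """
    # floor (rather than int truncation) keeps cells the same width
    # on both sides of zero.
    return (math.floor(dx / bin_size), math.floor(dy / bin_size))


state = discretize_state(95.0, -12.5)
```

A coarser `bin_size` gives fewer states and faster learning at the cost of precision; a finer one does the opposite.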

The agent explores its environment with an epsilon-greedy scheme that becomes greedier over time. After each simulation, it:

Updates its approximation of the underlying Markov Decision Process from observed transitions.
Solves for the optimal value function via Value Iteration.
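The two steps above can be sketched as follows. This is an illustrative model-based loop under stated assumptions, not the repository's implementation: the class `TabularModel`, the function `value_iteration`, the default `gamma`, and the choice to score unvisited state-action pairs as 0 are all hypothetical.

```python
from collections import defaultdict


class TabularModel:
    """Empirical MDP estimated from observed (s, a, r, s') transitions."""

    def __init__(self):
        self.counts = defaultdict(lambda: defaultdict(int))  # (s, a) -> {s': count}
        self.reward_sums = defaultdict(float)                # (s, a) -> summed reward
        self.visits = defaultdict(int)                       # (s, a) -> visit count

    def update(self, s, a, r, s_next):
        """Record one observed transition."""
        self.counts[(s, a)][s_next] += 1
        self.reward_sums[(s, a)] += r
        self.visits[(s, a)] += 1

    def q_value(self, s, a, values, gamma):
        """Expected return of taking a in s under the estimated model."""
        n = self.visits.get((s, a), 0)
        if n == 0:
            return 0.0  # assumption: unvisited pairs are scored as 0
        mean_reward = self.reward_sums[(s, a)] / n
        # States never seen as a source (e.g. terminal ones) default to value 0.
        expected_next = sum(
            count * values.get(s2, 0.0) for s2, count in self.counts[(s, a)].items()
        ) / n
        return mean_reward + gamma * expected_next


def value_iteration(model, actions, gamma=0.9, tol=1e-6, max_iters=1000):
    """Apply Bellman optimality backups until the values stop changing."""
    states = {s for (s, _a) in model.visits}
    values = {s: 0.0 for s in states}
    for _ in range(max_iters):
        delta = 0.0
        for s in states:
            best = max(model.q_value(s, a, values, gamma) for a in actions)
            delta = max(delta, abs(best - values[s]))
            values[s] = best
        if delta < tol:
            break
    return values


# Toy usage: two states, action 0 = flap, action 1 = do nothing.
model = TabularModel()
model.update("A", 0, 1.0, "B")
model.update("A", 1, -1.0, "dead")
model.update("B", 0, 1.0, "dead")
values = value_iteration(model, actions=[0, 1], gamma=0.9)
```

Re-solving after every simulation is affordable here because the discretized state space is small.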

The best action in a given state is the one that maximizes the expected value.
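Combining this greedy rule with the exploration scheme gives the action selection sketched below. The README does not state how the exploration rate shrinks, so the geometric decay, the parameter values, and the names `epsilon_for_episode` and `choose_action` are assumptions for illustration only.

```python
import random


def epsilon_for_episode(episode, eps_start=1.0, eps_min=0.01, decay=0.99):
    """Exploration rate for a given episode.

    Assumed schedule: geometric decay with a floor, so the agent
    explores a lot early on and becomes greedier as training proceeds.
    """
    return max(eps_min, eps_start * decay ** episode)


def choose_action(q_values, epsilon, rng=random):
    """Epsilon-greedy choice.

    q_values: dict mapping each action to its expected value
              in the current state.
    """
    if rng.random() < epsilon:
        # Explore: pick any action uniformly at random.
        return rng.choice(list(q_values))
    # Exploit: pick the action with the highest expected value.
    return max(q_values, key=q_values.get)


greedy_choice = choose_action({"flap": 0.2, "idle": 1.5}, epsilon=0.0)
```

With `epsilon=0.0` the choice is always the value-maximizing action, which is the behaviour expected from a fully trained agent.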

Customization

Sprites (bird, pipes, background) can be swapped by:

placing new JPG files in the sprites/ directory;
updating the sprite paths in rl_flappy_bird/args.py.

Other simulation parameters can also be tuned in args.py:

environment dimensions;
bird dynamics (gravity, jump velocity);
RL hyperparameters (discount factor, state discretization).
