GitHub - YavarYeganeh/Deep_AIF: Deep Active Inference (Deep AIF) Agents

Deep Active Inference Agents for Delayed and Long-Horizon Environments

Yavar Taheri Yeganeh · Mohsen Jafari · Andrea Matta

With the recent success of world-model agents—which extend the core idea of model-based reinforcement learning by learning a differentiable model for sample-efficient control across diverse tasks—active inference (AIF) offers a complementary, neuroscience-grounded paradigm that unifies perception, learning, and action within a single probabilistic framework powered by a generative model. Despite this promise, practical AIF agents still rely on accurate immediate predictions and exhaustive planning, a limitation that is exacerbated in delayed environments requiring plans over long horizons—tens to hundreds of steps. Moreover, most existing agents are evaluated on robotic or vision benchmarks which, while natural for biological agents, fall short of real-world industrial complexity. Deep AIF addresses these limitations with a generative–policy architecture featuring:

(i) Multi-step latent transition that lets the generative model predict an entire horizon in a single look-ahead.
(ii) Integrated policy network that enables the transition and receives gradients of the expected free energy.
(iii) Alternating optimization scheme that updates model and policy from a replay buffer.
(iv) Single gradient step that plans over long horizons, eliminating exhaustive planning from the control loop.

Along with the Deep AIF implementation and supporting codes, the current repository also includes a simulated industrial environment that mimics a realistic industrial scenario with delayed and long-horizon settings. The empirical results confirm the effectiveness of the proposed approach, demonstrating that the coupled world-model with the AIF formalism yields an end-to-end probabilistic controller capable of effective decision making in delayed, long-horizon settings without handcrafted rewards or expensive planning. More benchmarks and results will be included.

-- More codes will be released soon!

📄 Read the Paper (arXiv:2505.19867)
📈 Slides

Requirements

You need a Python environment with the following libraries (and other supporting ones):

torch>=2.1.1
numpy>=1.24.3
simpy>=4.0.1

Usage

Clone the source code by running:

git clone https://github.com/YavarYeganeh/Deep_AIF.git
cd ./Deep_AIF

Then, train the agent by running:

python train.py --seed <random_seed> --batch <batch_size> --horizon <depth> --samples <number_of_samples> --num_threads <num_threads> --replay_policy_update --npf --reward_multiplier <reward_multiplier>

Usage: train.py [-h]

--seed : Random seed (default: 0).
-b, --batch : Select batch size, i.e., number of environments used for training (default: 1).
--horizon : The depth for the signle lookahead transition (default: 300).
--samples : Number of samples to be used for Expected Free Energy (EFE) calculation (default: 10).
--num_threads : Number of threads to use (CPU only, default: 4).
--replay_policy_update : If set, also uses replay scenarios during policy gradient when training. It dreaming in a batch of scenarios, encouraging generic policy learning (suggested).
--npf : If set, employs a different form of preference function with sigmoid scaling of the energy-saving element (suggested).
--reward_multiplier : Multiplier applied to the production reward, effective with --npf (default: 20).

Example:

python train.py --horizon 300 --replay_policy_update --npf --reward_multiplier 20

During training, the code records comprehensive data on the agent’s performance and training statistics across epochs in results/signature/, where signature represents a combination of the timestamp and settings. Finally, it produces stats_final.pkl in the same folder, containing all collected information.

Citation

Yeganeh, Y. T., Jafari, M., & Matta, A. (2025). Deep Active Inference Agents for Delayed and Long-Horizon Environments. arXiv preprint arXiv:2505.19867.

@article{yeganeh2025deep,
  title={Deep Active Inference Agents for Delayed and Long-Horizon Environments},
  author={Yeganeh, Yavar Taheri and Jafari, Mohsen and Matta, Andrea},
  journal={arXiv preprint arXiv:2505.19867},
  year={2025}
}

Contact

For inquiries or collaboration, please reach out to yavar.taheri@polimi.it or yavaryeganeh@gmail.com.

Name		Name	Last commit message	Last commit date
Latest commit History 35 Commits
img		img
slides		slides
src		src
LICENSE.txt		LICENSE.txt
README.md		README.md
test.py		test.py
test_all-on.py		test_all-on.py
test_random.py		test_random.py
train.py		train.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Deep Active Inference Agents for Delayed and Long-Horizon Environments

Requirements

Usage

Usage: train.py [-h]

Citation

Contact

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Deep Active Inference Agents for Delayed and Long-Horizon Environments

Requirements

Usage

Usage: train.py [-h]

Citation

Contact

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages