Maximum Entropy Inverse Reinforcement Learning

This is a python implementation of the Maximum Entropy Inverse Reinforcement Learning (MaxEnt IRL) algorithm based on the similarly named paper by Ziebart et al. and the Maximum Causal Entropy Inverse Reinforcement Learning (MaxCausalEnt IRL) algorithm based on his PhD thesis. Project for the Advanced Seminar in Imitation Learning, summer term 2019, University of Stuttgart.

This implementation is available as python package at https://pypi.org/project/irl-maxent/ and can be installed via pip install irl-maxent. You may also want to have a look at the accompanying presentation.

For an example demonstrating how the Maximum (non-causal) Entropy IRL algorithm works, see the corresponding Jupyter notebook (notebooks/maxent.ipynb). Note that the provided python files (src/) contain a slightly more optimized implementation of the algorithms.

To run a demonstration without the notebook, you can directly run ./src/example.py. Also have a look at this file on how to use the provided framework. The framework contains:

Two GridWorld implementations for demonstration (irl_maxent.gridworld)
The algorithm implementations (irl_maxent.maxent)
A gradient based optimizer framework (irl_maxent.optimizer)
Plotting helper functions (irl_maxent.plot)
A MDP solver framework, i.e. value iteration and corresponding utilities (irl_maxent.solver)
A trajectory/trajectory generation framework (irl_maxent.trajectory)

This project solely relies on the following dependencies: numpy, matplotlib, itertools, and pytest.

Name	Name	Last commit message	Last commit date
Latest commit qzed Fix deprecated use of np.float Apr 21, 2024 549c3e3 · Apr 21, 2024 History 108 Commits
.devcontainer	.devcontainer	.devcontainer: Add setup script	Apr 21, 2024
.vscode	.vscode	Update vscode settings	Mar 18, 2022
notebooks	notebooks	Re-run notebook	Jul 5, 2022
src	src	Fix deprecated use of np.float	Apr 21, 2024
.gitignore	.gitignore	Add .gitignore	May 6, 2019
CITATION.cff	CITATION.cff	Add citations file	Sep 16, 2023
LICENSE	LICENSE	Add license	Jul 1, 2019
Presentation.pdf	Presentation.pdf	Add presentation	Jun 24, 2019
README.md	README.md	Fix module path in README	Mar 19, 2022
pyproject.toml	pyproject.toml	Add support for packaging	Mar 18, 2022
requirements.txt	requirements.txt	Add requirements.txt	Mar 18, 2022
setup.py	setup.py	Add support for packaging	Mar 18, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Maximum Entropy Inverse Reinforcement Learning

About

Releases

Packages

Contributors 2

Languages

License

qzed/irl-maxent

Folders and files

Latest commit

History

Repository files navigation

Maximum Entropy Inverse Reinforcement Learning

About

Topics

Resources

License

Citation

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages