Modular library for safe RL, providing baselines for a number of constraints using a variety of algorithms.
Projects
Software tools, libraries, and research prototypes from the group, with links to repositories, documentation, and papers.
Featured
By year
2026
2025
Adaptive shielding using GR(1) specifications and Inductive Logic Programming to repair specifications.
Probabilistic shielding for constrained Markov Decision Processes with unknown safety dynamics.
Write formal Reinforcement Learning reward specifications in Quantitative Linear Temporal Logic on finite traces.