toolings
SAFETY & ROBUSTNESS Uncategorised

SafeLife: Avoiding Side Effects in Complex Environments

Partnership on AI

BENCHMARK

SafeLife is a reinforcement learning environment that's designed to test an agent's ability to learn and act safely. In this benchmark, they focus on the problem of avoiding negative side effects. The SafeLife environment has complex dynamics, procedurally generated levels, and tunable difficulty. Each agent is given a primary task to complete, but there's a lot that can go wrong! Can you train an agent to reach its goal without making a mess of things?