Design Decisions#
This page outlines the key design decisions made during the development of the gxm API.
#
needs termination and truncation
different auto reset models
only one way is really compatible with jax
termination and truncation require a second observation -> the api would get really bloated -> in the gymnasium style api we need an additional obs and state variable making it 7 in total
state, obs, reward, done, info = env.step(key, state, action)
state, obs, true_obs, reward, terminated, truncated, info = env.step(key, state, action)
we decided to instead bundle everything into a single object
env_state, timestep = env.step(key, env_state, action)