tinker_cookbook.rl.ProblemGroupBuilder

class tinker_cookbook.rl.ProblemGroupBuilder(EnvGroupBuilder)

Builds a group of ProblemEnv instances from env_thunk, a zero-argument factory callable.

Fields:

  • env_thunk (Callable[[], ProblemEnv])
  • num_envs (int)
  • dataset_name (str) – Default: 'problems'

make_envs()

Create num_envs ProblemEnv instances using the env_thunk factory callable.
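
The fields and make_envs() behaviour described above can be sketched with stand-in classes. This is a minimal illustration, not the library's implementation: it does not import tinker_cookbook, StubProblemEnv and SketchGroupBuilder are hypothetical names, and it assumes the factory is called once per environment. This page does not say whether the real make_envs() is synchronous or asynchronous; the sketch uses a plain synchronous method.

```python
from dataclasses import dataclass
from functools import partial
from typing import Callable, List


@dataclass
class StubProblemEnv:
    """Hypothetical stand-in for ProblemEnv; not the library class."""
    question: str


@dataclass(frozen=True)
class SketchGroupBuilder:
    """Hypothetical mirror of the documented fields; not the library class."""
    env_thunk: Callable[[], StubProblemEnv]
    num_envs: int
    dataset_name: str = "problems"

    def make_envs(self) -> List[StubProblemEnv]:
        # Assumption: the zero-argument factory is called once per
        # environment, so every member of the group is a separate instance.
        return [self.env_thunk() for _ in range(self.num_envs)]


# partial() binds the constructor arguments up front, which yields the
# zero-argument callable that env_thunk expects.
builder = SketchGroupBuilder(
    env_thunk=partial(StubProblemEnv, question="What is 2 + 3?"),
    num_envs=4,
)
envs = builder.make_envs()
```

Passing a factory rather than a ready-made environment lets each member of the group start from the same problem while holding its own state.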

compute_group_rewards(trajectory_group, env_group)

Return zero group rewards (all rewards come from per-step scoring).

Parameters:

  • trajectory_group (list[Trajectory])
  • env_group (Sequence[Env])
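
A minimal sketch of what returning zero group rewards can look like. The return type is not documented on this page, so the sketch assumes one float per trajectory; zero_group_rewards is a hypothetical free function, and the trajectories and environments are plain placeholder strings rather than library objects.

```python
from typing import List, Sequence


def zero_group_rewards(
    trajectory_group: List[object], env_group: Sequence[object]
) -> List[float]:
    # Assumption: one group-level reward per trajectory, all zero,
    # because scoring already happened at each step.
    return [0.0 for _ in trajectory_group]


# Hypothetical per-step totals already collected for three trajectories.
per_step_totals = [1.0, 0.0, 1.0]
group_rewards = zero_group_rewards(["t0", "t1", "t2"], ["e0", "e1", "e2"])

# Adding the zero group rewards leaves the per-step totals unchanged.
final_rewards = [s + g for s, g in zip(per_step_totals, group_rewards)]
```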