Could you please explain how the Verifier was trained? The paper seems to only mention that GPT-4 was used as the Verifier.