Submitted by Joseph Imperial
Scaling Policy Compliance Assessment in Language Models with Policy Reasoning Traces
University of Bath