When is Your LLM Steerable?
…searches and post-hoc evaluation of full autoregressive rollouts. In this work, we investigate whether steerability can be predicted from the model's internal states at the beginning of the generation process…