Paper page - Compliance versus Sensibility: On the Reasoning Controllability in Large Language Models
… Notably, task accuracy is not strictly determined by sensibility, with models often maintaining high performance even when using conflicting patterns, suggesting a reliance on internalized parametric memory that increases with model size. …