The assistant axis: situating and stabilizing the character of large language models
…Another is that it already exists in pre-trained models, reflecting some structure in the training data itself. To find out, we looked at the base versions of some of these models…