Values in the wild: Discovering and analyzing values in real-world language model interactions
…Although our method could potentially be used as an evaluation of how closely a model hews to the developer’s preferred values, it can’t be used pre-deployment. That is, the…