A “diff” tool for AI: Finding behavioral differences in new models
…This controls pro-government censorship and propaganda in these Chinese-developed models, and is absent in the American models we compared them against. An “American Exceptionalism” feature found in Meta’s Llama…