XBOW tests Anthropic's Mythos Preview for offensive security
… But this time, we expanded our testing to analyze other angles as well: the model’s judgment with regard to threat modeling, vulnerability validation, and safety; the model’s ability to read source code versus interact with live systems; and its ability to find exploits we’re not yet looking for in our s… …