Evaluating Claude’s bioinformatics research capabilities with BioMysteryBench
…Interestingly, Claude Sonnet 4.6 and more capable models were able to solve significant fractions of human-difficult problems, with Claude Mythos Preview topping out at a 30% solve rate. So what…
