Paper page - Darwin Family: MRI-Trust-Weighted Evolutionary Merging for Training-Free Scaling of Language-Model Reasoning
…View arXiv page View PDF Project page Add to collection Community FINAL Bench introduces a new evaluation paradigm for LLMs: functional metacognitive reasoning — not just "can the model solve it," but "does…