Search: self-built add-ons

Paper page - Chiaroscuro Attention: Spending Compute in the Dark

…We propose CHIAR-Former (Chiaroscuro Attention), a 4-layer hybrid transformer that routes each token to one of three operators - DCT spectral mixing , RBF kernel mixing , or full self-attention - based on…

Jun 9, 2026

Paper page - PREPING: Building Agent Memory without Tasks

…Experiments on AppWorld, BFCL v3, and MCP-Universe show that Preping substantially improves over a no-memory baseline and achieves performance competitive with strong playbook-based methods built from offline or online…

May 15, 2026

Paper page - BioMatrix: Towards a Comprehensive Biological Foundation Model Spanning the Modality Matrix of Sequences, Structures, and Language

…Built upon the Qwen3 language model (1.7B and 4B), BioMatrix is continually pretrained on 304.4 billion tokens spanning general and domain-specific text, sequence and structure views of molecules and…

Jun 23, 2026

Followed topics

Search

Paper page - Chiaroscuro Attention: Spending Compute in the Dark

Paper page - PREPING: Building Agent Memory without Tasks

Paper page - BioMatrix: Towards a Comprehensive Biological Foundation Model Spanning the Modality Matrix of Sequences, Structures, and Language

Paper page - Context-Aware RL for Agentic and Multimodal LLMs

Paper page - EVA01: Unified Native 3D Understanding and Generation via Mixture-of-Transformers

Paper page - Intern-Atlas: A Methodological Evolution Graph as Research Infrastructure for AI Scientists

Paper page - RLDX-1 Technical Report

Paper page - Learning to Act and Cooperate for Distributed Black-Box Consensus Optimization

Paper page - UniVidX: A Unified Multimodal Framework for Versatile Video Generation via Diffusion Priors

Paper page - PlatonicNav: Unveiling Semantic Correspondence in Navigation with Platonic Topological Maps