Search: coding improvements

Paper page - Turning Drift into Constraint: Robust Reasoning Alignment in Non-Stationary Environments

…Xiaoyu Yang Abstract A novel framework called Autonomous Preference Optimization (APO) is proposed to address reasoning alignment challenges in multi-modal large language models under concept drift conditions, achieving improved robustness and…

May 7, 2026

Paper page - Causal Forcing++: Scalable Few-Step Autoregressive Diffusion Distillation for Real-Time Interactive Video Generation

…https://huggingface.co/zhuhz22/Causal-Forcing And the full-stack open-source code: https://github.com/thu-ml/Causal-Forcing We release 2-step frame-wise AR model with 50% latency and…

May 15, 2026

Paper page - A^2TGPO: Agentic Turn-Group Policy Optimization with Adaptive Turn-level Clipping

…Self-Evaluated Process Rewards for Retrieval-Augmented Agents (2026) Improving Search Agent with One Line of Code (2026) Enhancing LLM-based Search Agents via Contribution Weighted Group Relative Policy Optimization (2026) PRAISE…

May 8, 2026

Post-Training Isaac GR00T N1.5 for LeRobot SO-101 Arm

…Hope that helps 🤗 Great job! I would like to repeat this with my SO-100 arm. Do you have the remote inference code? I don't understand why robotic models are not…

Sep 17, 2025

Paper page - Mela: Test-Time Memory Consolidation based on Transformation Hypothesis

…Lungchuan Chen Abstract A memory-augmented transformer architecture called Mela incorporates hierarchical memory modules inspired by human memory consolidation processes, enabling improved long-context language modeling through multi-granularity memory representations. AI…

May 12, 2026

Paper page - Large Language Models Explore by Latent Distilling

…Across math, science, code, and creative writing benchmarks, ESamp improves diversity and Pass@k efficiency while preserving strong throughput through an asynchronous implementation in tLLM. We hope ESamp can be a useful…

Apr 29, 2026

Paper page - MatryoshkaLoRA: Learning Accurate Hierarchical Low-Rank Representations for LLM Fine-Tuning

…Our code is available at https://github.com/IST-DASLab/MatryoshkaLoRA. We propose MatryoshkaLoRA, a general, Matryoshka-inspired training framework for…

May 11, 2026

Paper page - LASE: Language-Adversarial Speaker Encoding for Indic Cross-Script Identity Preservation

…An ECAPA+GRL ablation shows the GRL objective improves either backbone but the WavLM choice contributes too. In synthetic multi-speaker diarisation, LASE matches ECAPA-TDNN on cross-script speaker recall (0…

May 4, 2026

Open R1: Update #2

…By combining rule-based verification (Math Verify) with LLM-based evaluation, we improve dataset quality while maintaining scale. The final dataset consists of 220k problems with verified reasoning traces, making it a…

Feb 6, 2025

GGML and llama.cpp join HF to ensure the long-term progress of Local AI

…finally, something good about living in modern world, you guys are awesome! "...it is crucial to improve and simplify the way in which casual users deploy and access local models. We will…

Feb 20, 2026 · Georgi Gerganov
