Search

Showing top 105 results for "real-time coding"

Smol2Operator: Post-Training GUI Agents for Computer Use

…I converted SmolVLM2-500M-Video-Instruct to gguf before using the tool provided in llamacpp codebase. So I suppose it can be converted smoothly -- but haven't done it this time. ok…

Apr 8, 2025 · Amir Mahla

Paper page - OVO-S-Bench: A Hierarchical Benchmark for Streaming Spatial Intelligence in Multimodal LLMs

…Real-Time Evaluation of Visual Streaming Assistant Models (2026) Please give a thumbs up to this comment if you found it helpful! If you want recommendations for any Paper on Hugging Face…

Jun 4, 2026

Paper page - Memory-Bound but Not Bandwidth-Limited: The Physical AI Inference Gap in Batch-1 LLM Decode

…Generated by Qwen/Qwen2.5-Coder-32B-Instruct Physical AI systems, including robots, autonomous vehicles, embodied agents and edge copilots, often run a different inference workload from cloud LLM serving: single-stream…

Jun 1, 2026

Paper page - Cosine Misleads: Auxiliary Losses Reshape Vision Language Models, Not Their Latents

…Generated by Qwen/Qwen2.5-Coder-32B-Instruct Latent visual reasoning (LVR) inserts supervised latent tokens between perception and answer generation in vision-language models (VLMs). The field uses alignment between these…

Jun 9, 2026

Paper page - Where Do Deep-Research Agents Go Wrong? Span-Level Error Localization in Agent Trajectories

…Benchmarking Process-side Anomalies in Real-world Agent Execution Trajectories (2026) FALAT: Tracing Failures in LLM Agent Trajectories via Dependency-Guided Search (2026) Time to REFLECT: Can We Trust LLM Judges for…

Jun 4, 2026

Top stories

Paper page - Test-Time Gradient Guidance of Flow Policies in Reinforcement Learning

Paper page - SwiftVR: Real-Time One-Step Generative Video Restoration
