Search

Showing top 96 results for "AI in systems"

Filtered by topic: LLMs Clear ✕

All sources xda-developers.com 40 huggingface.co 16 developer.nvidia.com 11 theregister.com 5 spectrum.ieee.org 3 intel.com 3 techradar.com 2 techcrunch.com 2 404media.co 2 restofworld.org 1 blogs.windows.com 1 theverge.com 1

After self-hosting LLMs for a year, I realized that models are not the real bottleneck

…They still spit out broken JSON or missed crucial system variables because they were drowning in noise. Buying more VRAM to fix a structural problem is an expensive mistake. A giant model…

May 26, 2026 · Yash Patel

Paper page - Can RL Teach Long-Horizon Reasoning to LLMs? Expressiveness Is Key

…AI-generated summary Reinforcement learning (RL) has been applied to improve large language model (LLM) reasoning, yet the systematic study of how training scales with task difficulty has been hampered by the…

May 8, 2026

Paper page - Counting as a minimal probe of language model reliability

…AI-generated summary Large language models perform strongly on benchmarks in mathematical reasoning , coding and document analysis , suggesting a broad ability to follow instructions. However, it remains unclear whether such success reflects…

May 5, 2026

Pruning and Distilling LLMs Using NVIDIA TensorRT Model Optimizer | NVIDIA Technical Blog

…He brings a deep background in both AI software engineering and customer management, translating innovations into practical customer outcomes. Before NVIDIA, he held roles developing, breaking, and fixing AI solutions in the…

Oct 7, 2025 · Max Xu

Discussions and forums

r/LocalLLaMA · u/OttoRenner · 2d ago

Stop traumatizing AI into loops and turn hallucinations into an honest "I don't know!" by being NICE to them (Proof of Concept, Research, I don't want to sell anything)

!UPDATE!(20.05.2026) WE HAVE NEW NUMBERS FROM 1.500+ TESTS IT'S WORKING! check my update post https://www.reddit.com/r/LocalLLaMA/s/AyNOehjkYT Or the go straight to the my Github https://github.com/OttoRenner/Gentle-Codi…

r/LocalLLaMA · u/TumbleweedNew6515 · 4d ago

Update on 12x32gb sxm v100 cluster / local AI for legal drafting

Update from the lawyer with the V100 server. A few of you asked what I actually ended up running once the dust settled, so here it is. Still just a lawyer, still driving the whole thing through Claude Code, still not ful…

Paper page - A^2TGPO: Agentic Turn-Group Policy Optimization with Adaptive Turn-level Clipping

…from sparse rewards and challenges in credit assignment, which are addressed through A²TGPO that adapts information gain normalization, accumulation, and clipping for improved policy optimization. AI-generated summary Reinforcement learning for agentic…

May 8, 2026

Talk like a graph: Encoding graphs for large language models

…Various node and edge encodings were combined systematically. This led to functions like the ones in the following figure: Examples of graph encoding functions used to encode graphs via text. Analysis and…

Mar 12, 2024

To show you the most relevant results, we’ve omitted some entries very similar to those already shown. Repeat the search with the omitted results included.

‹ Prev 1 2 3 4 5 6 7 8 9 10

Followed topics