Showing top 106 results for "new consumer hardware"

Discussions and forums

Hacker News · u/HenryNdubuaku · 2w ago

Show HN: Needle: We Distilled Gemini Tool Calling into a 26M Model

Hey HN, Henry here from Cactus. We open-sourced Needle, a 26M parameter function-calling (tool use) model. It runs at 6000 tok/s prefill and 1200 tok/s decode on consumer devices. We were always frustrated by the little e…

736 207
Hacker News · u/dnosoz · 4w ago

Show HN: I built a 2nd-order PyTorch optimizer for LLMs that runs on 16GB GPUs

Hi HN, I'm Danilo. I've been struggling with the limitations of AdamW when fine-tuning LLMs locally. Second-order optimizers (like Shampoo or SOAP) offer significantly better step-convergence by exploiting Kronecker-facto…

2 4
r/truenas · u/RaMcHiP · 1w ago

Building My First Serious NAS / Media Server – Looking for Opinions Before I Pull the Trigger

I’m putting together a NAS/media server/homelab box and wanted to get some opinions before I finalize everything. The goal is "affordable", high-capacity storage, Plex/transcoding, 10Gb networking, and room for future ex…

r/DataHoarder · u/HeavySpell7989 · 3w ago

Windrose (Steam Early Access) writes 35 GB/hr to disk while idle, even after the "fix" — SMART data from a dedicated server

Sharing measurements in case it's useful to others running dedicated servers or just curious about the post-patch behavior of this game. Context: Windrose launched on Steam EA two weeks ago. Pixel Operative, TechSpot and…

Hacker News · u/zambelli · 1w ago

Show HN: Forge – Guardrails take an 8B model from 53% to 99% on agentic tasks

Hi HN, I'm Antoine Zambelli, AI Director at Texas Instruments. I built Forge, an open-source reliability layer for self-hosted LLM tool-calling. What it does: - Adds domain-and-tool-agnostic guardrails (retry nudges, step e…

660 240