Search

Showing top 142 results for "Gemma 4 local use"

Related topics: Google Gemma

Tracked topic

Google Gemma

Gemma is a family of open-weight language models released by Google for text generation and related NLP tasks.

119 articles indexed Last updated 15h ago See topic hub
tomshardware.com › tech-industry › artificial-intelligence

Maker packs an opinionated, googly-eyed AI chatbot into a mobile suitcase, powered by an Nvidia Jetson — entirely local machine entity runs Gemma 4 E4B and can respond in 200ms

Gemma 4 E4B, ~200ms cached TTFT, 30+ sensors, no WiFi/BT/cellular. He has opinions. from r/LocalLLaMA In the r/LocalLLaMA subreddit, CreativelyBankrupt outlines the ‘recipe’ for this characterful digital companion. “Sparky runs entirely on the Jetson. …

May 17, 2026 · Mark Tyson

Top stories

Discussions and forums

r/docker · u/CreativeCollege2815 · 2w ago

Using a Gemma4 Safetensor Already Downloaded Locally

Hi everyone. I need some help or advice. I’m learning how to use N8N, so I downloaded Docker and installed N8N locally. I also wanted to install Gemma4, which I use in ComfyUI to help with image generation prompts. Is it…

r/LocalLLaMA · u/gladkos · May 1, 2026

Qwen 3.6 27B vs Gemma 4 31B - making Packman game!

Gemma just crushed Qwen in a local LLM gamedev contest! Device: MacBook Pro M5 Max, 64GB RAM Qwen 3.6 27B: 32 tokens/sec · 18m 04s · 33,946 tokens. Gemma 4 31B: 27 tokens/sec · 3m 51s · 6,209 tokens. So what is more impo…

Hacker News · u/lostathome · 1w ago

Show HN: Hitoku Draft – Context aware local assistant

Hi guys.I have been working on Hitoku Draft, an open-source, voice-first AI assistant that runs entirely locally. I posted about it already, and now it has also transcription with voice editing. Looking for feedback, as …

15 1
Hacker News · u/limondas · 15h ago

Ask HN: Any Local LLM can I run without GPU for Local Agentic workflow AI?

Claude Code like agentic workflow ai too costly for me.Any LLM can I run with VSCode at the below setup? 16ram Intel core i7 h processor 13gen 512gb NVMe SSD I want to run the ai as local agentic workflow with Vscode.I w…

5 2
r/LocalLLaMA · u/gladkos · May 8, 2026

Multi-Token Prediction (MTP) for LLaMA.cpp - Gemma 4 speedup by 40%

Implemented Multi-Token Prediction for LLaMA.cpp. Quantized Gemma 4 assistant models into GGUF format. Ran tests on a MacBook Pro M5Max. Gemma 26B with MTP drafts tokens 40% faster. Prompt: Write a Python program to find…