After self-hosting LLMs for a year, I realized that models are not the real bottleneck
… I started adding strict rules and constraints Earlier, my prompts were way too open-ended, which let the model wander off-task or become overly verbose. Now, I explicitly include small, precise rules depending on what I need: Keep paragraphs short. …