r/netsec
· u/Fickle-Box1433
· 4d ago
I evaluated 5 LLM agents on patching real-world CVEs. Here is what I found.
I built an independent benchmark with 20 real CVEs across 15 CWE categories, 5 models (3 OpenAI, 2 Poolside Laguna), three prompt conditions: full advisory, behavioral description only, and location only (file and functi…
Hacker News
· u/adnan9999
· 1w ago
Show HN: Unsiloed AI – #1 on olmOCR-Bench
Most of the document parsers fail on real world challenges like complex tables, handwritten documents, historical document scans, equations, multi-column layouts, complex reading order, etc. We built Unsiloed Parser to h…
r/Games
· u/Turbostrider27
· 2w ago
LEGO Batman: Legacy of the Dark Knight Review Thread
Game Information Game Title: LEGO Batman: Legacy of the Dark Knight Platforms: Nintendo Switch 2 (May 22, 2026) PlayStation 5 (May 22, 2026) Xbox Series X/S (May 22, 2026) PC (May 22, 2026) Trailer: Developer: Review Agg…