Compares
Gemma 4 on Your Machine: How Google’s New Open Weights Stack Up (Model Showdown)
Same Weights, Different Results
Will Schenk March 24, 2026
Can LLMs Use Real-World Tools? Mercury-2, ELO, and the Umwelten Setup
Will Schenk March 15, 2026
The Car Wash Test: Learning from Model Evals
Will Schenk March 1, 2026
June 2025 Coding Agent Report
Will Schenk June 15, 2025
Moral Vibe Check
Will Schenk March 24, 2025
Agentic YOLO with Warp, Cursor, and Claude
Will Schenk March 7, 2025
Slicing up a design from figma
Will Schenk November 27, 2024