Models
Gemma 4 on Your Machine: How Google’s New Open Weights Stack Up (Model Showdown) Gemma 4 on Your Machine: How Google’s New Open Weights Stack Up (Model Showdown)
Same Weights, Different Results Same Weights, Different Results
Will Schenk March 24, 2026
Can LLMs Use Real-World Tools? Mercury-2, ELO, and the Umwelten Setup Can LLMs Use Real-World Tools? Mercury-2, ELO, and the Umwelten Setup
Will Schenk March 15, 2026
Sraffa's Gesture, the Crack in the Crystal, and Why the Stochastic Parrot Still Bites Sraffa's Gesture, the Crack in the Crystal, and Why the Stochastic Parrot Still Bites
Will Schenk March 12, 2026
The Car Wash Test: Learning from Model Evals The Car Wash Test: Learning from Model Evals
Will Schenk March 1, 2026
gpt5 is smarter than you are gpt5 is smarter than you are
Will Schenk September 4, 2025
Code Generation with Local Models Code Generation with Local Models
Will Schenk August 20, 2025
gpt-5 and gpt-oss gpt-5 and gpt-oss
Will Schenk August 13, 2025
How I classify models How I classify models
Will Schenk January 21, 2025
AI for research: DeepResearch a clear winner AI for research: DeepResearch a clear winner
Will Schenk January 12, 2025
Learning on the go with NotebookLM Learning on the go with NotebookLM
Will Schenk January 9, 2025
Subscribe to our newsletter
Ready to ship production AI?
Whether you need a quick Vibe Check or a full Habitat built on Habitat OS, we'd love to hear what you're working on.










