
Edge AI10 min read
Context vs. RAG: The Tension Nobody Talks About (Part 1 of 3)
Anyone deploying production AI is already using 1M context models. The interesting question is when to load everything into context versus when to retrieve. Vercel just ran the eval, and the answer is surprising.
Ethan Marsh·April 4, 2026