ShipboardAI
Tag: KV cache VRAM

1 post tagged with this topic.

Infrastructure · 14 min read

The Real Cost of Running 1M Context Locally (Part 2 of 3)

VRAM and KV cache math for 7B, 13B, 70B, and 405B models at 1M tokens, scaled by user count: a single crew member, the whole workforce, and the whole workforce plus guests. The numbers are uglier than the marketing suggests, and there is exactly one way out.

James Calder · April 7, 2026

© 2026 ShipboardAI. All rights reserved.
