Technical
12 May 2026
55 min read
Interactive tutorial
50% human written
Authorship mix
Honest estimate of who wrote what. The green bar is my human-written share — drafting, editing, structure, voice. The red stripes are AI's share — synthesis, scaffolding, first drafts. 50% human, 50% AI on this one.
A section-by-section walk-through of the methodology in "Cost Modeling for Public-Facing LLM Chat Applications" — the naive per-token formula and why it lies, the structural corrections that break it (caching, traffic shape, segments, daily caps), the six equations that replace it, and the worked geospatial-Q&A example that ties them together.
Technical
12 May 2026
5 min read
Interactive tutorial
0% human written
Authorship mix
Honest estimate of who wrote what. The green bar is my human-written share — drafting, editing, structure, voice. The red stripes are AI's share — synthesis, scaffolding, first drafts. 0% human, 100% AI on this one.
Twenty-three interactive chapters that walk language models end-to-end — from what tokens are and how attention works, through retrieval and context engineering, to the production techniques (caching, tools, evals), inference and serving, and the choice of model itself, closing with foundation models beyond language. Read in order or jump to the chapter that matches what you are building.
Technical
07 May 2026
138 min read
20% human written
Authorship mix
Honest estimate of who wrote what. The green bar is my human-written share — drafting, editing, structure, voice. The red stripes are AI's share — synthesis, scaffolding, first drafts. 20% human, 80% AI on this one.
A single-page, progressive tutorial for engineers and analysts who want to build their own AI-agent cost calculator. Starts at "what is a token?" and ends at a reference architecture you can clone — data model, formula library, and a validation harness. Implementation-agnostic; one worked reference lives at calc.ajinkya.ai.