Track 1 · LLMs · Blue

LLMs for application builders

Everything you need before RAG or agents: how transformers work at inference time, how to call models safely, and how tokens and context windows drive cost and reliability. Read the guides in order—later chapters assume you can run a minimal chat completion and estimate token cost.

Start with LLMs explained All tracks

Guides in this track

Five deep-dive chapters plus this overview. All guides are live—read in order.

Reading order: LLMs explained → APIs & tokens → Embeddings → Multimodal → Model selection & cost

01

LLMs explained

Predict next token, transformer intuition, temperature, context windows, training pipeline, model families, first API call.

Foundation · Live
02

APIs, tokens & context

OpenAI, Claude, Bedrock, Gemini; token counting; structured output; error handling.

Guide · Live
03

Embeddings & semantic search

Dense vectors, cosine similarity, embedding models, ANN indexes, caching.

Guide · Live
04

Multimodal — vision, audio & documents

Image input, PDF pipelines, OCR, Whisper, structured document extraction.

Guide · Live
05

Model selection & cost optimization

Model cascades, prompt caching, batch API, local inference, cost templates.

Guide · Live

LLMs for application builders

LLMs explained

APIs, tokens & context

Embeddings & semantic search

Multimodal — vision, audio & documents

Model selection & cost optimization