Reference Reference

📚 Sources

Sources

Every module in this curriculum is built on the work of researchers, engineers, bloggers, and communities who publish openly. This page collects the sources we used, organized by module.


Module 1: Get Running

Tools

Models

Hardware and Emerging Techniques

General Guides


Module 2: Choose Wisely

Training Quality vs Model Size

Quantization

Model Naming

Inference Engines

Memory Bandwidth

Mixture of Experts


Module 3A: How Models Think

Transformer Architecture

Attention Mechanism

Tokenization

Embeddings

Scaling Laws

Temperature & Sampling

General


Module 3B: How Models Fit

Quantization

KV Cache & Memory

TurboQuant

LLM in a Flash

Alternative Architectures

Inference Optimization


Module 4: What Can You Do With This?

Chat Interfaces

Document Q&A / RAG

Coding Agents

Agent Concepts

Agent Security

Automation


Module 5: Agents

Foundations

Agent Frameworks

Coding Agents

Multi-Agent Orchestration

Reliability and Security

No-Code Platforms


Module 6: Build Custom Tools

APIs and SDKs

Function Calling / Tool Use

MCP (Model Context Protocol)

RAG (Retrieval-Augmented Generation)

Testing and Evaluation


Module 7: Local + Cloud

Decision Frameworks

Cost Optimization

Model Routing

Pricing References


Module 8: Supercharge Your Setup

Skills and Plugins

MCP Ecosystem

IDE Configuration

Community


Module 9: Go Further

Image Generation

Video Generation

Audio and TTS

Music Generation

Benchmarks and Evaluation

Fine-Tuning


Module 10: What’s Next

Compression Research

On-Device Inference

Speculative Decoding

Alternative Architectures

Model Context Protocol

Foundational Papers Referenced