Inside-the-stack
A deep dive into Inside-the-stack, a curated series from my software engineering journey.
Articles
Cloud LLM Playbook - When You Should Use Cloud Instead of Local Models
- #ai
- #llm
- #cloud
- #openrouter
- #inference
12 Feb 2025 | 2 min read
InsideTheStack - The Kickoff
- #engineering
- #systems
- #career
- #ai
- #buildinpublic
29 Nov 2025 | 4 min read
KV Cache - The Secret Weapon That Makes LLMs Feel Instant
- #ai
- #llm
- #kv-cache
- #systems
- #inference
04 Feb 2025 | 3 min read
Local LLM Playbook - Run Strong Models On Your Machine Without a GPU
- #ai
- #llm
- #local-ai
- #ollama
- #quantization
- #m4pro
08 Feb 2025 | 4 min read
Coding Models Qwen2.5 vs GPT vs Claude 4.5 and Why Claude Changes the Entire Game
- #ai
- #llm
- #coding
- #engineering
- #claude45
- #qwen
- #gpt
15 Feb 2025 | 7 min read
How Tokenization Actually Works - The Hidden Layer Behind Every LLM
- #ai
- #llm
- #tokenization
- #systems
- #aiengineering
01 Feb 2025 | 3 min read