inside-the-stack

Inside-the-stack

Deep dive into inside-the-stack a curated series from my software engineering journey.

Articles

Cloud LLM Playbook - When You Should Use Cloud Instead of Local Models

#ai
#llm
#cloud
#openrouter
#inference

12 Feb 2025 | 2 min read

InsideTheStack - The Kickoff

#engineering
#systems
#career
#ai
#buildinpublic

29 Nov 2025 | 4 min read

KV Cache - The Secret Weapon That Makes LLMs Feel Instant

#ai
#llm
#kv-cache
#systems
#inference

04 Feb 2025 | 3 min read

Local LLM Playbook - Run Strong Models On Your Machine Without a GPU

#ai
#llm
#local-ai
#ollama
#quantization
#m4pro

08 Feb 2025 | 4 min read

Coding Models Qwen2.5 vs GPT vs Claude 4.5 and Why Claude Changes the Entire Game

#ai
#llm
#coding
#engineering
#claude45
#qwen
#gpt

15 Feb 2025 | 7 min read

How Tokenization Actually Works - The Hidden Layer Behind Every LLM

#ai
#llm
#tokenization
#systems
#aiengineering

01 Feb 2025 | 3 min read