🧠 RAM is king for CPU inference. More = bigger models.
⚡ Apple Silicon uses unified memory — great for local AI.
🖥️ VRAM determines GPU-accelerated model size.
📦 Quantized models (Q4, Q5) need roughly a quarter to a third of the RAM of 16-bit weights.
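To put rough numbers on the tips above, here is a minimal sketch that estimates the size of a model's weights at different precisions. The bits-per-weight figures are approximations in the spirit of llama.cpp-style quants, and the function name `weight_gb` is illustrative; actual memory use is higher once you add the KV cache and runtime overhead.

```python
# Approximate bits per weight for common formats (illustrative values;
# K-quants carry some metadata overhead beyond their nominal bit width).
BITS_PER_WEIGHT = {
    "FP16": 16.0,
    "Q8_0": 8.5,
    "Q5_K_M": 5.5,
    "Q4_K_M": 4.8,
}

def weight_gb(params_billion: float, fmt: str) -> float:
    """Estimated size of the weights alone, in GiB (no KV cache, no overhead)."""
    total_bits = BITS_PER_WEIGHT[fmt] * params_billion * 1e9
    return total_bits / 8 / 2**30  # bits -> bytes -> GiB

for fmt in BITS_PER_WEIGHT:
    print(f"7B @ {fmt}: ~{weight_gb(7, fmt):.1f} GiB")
```

For a 7B model this works out to roughly 13 GiB at FP16 versus about 4 GiB at Q4, which is why quantized models fit comfortably in unified memory or modest VRAM.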