HomeTools › AI Hardware Requirements Calculator

AI Hardware Requirements Calculator

This AI hardware requirements calculator estimates the RAM, CPU, and storage needed to run a local large language model on your NAS or home server based on model size, use case, and concurrent users. Compares on-device inference cost against cloud API pricing in AUD.

Find out exactly how much RAM, CPU, and processing power you need to run local AI models on a NAS - then compare the 3-year cost against paying for GPT-4o or Claude in AUD. Enter your model size and use case below.

Why Run AI Locally on a NAS?

At moderate use (500 queries/day), Claude or GPT-4o costs $1,300 to $2,000 AUD per year in API fees. A NAS capable of running a 7B parameter model typically costs $800 to $1,200 AUD upfront and pays for itself within 12 to 18 months. This calculator gives you the exact break-even figure for your specific usage pattern and AU electricity rate.

Privacy is the second reason. When you query a cloud AI, your prompts, documents, and outputs leave Australia and enter US jurisdiction under US data laws. Local inference on a NAS keeps everything on-premises. For businesses handling client data, this is directly relevant to the Australian Privacy Act 2024 and any data residency obligations your contracts require.

NBN latency is the third factor. A cloud AI round-trip from Australia to US servers adds 150-300ms of network latency per response. Local inference is bounded only by your NAS CPU or NPU speed, not by your upload plan. On NBN 25 or NBN 50, cloud AI feels slower than advertised. Local inference on a capable NAS removes that constraint entirely.

What RAM Do You Actually Need?

The rule of thumb is simple: you need roughly 1 GB of RAM per billion parameters at Q4 quantization, plus 2 GB overhead for the operating system and inference engine. A 7B model needs 5-6 GB RAM. A 13B model needs 9-10 GB. A 70B model needs 38-42 GB, which puts it out of reach of most consumer NAS hardware and into dedicated server territory.

This calculator uses Q4_K_M quantization throughout. This is the practical default for home and SMB NAS hardware: it reduces RAM requirements by roughly 75% versus full-precision weights, with a quality penalty most users cannot distinguish from the full-precision output. If you are running RAG (Retrieval-Augmented Generation), add 20-50% overhead on top of the base model RAM figure to account for the embedding model and vector database.

When Local AI on a NAS Is Not the Right Choice

Image generation models (Stable Diffusion, FLUX) require a dedicated GPU. The compute requirements are an order of magnitude higher than text LLMs, and NAS hardware cannot run them at any practical speed. Fine-tuning also requires full-precision weights and optimizer states, typically 4 to 6 times the inference RAM, making it impractical on consumer NAS hardware. This calculator covers text LLM inference only.

3B
7B
13B
30B
70B
e.g. Phi-3 Mini, Llama 3.2 3B, fastest on NAS hardware
RAG adds vector store RAM overhead. Fine-tuning is rarely practical on NAS.
Each concurrent request holds a copy of the model context in RAM.
Used only for the cloud API cost comparison below.

RAM Requirements

Minimum
GB RAM
Model loads but headroom is tight
Recommended
GB RAM
Comfortable operation with your workload

Hardware Assessment

Recommended NAS Models (AU pricing, March 2026)

3-Year Cost Comparison (AUD)

Estimated Annual Power Cost (AU, NSW rate ~$0.35/kWh)

Full running cost breakdown → NAS Power Calculator

Australian Context

USD-priced cloud AI: GPT-4o and Claude are billed in USD. At current exchange rates (~$1 USD = ~$1.55 AUD), a moderate usage pattern costs $1,300-$2,000 AUD/year, more if you're running team workflows. One-time NAS hardware typically breaks even within 12-18 months.

Privacy Act 2024 implications: When you query a cloud AI, your data leaves AU soil and enters US jurisdiction. Local inference on a NAS keeps all data, queries, documents, and outputs, within your premises. No data retention, no model training on your inputs.

NBN upload constraints: Cloud AI round-trips add latency for AU users. Local inference is bounded only by your NAS CPU/NPU speed, typically 2-30 tokens/second depending on model size and hardware.

Frequently Asked Questions