// Android Device Check
Samsung, Pixel, OnePlus, Motorola — the answer depends on the chipset, not the brand. Snapdragon, Tensor, Dimensity and Exynos behave very differently under a sustained AI load. Pick your phone for an honest breakdown.
Not sure which chip? Settings > About phone > Processor, or check the spec sheet for your exact model.
Settings > About phone > RAM, or check your model's spec sheet. Ignore "RAM Plus" / virtual RAM — swap to UFS storage is far too slow for live inference.
On Android, tiers are driven by chipset and sustained thermals first, RAM second. A flagship SoC that throttles after 60 seconds can land below a cooler mid-range chip for long sessions — and the manufacturer's firmware decides how aggressively that throttling and background-killing happens.
// Before you start
Android isn't one platform — it's hundreds of hardware-and-firmware combinations. These six factors decide whether on-device AI is smooth or painful.
// How to run it
// The four tiers
Local AI on Android is private by default. llama.cpp and Ollama in Termux run entirely offline — no API key, no account, no telemetry. Your prompts stay on your phone.