The fastest and easiest way to run AI locally
Local AI, built from the ground up. Trillim gives you total control over inference that never leaves your hardware.
Install
Performance
Fastest BitNet Inference Engine
[Benchmark chart: DarkNet vs. bitnet.cpp, showing Prefill (tokens/sec) and Decode (tokens/sec)]
*Benchmarked on a 12th Gen Intel i7-1255U with 10 threads
Docs
Install, chat, and more
Open documentation
Trillim's Tokens