Offline-First Academic Intelligence.
Unlimited.
100%
Private.
Sage integrates llama.cpp inference runtimes, ONNX embeddings, undergrad CS course materials, and open-weight LLMs into standalone, zero-dependency distributions.
Identify Your Ideal Build Configuration
Input your local system specifications to estimate expected token generation speeds and determine the optimal Sage distribution tier.
Sage Build Staging Distributions
Select and download the pre-compiled installer package corresponding to your hardware capabilities.
Our flagship GPU tier. Pre-configures a CUDA 12.4 backend with Qwen-3.5 4B (core LLM) and 0.8B (reasoning agent). Optimized for advanced coding, planning, and real-time inference.
The standard CPU package. Deploys an AVX2-optimized llama.cpp execution engine preloaded with Qwen-3.5 2B (core LLM) and 0.8B (reasoning agent) for private local setups.
Minimal GPU runner engine. Pre-packages CUDA llama.cpp servers, ONNX embedding pipelines, and Typst. Excludes preloaded model weights to support custom user GGUF configurations.
Minimal CPU runner engine. Pre-packages AVX2 llama.cpp servers, ONNX embedding pipelines, and Typst. Excludes preloaded model weights to support custom user GGUF configurations.
Technical Staging Matrix
Inspect the deep technical composition and architectural capabilities mapped across all four package releases.
| Technical Attributes |
FLAGSHIP
Sage Pro
GPU Full
|
Sage Fast
CPU Full
|
Sage Pro-Lite
GPU Engine
|
Sage Fast-Lite
CPU Engine
|
|---|---|---|---|---|
| Core Infrastructure | ||||
| Execution Model | NVIDIA CUDA Acceleration | CPU Optimized Threading | NVIDIA CUDA Acceleration | CPU Optimized Threading |
| Download Archive Size | ~5.0 GB | ~3 GB | ~250 MB | ~200 MB |
| Hardware Requirements | NVIDIA GPU (CUDA 12.4+) 16GB+ RAM, 5GB SSD space |
x86_64 CPU (AVX2 Support) 8GB+ RAM, 4GB disk space |
NVIDIA GPU (CUDA 12.4+) 8GB+ RAM, 1GB disk space |
x86_64 CPU (AVX2 Support) 8GB+ RAM, 1GB disk space |
| LLM & Embedding Models | ||||
| Primary LLM Model | Qwen-3.5 4B GGUF (Q4_K_M) Higher intelligence, advanced coding |
Qwen-3.5 2B GGUF (Q4_K_M) Super lightweight, rapid response |
None Pre-packaged Download & Add Manually |
None Pre-packaged Download & Add Manually |
| Utility LLM Model | Qwen-3.5 0.8B GGUF (Q4_K_M) Pre-packaged for auxiliary tasks |
Qwen-3.5 0.8B GGUF (Q4_K_M) Pre-packaged for auxiliary tasks |
None Pre-packaged Download & Add Manually |
None Pre-packaged Download & Add Manually |
| Embedding Engine | bge-small-en-v1.5 ONNX Q | bge-small-en-v1.5 ONNX Q | bge-small-en-v1.5 ONNX Q | bge-small-en-v1.5 ONNX Q |
| Server & Runtimes | ||||
| LLM Server Binary | llama.cpp b9010 CUDA 12.4 CUDA Included |
llama.cpp b9010 CPU x64 Custom stripped executables |
llama.cpp b9010 CUDA 12.4 Custom stripped executables |
llama.cpp b9010 CPU x64 Custom stripped executables |
| Document Compiler | Typst v0.13.1 CLI | Typst v0.13.1 CLI | Typst v0.13.1 CLI | Typst v0.13.1 CLI |
| Python Environment | CPython 3.12.9 Standalone Stripped installation, 100% portable |
CPython 3.12.9 Standalone Stripped installation, 100% portable |
CPython 3.12.9 Standalone Stripped installation, 100% portable |
CPython 3.12.9 Standalone Stripped installation, 100% portable |
| Best Suited Cases | ||||
| Primary Audience | GPU Power Users | Standard Laptop/Desktop Users | Standard Laptop/Desktop Users | Standard Laptop/Desktop Users |
| Generation Latency | ⚡ Real-time (Extremely Fast) | 🚀 Smooth (12-18 tokens/sec) | ⚡ Real-time (Extremely Fast) | 🚀 Smooth (12-18 tokens/sec) |
SHA-256 Release Signatures
Verify the absolute integrity of your downloaded package. Compare your calculated SHA-256 hash against the official build signatures below.
Official SHA256SUMS.txt
# Official SHA256 signatures for Sage v0.1.0
8ef9a09bf9c0520ee914699223aaf42c8b7c3cab2bf3d69c355048d4a0ee9973d sage-pro-0.1.0-windows-x86_64.zip
411c4d17a6505c210f4b977450420f630fbe7d9db7942dea809f077976968ef90 sage-pro-0.1.0-windows-x86_64.exe
00fe7986ff5f6b463e62455821146049db6f9313603938a70800d1fb69ef11a41 sage-pro-0.1.0-windows-x86_64.bin
90ee9973d48f16c731c0520ee914699223aaf42c8b7c3cab2bf3d69c355048d4a sage-fast-0.1.0-windows-x86_64.zip
430f2107d69aa6fe22623fcdcb5a01f5b2126d16f6c2606fb52e5ff4db09bf90a sage-fast-0.1.0-windows-x86_64.exe
bd258782e35f7f458f8aced1adc053e6e92e89bc735ba3be89d38a06121dc517a sage-fast-0.1.0-windows-x86_64.bin
c8938d4834b44358871698a7a8c050ad9769c60a4aa14a7e862455821146049db sage-pro-lite-0.1.0-windows-x86_64.zip
52414b40932449029e2e3adb8f7a8f244e53b073373f41f785bd6828ab574115a sage-pro-lite-0.1.0-windows-x86_64.exe
4e53b073373f41f785bd6828ab57411552414b40932449029e2e3adb8f7a8f24a sage-fast-lite-0.1.0-windows-x86_64.zip
351887614dd249d2860e4a5f8dcbe5936558b7e8038248bf4a5f8dcbe5936558b sage-fast-lite-0.1.0-windows-x86_64.exe