Specifications
| Property | Value |
|---|---|
| Parameters | 8B (1.5B active) |
| Context Length | 128K tokens |
| Architecture | LFM2.5 (MoE) |
128K Context
Extended context window for long documents and conversations
MoE Efficiency
8B quality, 1.5B inference cost
Tool Calling
Native function calling for agentic workflows
Quick Start
- Transformers
- llama.cpp
- vLLM
- SGLang
Quick start with Transformers (compatible with
transformers>=5.0.0):