Specifications
| Property | Value |
|---|---|
| Parameters | 350M |
| Context Length | 32K tokens |
| Architecture | LFM2 (Dense) |
Ultra-Light
Minimal memory and compute footprint
Low Latency
Fastest inference in the LFM family
Edge-Ready
Runs on IoT and embedded devices
Quick Start
- Transformers
- llama.cpp
- vLLM
Install:Download & Run: