Specifications
| Property | Value |
|---|---|
| Parameters | 1.5B (1.2B LM + 115M audio encoder) |
| Context Length | 32K tokens |
| Audio Output | 24kHz |
| Supported Language | Japanese |
Japanese TTS
Natural Japanese speech synthesis
Japanese ASR
Japanese speech recognition
Voice Chat
Interleaved Japanese audio/text
Quick Start
- liquid-audio
- llama.cpp
Install:Multi-Turn Chat:Japanese ASR:Japanese TTS: