leap-finetune) is Liquid’s repo for the full model customization loop: data preparation, training, evals, checkpointing, and export for Liquid Foundation Models (LFMs).
Use it when you want a repo-driven workflow that Claude Code or Codex can operate end to end from intent to runnable configs, tests, launches, and follow-up evals.
What it includes
| Area | Support |
|---|---|
| Training | SFT, DPO, GRPO, VLM SFT, VLM DPO, VLM GRPO, MoE SFT, MoE DPO, LoRA, and full fine-tuning |
| Evals | Training-time benchmark suites, standalone evals, HF/vLLM backends, and async sidecar or reserved eval workers |
| Data and rewards | Dataset loading, format validation, tool-calling data, VLM image roots, GRPO rewards, judge rewards, and OpenEnv-style RL environments |
| Launch and export | Local Ray, SLURM, Modal, KubeRay, checkpoint resume, Hugging Face export, and GGUF export |
Agent quickstart
Clone the repo and install the locked environment:AGENTS.md, CLAUDE.md, and mirrored skills under .agents/skills/ and .claude/skills/. Those files help the agent choose the right workflow for data prep, training configs, eval suites, rewards, backend launch, checkpoint inspection, and export.
Good starter prompts:
- “Train
LFM2-1.2Bwith SFT LoRA on the Hugging FaceHuggingFaceTB/smoltalkdataset, sweep learning rate and LoRA rank, and tell me which run looks best.” - “Fine-tune an LFM vision model on a chart question-answering dataset from Hugging Face, add a small eval suite, and show me the launch command.”
- “Evaluate my latest
support_sftcheckpoint with vLLM, compare it against the base model, and write the metrics toresults.json.” - “Set up a GRPO experiment that rewards valid JSON answers, run a small smoke test, and suggest the next config to try.”
CLI usage
Run from the repo environment when you are developing configs or changing LFT itself:uv tool:
leap_finetune from another uv project, add it as a dependency:
Python usage
run_config accepts YAML paths or typed config objects. A config with slurm, modal, or kuberay submits to that backend; otherwise local training expects visible CUDA devices.
Fine-tuning with evals
Fine-tuning with evals
Standalone evals
Standalone evals
Eval-only runs use
EvalRunConfig, not JobConfig, because they do not include a training dataset or training settings.short_answer, the final assistant turn is treated as the ground truth.