# Invoice extractor tool

[Browse the complete example on GitHub](https://github.com/Liquid4All/cookbook/tree/main/examples/invoice-parser)

A Python CLI that extracts payment details from invoice images.

![](https://github.com/Liquid4All/cookbook/raw/main/examples/invoice-parser/media/diagram.jpg)

This is a practical example of building local AI tools and apps with

* No cloud costs
* No network latency
* No data privacy loss

## What's inside?

In this example, you will learn how to:

* **Set up local AI inference** using llama.cpp to run Liquid models entirely on your machine without requiring cloud services or API keys
* **Build a file monitoring system** that automatically processes new files dropped into a directory
* **Extract structured output from images** using [LFM2.5-VL-1.6B](https://docs.liquid.ai/lfm/models/lfm25-vl-1.6b), a small vision-language model

## Environment setup

You will need

* [llama.cpp](https://github.com/ggerganov/llama.cpp) to serve the language models locally.
* [uv](https://docs.astral.sh/uv/) to manage Python dependencies and run the application without creating virtual environments manually.

### Install llama.cpp

**macOS:**

```bash
brew install llama.cpp
```

**Linux (build from source):**

```bash
git clone https://github.com/ggerganov/llama.cpp.git
cd llama.cpp
cmake -B build
cmake --build build --config Release
# Add build/bin to your PATH
```

**Windows:** Follow the [instructions](https://github.com/ggml-org/llama.cpp/blob/master/docs/install.md) in the llama.cpp repository.

Verify that `llama-server` is available:

```bash
which llama-server
```

### Install uv

**macOS/Linux:**

```bash
curl -LsSf https://astral.sh/uv/install.sh | sh
```

**Windows:**

```powershell
powershell -ExecutionPolicy ByPass -c "irm https://astral.sh/uv/install.ps1 | iex"
```

## How to run it?

Let's start by cloning the repository:

```sh
git clone https://github.com/Liquid4All/cookbook.git
cd cookbook/examples/invoice-parser
```

The tool supports two modes: **watch** (continuous monitoring) and **process** (one-shot). The tool automatically starts and stops `llama-server` for you, so there is no need to run it separately.

### Watch mode

Run it as a background service that continuously monitors a directory and automatically parses invoice images as they land in the folder:

```sh
uv run python src/invoice_parser/main.py watch \
  --dir invoices/ \
  --image-model LiquidAI/LFM2.5-VL-1.6B-GGUF:Q8_0 \
  --output bills.csv \
  --process-existing
```

### Process mode

Process specific files or folders and exit:

```sh
# Process an entire folder
uv run python src/invoice_parser/main.py process \
  --image-model LiquidAI/LFM2.5-VL-1.6B-GGUF:Q8_0 \
  invoices/

# Process specific files
uv run python src/invoice_parser/main.py process \
  --image-model LiquidAI/LFM2.5-VL-1.6B-GGUF:Q8_0 \
  invoices/water_australia.png invoices/british_gas.png

# Save results to a CSV file
uv run python src/invoice_parser/main.py process \
  --image-model LiquidAI/LFM2.5-VL-1.6B-GGUF:Q8_0 \
  --output bills.csv \
  invoices/
```

Feel free to modify the path to the invoices directory and the model IDs to suit your needs.

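
For readers curious how a watch mode like the one described above might detect new files, here is a minimal polling sketch using only the Python standard library. It is not the repository's implementation, which may use a different mechanism such as filesystem events. The names `find_new_images`, `watch`, `IMAGE_SUFFIXES`, and the `max_polls` parameter are hypothetical, and the reading of `--process-existing` as "also handle files already in the folder at startup" is an assumption.

```python
import time
from pathlib import Path
from typing import Callable, List, Optional, Set

# Assumption: the set of image types to pick up. The sample invoices
# in this example are .png and .jpg files.
IMAGE_SUFFIXES = {".png", ".jpg", ".jpeg"}


def find_new_images(directory: Path, seen: Set[Path]) -> List[Path]:
    """Return image files in `directory` that are not in `seen`, then record them."""
    new_files = sorted(
        path
        for path in directory.iterdir()
        if path.is_file()
        and path.suffix.lower() in IMAGE_SUFFIXES
        and path not in seen
    )
    seen.update(new_files)
    return new_files


def watch(
    directory: Path,
    handle: Callable[[Path], None],
    process_existing: bool = False,
    interval: float = 1.0,
    max_polls: Optional[int] = None,
) -> None:
    """Poll `directory` and call `handle` once for each new image file.

    `max_polls` is a hypothetical knob that stops the loop after a fixed
    number of polls; None means run until interrupted.
    """
    seen: Set[Path] = set()
    if not process_existing:
        # Mark files that are already present as seen so they are skipped.
        find_new_images(directory, seen)

    polls = 0
    while max_polls is None or polls < max_polls:
        for path in find_new_images(directory, seen):
            handle(path)
        polls += 1
        time.sleep(interval)
```

Calling `watch(Path("invoices"), print, process_existing=True)` would print each image path once as it appears. In a real tool, `handle` would send the image to the model and append the extracted fields to the output CSV.
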
If you have `make` installed, you can run the application with the following commands:

```sh
make run      # watch mode
make process  # one-shot process mode
```

## Results

You can run the tool on the sample images under `invoices/` with:

```sh
uv run python src/invoice_parser/main.py process \
  --image-model LiquidAI/LFM2.5-VL-1.6B-GGUF:Q8_0 \
  invoices/
```

or, if you have `make` installed, run:

```sh
make process
```

The results are printed to the console. The model extracts the correct details from all 4 sample invoices:

| File                          | Utility     | Amount | Currency |
| ----------------------------- | ----------- | ------ | -------- |
| water\_australia.png          | water       | 68.46  | AUD      |
| Sample-electric-Bill-2023.jpg | electricity | 28.32  | USD      |
| castlewater1.png              | water       | 436.55 | GBP      |
| british\_gas.png              | electricity | 81.31  | GBP      |

## Next steps

The model handles all four sample invoices correctly out of the box. However, depending on your specific invoice formats and layouts, you may encounter cases where the extraction is not accurate enough. In those cases, you can fine-tune the model on your own dataset to improve accuracy.

## Need help?

Connect with the community and ask questions about this example.