--- license: apache-2.0 base_model: ibm-granite/granite-docling-258M tags: - text-generation - documents - code - formula - chart - ocr - layout - table - document-parse - docling - granite - extraction - math - gguf - llama.cpp - quantized language: - en pipeline_tag: image-text-to-text library_name: gguf quantized_by: bowserj --- # granite-docling-258M-GGUF This repository contains GGUF format quantized versions of [ibm-granite/granite-docling-258M](https://huggingface.co/ibm-granite/granite-docling-258M) for use with [llama.cpp](https://github.com/ggerganov/llama.cpp).
Granite Docling is a multimodal Image-Text-to-Text model engineered for efficient document conversion. This GGUF version enables fast CPU and GPU inference using llama.cpp, making it ideal for edge deployment and resource-constrained environments.