# Georgi Gerganov Georgi Gerganov is a Bulgarian software engineer, creator of llama.cpp and the ggml tensor library, and one of the most consequential figures in the local-AI movement. His work made it practical to run large language models on consumer hardware — laptops, phones, and even microcontrollers — by combining extreme performance optimization with pragmatic quantization and a permissive, open-source ethos. The GGUF file format he co-designed has become a de-facto standard for distributing open-weight models to the local-inference ecosystem. He runs ggml.ai and remains the lead maintainer of llama.cpp, which is consumed by tools like Ollama, LM Studio, and countless downstream wrappers. A physicist by training, he became widely known for the meteoric rise of llama.cpp in 2023 and has since been one of the most cited open-source contributors in the AI space. ## Quotes <!-- QueryToSerialize: LIST FROM #type/quote AND [[Georgi Gerganov]] WHERE public_note = true SORT file.name ASC --> ## Books <!-- QueryToSerialize: LIST FROM #type/book AND [[Georgi Gerganov]] WHERE public_note = true SORT file.name ASC --> ## Related - [[Artificial Intelligence (AI)]] - [[Large Language Models (LLMs)]] - [[Running AI Models Locally]] - [[Ollama]] - [[LM Studio]] - [[AI Quantization]] ## References - https://github.com/ggerganov - https://x.com/ggerganov - https://ggerganov.com/