# Georgi Gerganov
Georgi Gerganov is a Bulgarian software engineer, the creator of llama.cpp and the ggml tensor library, and one of the most consequential figures in the local-AI movement. His work made it practical to run large language models on consumer hardware (laptops, phones, and even microcontrollers) by combining aggressive performance optimization with pragmatic quantization and a permissive open-source ethos.
The GGUF file format he co-designed has become a de facto standard for distributing open-weight models across the local-inference ecosystem. He runs ggml.ai and remains the lead maintainer of llama.cpp, which underpins tools such as Ollama and LM Studio as well as many downstream wrappers.
A physicist by training, he became widely known in 2023 through the rapid rise of llama.cpp and has since been one of the most prominent open-source contributors in the AI space.
## Quotes
<!-- QueryToSerialize: LIST FROM #type/quote AND [[Georgi Gerganov]] WHERE public_note = true SORT file.name ASC -->
## Books
<!-- QueryToSerialize: LIST FROM #type/book AND [[Georgi Gerganov]] WHERE public_note = true SORT file.name ASC -->
## Related
- [[Artificial Intelligence (AI)]]
- [[Large Language Models (LLMs)]]
- [[Running AI Models Locally]]
- [[Ollama]]
- [[LM Studio]]
- [[AI Quantization]]
## References
- https://github.com/ggerganov
- https://x.com/ggerganov
- https://ggerganov.com/