Advanced
Architecture
How a request travels through Bodhi App: from the wire to the inference engine and back
Security Model
What Bodhi App protects, what it relies on the deployment to provide, and how to harden a self-hosted installation
Inference Stack
How Bodhi App invokes llama.cpp: variants, GGUF resolution, runtime arguments, and the keep-alive timer
Performance Tuning
Choosing variants, quantization, context window, and concurrency to match Bodhi App to your hardware
Observability
Logs, settings introspection, the background queue, and what is honest about today’s observability gaps in Bodhi App