Bodhi App - Your Unified AI Gateway

Advanced

Architecture

How a request travels through Bodhi App: from the wire to the inference engine and back

Security Model

What Bodhi App protects, what it relies on the deployment to provide, and how to harden a self-hosted installation

Inference Stack

How Bodhi App invokes llama.cpp: variants, GGUF resolution, runtime arguments, and the keep-alive timer

Performance Tuning

Choosing variants, quantization, context window, and concurrency to match Bodhi App to your hardware

Observability

Logs, settings introspection, the background queue, and an honest account of today’s observability gaps in Bodhi App