Endpoints

Full surface; see the OpenAPI spec for shapes.

MethodPathDescription
POST/v1/chat/completionsOpenAI Chat Completions (streaming default)
POST/v1/completionsLegacy alias for /v1/chat/completions
GET/v1/modelsList models the caller is allowed to use
POST/anthropic/v1/messagesAnthropic Messages drop-in (translates to / from OpenAI)
GET/qgre/v1/usage_per_keyIn-memory billing-of-record table
GET/qgre/v1/latencyPer-route P50/P95/P99 latency
GET/qgre/v1/upstreamsUpstream pool snapshot (health, in_flight, …)
GET/healthzLoad-balancer probe (no auth)
GET/metricsPrometheus exposition (no auth, scrape from inside the trust boundary)
GET/openapi.yamlOpenAPI 3.1 spec

Full request / response schemas live in the API reference (rendered from the OpenAPI YAML).