Get deployment metrics
Returns per-node GPU, CPU, and memory utilization for the trainer pods, together with the request rate, concurrency, and latency reported by the Knative queue-proxy. The sampler half of a Loops deployment is an OracleVersion, so its metrics come from the existing model-metrics endpoint rather than this one.
Authorizations
Pass your Baseten API key. Clients automatically send Authorization: Bearer <key>. Direct callers can also use Authorization: Api-Key <key>; both schemes are accepted.
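As a rough illustration of the two accepted schemes, the sketch below builds the Authorization header for either one. The helper name auth_header and its scheme argument are hypothetical and not part of any Baseten client; only the header formats come from the text above.

```python
def auth_header(api_key: str, scheme: str = "Bearer") -> dict[str, str]:
    """Build an Authorization header for one of the two documented schemes.

    `auth_header` is a hypothetical helper written for this example.
    """
    if scheme not in ("Bearer", "Api-Key"):
        raise ValueError(f"unsupported scheme: {scheme!r}")
    if not api_key:
        raise ValueError("api_key must be a non-empty string")
    # Both forms are accepted: "Bearer <key>" and "Api-Key <key>".
    return {"Authorization": f"{scheme} {api_key}"}


if __name__ == "__main__":
    print(auth_header("YOUR_API_KEY"))
    print(auth_header("YOUR_API_KEY", scheme="Api-Key"))
```

Either dictionary can be passed as the headers of whatever HTTP client you already use.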
Path Parameters
Body
Time-range request for trainer deployment metrics.
End of the time range to fetch metrics for, in milliseconds since the Unix epoch.
Start of the time range to fetch metrics for, in milliseconds since the Unix epoch.
Resolution of the returned series, in seconds. When omitted, a step is derived from the time range so large windows return fewer points.
Unit of time for request-volume metrics, in seconds (for example, 60 for requests per minute). Defaults to requests per second.
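The sketch below shows one way to assemble such a time-range body in Python. The field names in it (start_epoch_millis, end_epoch_millis, step_seconds, rate_unit_seconds) are placeholders invented for this example, since this page does not list the real parameter names; substitute the names from the endpoint schema before sending anything.

```python
from datetime import datetime, timedelta, timezone

EPOCH = datetime(1970, 1, 1, tzinfo=timezone.utc)


def to_epoch_millis(dt: datetime) -> int:
    """Convert a timezone-aware datetime to integer epoch milliseconds."""
    if dt.tzinfo is None:
        raise ValueError("datetime must be timezone-aware")
    # Timedelta floor division avoids float rounding.
    return (dt - EPOCH) // timedelta(milliseconds=1)


def build_metrics_body(start, end, step_seconds=None, rate_unit_seconds=None):
    """Assemble a time-range body; all field names here are hypothetical."""
    if start >= end:
        raise ValueError("start must be earlier than end")
    body = {
        "start_epoch_millis": to_epoch_millis(start),  # assumed name
        "end_epoch_millis": to_epoch_millis(end),      # assumed name
    }
    # Leave optional fields out so the documented defaults apply:
    # a derived step, and per-second request rates.
    if step_seconds is not None:
        body["step_seconds"] = step_seconds            # assumed name
    if rate_unit_seconds is not None:
        body["rate_unit_seconds"] = rate_unit_seconds  # assumed name
    return body


if __name__ == "__main__":
    end = datetime.now(timezone.utc)
    start = end - timedelta(hours=1)
    # Last hour of metrics, with request volume expressed per minute.
    print(build_metrics_body(start, end, rate_unit_seconds=60))
```

Omitting the resolution field, as in the usage above, leaves the step to be derived from the time range, which the description says keeps large windows from returning too many points.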