Skip to main content

Documentation Index

Fetch the complete documentation index at: https://docs.tokenfactory.nebius.com/llms.txt

Use this file to discover all available pages before exploring further.

Dedicated Endpoints expose two separate status types:
  1. Lifecycle - What stage the deployment is in
  2. Readiness - Whether the endpoint can currently serve traffic
This distinction helps separate deployment progress from actual availability.

Lifecycle Status

Starting

Endpoint is being created or started.
Provisioning is in progress, and traffic may not be available yet.

Updating

Endpoint is applying configuration changes.
Traffic may continue, but capacity can temporarily degrade.

Running

Endpoint is deployed and expected to operate normally.

Error

A deployment issue requires attention.
The endpoint may still serve traffic depending on readiness.
If unresolved for more than 3 hours, contact support.

Stopping

Stop was requested, but shutdown is still in progress.

Stopped

Endpoint is intentionally disabled.

Readiness Status

Not ready

Endpoint cannot reliably serve requests. Expected during: Starting, Stopping, or Stopped

Partially ready

Endpoint can serve traffic, but below expected capacity.
Some replicas are still starting, updating, or unavailable.
If unresolved for more than 3 hours, contact support.

Ready

Endpoint is fully provisioned and ready for expected traffic.

Common Status Examples

LifecycleReadinessMeaning
StartingNot readyProvisioning
StartingPartially readyServing, still scaling
UpdatingReadyServing normally during update
RunningReadyFully operational
RunningPartially readyDegraded capacity
RunningNot readyUnexpected outage
ErrorReadyServing, but deployment issue exists
ErrorNot readyDeployment failure
StoppingPartially readyShutting down
StoppedNot readyFully stopped
Key PrincipleLifecycle is deployment state. Readiness is traffic-serving capability.Use Readiness to decide whether to send traffic.