> ## Documentation Index
> Fetch the complete documentation index at: https://docs.tokenfactory.nebius.com/llms.txt
> Use this file to discover all available pages before exploring further.

# Quickstart: Deploy in UI

> Dedicated endpoints can be deployed and managed through the Token Factory UI, with API workflows also available for automation.

You can create a dedicated endpoint from either of these UI locations:

* Explore page: [https://tokenfactory.nebius.com/](https://tokenfactory.nebius.com/)
* Inference → Model Endpoints: [https://tokenfactory.nebius.com/models](https://tokenfactory.nebius.com/models)

<Columns cols={2}>
  <Column>
    <Frame>
      <img src="https://mintcdn.com/nebius-723e8b65/GxPYlmfqUngeWq2z/images/image-1.png?fit=max&auto=format&n=GxPYlmfqUngeWq2z&q=85&s=a48f9c1d5725cd3bf7aaf5beaa57c557" alt="Image" width="3832" height="1912" data-path="images/image-1.png" />
    </Frame>
  </Column>

  <Column>
    <Frame>
      <img src="https://mintcdn.com/nebius-723e8b65/GxPYlmfqUngeWq2z/images/image-3.png?fit=max&auto=format&n=GxPYlmfqUngeWq2z&q=85&s=4a3d66ac56c2e26d16ecaa9759abe9aa" alt="Image" width="3824" height="1480" data-path="images/image-3.png" />
    </Frame>
  </Column>
</Columns>

1. Explore page [https://tokenfactory.nebius.com/](https://tokenfactory.nebius.com/)
2. Inference/Model Endpoints [https://tokenfactory.nebius.com/models](https://tokenfactory.nebius.com/models)

From there, select a supported model template and complete the deployment configuration, including region, GPU configuration, and autoscaling settings.

<Columns cols={3}>
  <Column>
    <Frame>
      <img src="https://mintcdn.com/nebius-723e8b65/GxPYlmfqUngeWq2z/images/image-6.png?fit=max&auto=format&n=GxPYlmfqUngeWq2z&q=85&s=db90c3011e3d2e25426d29a8b66fd10e" alt="Image" width="1230" height="1550" data-path="images/image-6.png" />
    </Frame>
  </Column>

  <Column>
    <Frame>
      <img src="https://mintcdn.com/nebius-723e8b65/GxPYlmfqUngeWq2z/images/image-5.png?fit=max&auto=format&n=GxPYlmfqUngeWq2z&q=85&s=6d69195993767bdf004d1f30ff4fc467" alt="Image" width="1268" height="1288" data-path="images/image-5.png" />
    </Frame>
  </Column>

  <Column>
    <Frame>
      <img src="https://mintcdn.com/nebius-723e8b65/GxPYlmfqUngeWq2z/images/image-4.png?fit=max&auto=format&n=GxPYlmfqUngeWq2z&q=85&s=eb8ec36b7cfc66056e87106a61be47eb" alt="Image" width="1468" height="1658" data-path="images/image-4.png" />
    </Frame>
  </Column>
</Columns>

## Walkthrough on UI deployment

<iframe src="https://www.youtube.com/embed/5NsvfFjShaY" title="YouTube video player" frameborder="0" className="w-full aspect-video rounded-xl" allow="accelerometer; autoplay; clipboard-write; encrypted-media; gyroscope; picture-in-picture; web-share" allowfullscreen />

<Note>
  **Prefere automation?**\
  For API-based deployment, see [Deploy in API ](/ai-models-inference/dedicated-endpoints/deploy-api)section
</Note>

## Using endpoint

Go to **Inference → Model Endpoints** and open your private endpoint card.

<Frame>
  <img src="https://mintcdn.com/nebius-723e8b65/GxPYlmfqUngeWq2z/images/image-7.png?fit=max&auto=format&n=GxPYlmfqUngeWq2z&q=85&s=9f1704ff232a062173f550e2a740ed8a" alt="Image" width="3840" height="1516" data-path="images/image-7.png" />
</Frame>

There, you can view key deployment details, including:

* Endpoint ID
* Routing key
* Model
* GPUs per replica
* Minimum and maximum replicas
* Deployment status
* Ready-to-use code snippets

To update configuration, click **Edit Endpoint**.

<Columns cols={2}>
  <Column>
    <Frame>
      <img src="https://mintcdn.com/nebius-723e8b65/GxPYlmfqUngeWq2z/images/image-9.png?fit=max&auto=format&n=GxPYlmfqUngeWq2z&q=85&s=3f7c23ef7a91d732cb943b916e4a9839" alt="Image" width="2104" height="1910" data-path="images/image-9.png" />
    </Frame>
  </Column>

  <Column>
    <Frame>
      <img src="https://mintcdn.com/nebius-723e8b65/GxPYlmfqUngeWq2z/images/image-10.png?fit=max&auto=format&n=GxPYlmfqUngeWq2z&q=85&s=2efb6197bf2ebe59e1e6e6cf3c960e6a" alt="Image" title="Image" style={{ width:"56%" }} width="1106" height="1834" data-path="images/image-10.png" />
    </Frame>
  </Column>
</Columns>

To check observability metrics you can either:

1. Open dedicated endpoints model card and click Observability button below
2. Go to observability section and set filters to your enpoint: [https://tokenfactory.nebius.com/observability](https://tokenfactory.nebius.com/observability?monitoring_model=all\&monitoring_cloud_account_id=all\&monitoring_region=all)

Read more at [Observability section](/ai-models-inference/observability)

## Walkthrough on Operating Dedicated Endpoints

See **Walkthrough on Operating Dedicated Endpoints** for deployment management, scaling, and operational best practices

<iframe src="https://www.youtube.com/embed/4RlTHxu-CXo" title="YouTube video player" frameborder="0" className="w-full aspect-video rounded-xl" allow="accelerometer; autoplay; clipboard-write; encrypted-media; gyroscope; picture-in-picture; web-share" allowfullscreen />
