# Nebius Token Factory documentation > Comprehensive Nebius Token Factory documentation: quickstart, API reference, model inference guides, fine-tuning, integrations & examples for seamless LLM deployment. ## Docs - [Dedicated Endpoints](https://docs.tokenfactory.nebius.com/ai-models-inference/dedicated-endpoints.md) - [Dedicated Endpoints Overview](https://docs.tokenfactory.nebius.com/ai-models-inference/dedicated-endpoints-overview.md) - [Function calling & Tools](https://docs.tokenfactory.nebius.com/ai-models-inference/function-calling.md): Learn how to make models select functions to extend their capabilities - [Structured output & JSON](https://docs.tokenfactory.nebius.com/ai-models-inference/json.md): Learn how to get structured output from our text models - [Inference Observability](https://docs.tokenfactory.nebius.com/ai-models-inference/observability.md) - [Observability API integrations](https://docs.tokenfactory.nebius.com/ai-models-inference/observability-api-integrations.md) - [Overview](https://docs.tokenfactory.nebius.com/ai-models-inference/overview.md): Get an inference engine overview - [UI & Playground](https://docs.tokenfactory.nebius.com/ai-models-inference/playground.md) - [Rate Limits & Scaling](https://docs.tokenfactory.nebius.com/ai-models-inference/rate-limits.md): Learn how rate limiting of requests works and what are the scaling rules - [Cancel a multipart upload](https://docs.tokenfactory.nebius.com/api-reference/datasets/cancel-a-multipart-upload.md): Cancel a multipart upload - [Complete a multipart upload, create a new dataset](https://docs.tokenfactory.nebius.com/api-reference/datasets/complete-a-multipart-upload-create-a-new-dataset.md): Complete a multipart upload - [Create a dataset by uploading data](https://docs.tokenfactory.nebius.com/api-reference/datasets/create-a-dataset-by-uploading-data.md): Create a dataset - [Create a new multipart upload for a dataset](https://docs.tokenfactory.nebius.com/api-reference/datasets/create-a-new-multipart-upload-for-a-dataset.md): Create a new upload - [Delete a dataset](https://docs.tokenfactory.nebius.com/api-reference/datasets/delete-a-dataset.md): Delete a dataset - [Filter operations by attributes](https://docs.tokenfactory.nebius.com/api-reference/datasets/filter-operations-by-attributes.md): List operations - [Get a list of datasets in project](https://docs.tokenfactory.nebius.com/api-reference/datasets/get-a-list-of-datasets-in-project.md): List datasets - [Get dataset info](https://docs.tokenfactory.nebius.com/api-reference/datasets/get-dataset-info.md): Get dataset info - [Get dataset query template](https://docs.tokenfactory.nebius.com/api-reference/datasets/get-dataset-query-template.md): Get dataset query template - [Get information about a multipart upload](https://docs.tokenfactory.nebius.com/api-reference/datasets/get-information-about-a-multipart-upload.md): Get upload information - [Get operation errors](https://docs.tokenfactory.nebius.com/api-reference/datasets/get-operation-errors.md): Get errors for a given failed operation. - [Get operation info by id](https://docs.tokenfactory.nebius.com/api-reference/datasets/get-operation-info-by-id.md): Get operation info - [Get operation results](https://docs.tokenfactory.nebius.com/api-reference/datasets/get-operation-results.md): Get checkpoints for a fine-tuning operation or the output file id for a dataset-to-file operation. - [List parts of a multipart upload](https://docs.tokenfactory.nebius.com/api-reference/datasets/list-parts-of-a-multipart-upload.md): List upload parts - [Patch dataset info](https://docs.tokenfactory.nebius.com/api-reference/datasets/patch-dataset-info.md): Partially update dataset info - [Read dataset content](https://docs.tokenfactory.nebius.com/api-reference/datasets/read-dataset-content.md): Get dataset content - [Read dataset content as csv or jsonl](https://docs.tokenfactory.nebius.com/api-reference/datasets/read-dataset-content-as-csv-or-jsonl.md): Export dataset content - [Run operation](https://docs.tokenfactory.nebius.com/api-reference/datasets/run-operation.md): Create an operation - [Stop operation by id](https://docs.tokenfactory.nebius.com/api-reference/datasets/stop-operation-by-id.md): Cancel operation - [Upload a new part for a multipart upload](https://docs.tokenfactory.nebius.com/api-reference/datasets/upload-a-new-part-for-a-multipart-upload.md): Upload a new part - [Create embeddings](https://docs.tokenfactory.nebius.com/api-reference/examples/create-embeddings.md) - [Files](https://docs.tokenfactory.nebius.com/api-reference/examples/files.md) - [Image generation](https://docs.tokenfactory.nebius.com/api-reference/examples/image-generation.md) - [List of models](https://docs.tokenfactory.nebius.com/api-reference/examples/list-of-models.md) - [Overview](https://docs.tokenfactory.nebius.com/api-reference/examples/overview.md): Nebius Token Factory offers an OpenAI-compatible API for inference and fine-tuning. - [Text generation](https://docs.tokenfactory.nebius.com/api-reference/examples/text-generation.md) - [Text generation for code autocomplete](https://docs.tokenfactory.nebius.com/api-reference/examples/text-generation-for-code-autocomplete.md) - [Vision capabilities](https://docs.tokenfactory.nebius.com/api-reference/examples/vision-capabilities.md) - [Delete file](https://docs.tokenfactory.nebius.com/api-reference/files/delete-file.md) - [List user files](https://docs.tokenfactory.nebius.com/api-reference/files/list-user-files.md) - [Retrieve file](https://docs.tokenfactory.nebius.com/api-reference/files/retrieve-file.md) - [Retrieve file content](https://docs.tokenfactory.nebius.com/api-reference/files/retrieve-file-content.md) - [Returns URL with link to download file content](https://docs.tokenfactory.nebius.com/api-reference/files/returns-url-with-link-to-download-file-content.md) - [Upload custom model archive](https://docs.tokenfactory.nebius.com/api-reference/files/upload-custom-model-archive.md) - [Upload file](https://docs.tokenfactory.nebius.com/api-reference/files/upload-file.md) - [Cancel fine-tuning job](https://docs.tokenfactory.nebius.com/api-reference/fine-tuning/cancel-fine-tuning-job.md): Immediately cancel a fine-tuning job. - [Create a fine-tuning job](https://docs.tokenfactory.nebius.com/api-reference/fine-tuning/create-a-fine-tuning-job.md): Creates a job that fine-tunes a specified model based on a given dataset. - [Get fine-tuning checkpoint](https://docs.tokenfactory.nebius.com/api-reference/fine-tuning/get-fine-tuning-checkpoint.md): Get details about a specific checkpoint from a fine-tuning job. - [Get fine-tuning job info](https://docs.tokenfactory.nebius.com/api-reference/fine-tuning/get-fine-tuning-job-info.md): Get info about a fine-tuning job. - [List fine-tuning checkpoints](https://docs.tokenfactory.nebius.com/api-reference/fine-tuning/list-fine-tuning-checkpoints.md): Get training checkpoints for a fine-tuning job. - [List fine-tuning events](https://docs.tokenfactory.nebius.com/api-reference/fine-tuning/list-fine-tuning-events.md): Get status updates for a fine-tuning job. - [List fine-tuning jobs](https://docs.tokenfactory.nebius.com/api-reference/fine-tuning/list-fine-tuning-jobs.md): Lists all fine-tuning jobs. - [Create a response](https://docs.tokenfactory.nebius.com/api-reference/inference/create-a-response.md): Creates a model response for a given input - [Create chat completion](https://docs.tokenfactory.nebius.com/api-reference/inference/create-chat-completion.md): Creates a model response for the given chat conversation. - [Create completion](https://docs.tokenfactory.nebius.com/api-reference/inference/create-completion.md): Creates a model completion for the given input prompt. - [Create embeddings](https://docs.tokenfactory.nebius.com/api-reference/inference/create-embeddings.md): Creates a model response for the given text. - [Generate](https://docs.tokenfactory.nebius.com/api-reference/inference/generate.md) - [Rerank documents](https://docs.tokenfactory.nebius.com/api-reference/inference/rerank-documents.md): Reranks documents based on their relevance to a query. - [Introduction](https://docs.tokenfactory.nebius.com/api-reference/introduction.md) - [Create a custom model](https://docs.tokenfactory.nebius.com/api-reference/models/create-a-custom-model.md) - [Delete a model by full name](https://docs.tokenfactory.nebius.com/api-reference/models/delete-a-model-by-full-name.md) - [Delete a model by short name](https://docs.tokenfactory.nebius.com/api-reference/models/delete-a-model-by-short-name.md) - [Get a custom model by full name](https://docs.tokenfactory.nebius.com/api-reference/models/get-a-custom-model-by-full-name.md): Get a custom model model by full name - [Get a custom model by short name](https://docs.tokenfactory.nebius.com/api-reference/models/get-a-custom-model-by-short-name.md): Get a custom model model by short name - [List models](https://docs.tokenfactory.nebius.com/api-reference/models/list-models.md): Lists the currently available models, and provides basic information about each one such as the owner and availability. - [List of custom models](https://docs.tokenfactory.nebius.com/api-reference/models/list-of-custom-models.md): Lists the custom models - [Nebius Token Factory Cookbook](https://docs.tokenfactory.nebius.com/cookbook/overview.md): Examples and recipes to use with Token Factory - [Import Chat Completions](https://docs.tokenfactory.nebius.com/data-lab/chat-completions.md) - [Data Processing](https://docs.tokenfactory.nebius.com/data-lab/data-processing.md) - [Fine-tuning](https://docs.tokenfactory.nebius.com/data-lab/fine-tuning.md) - [Overview](https://docs.tokenfactory.nebius.com/data-lab/overview.md) - [Overview](https://docs.tokenfactory.nebius.com/integrations/overview.md): Nebius Token Factory third-party integrations for inference - [Data Processing Agreement](https://docs.tokenfactory.nebius.com/legal/dpa.md) - [Acceptable use policy (AUP) FluxDev model](https://docs.tokenfactory.nebius.com/legal/flux-dev.md) - [HIPAA & BAA Support for Nebius Token Factory](https://docs.tokenfactory.nebius.com/legal/hipaa-guideline.md) - [Legal Quick Guide](https://docs.tokenfactory.nebius.com/legal/legal-quick-guide.md) - [Privacy Policy](https://docs.tokenfactory.nebius.com/legal/privacy-policy.md) - [List of Sub-Processors](https://docs.tokenfactory.nebius.com/legal/subprocessors.md) - [Terms of Service](https://docs.tokenfactory.nebius.com/legal/terms-of-service.md) - [Billing & Consumption](https://docs.tokenfactory.nebius.com/other-capabilities/billing-new.md) - [Enterprise](https://docs.tokenfactory.nebius.com/other-capabilities/enterprise.md): Deploy and scale Llama, Qwen, DeepSeek, Flux and more on dedicated infrastructure with guaranteed uptime, zero-retention data flow and usage-based pricing, with both dedicated infrastructure and flexible options available to suit customer needs — no GPU wrangling required. - [ Dataset formats for fine-tuning](https://docs.tokenfactory.nebius.com/post-training/datasets.md) - [How to fine-tune your custom model](https://docs.tokenfactory.nebius.com/post-training/how-to-fine-tune.md): Fine-tune a base model on your own data using Nebius Token Factory. This guide walks you through creating a job, monitoring it, and retrieving checkpoints via API and Python. - [Merging LoRA Adapters](https://docs.tokenfactory.nebius.com/post-training/merge-moe-lora-weights.md): When you fine-tune a model with LoRA, inference usually requires loading both the base model and the LoRA adapter. Here we provide a merge script that bakes the LoRA adapters directly into the base weights and produces a standard checkpoint directory. - [Models for fine-tuning in Nebius Token Factory](https://docs.tokenfactory.nebius.com/post-training/models.md): Supported base models for fine-tuning in Nebius Token Factory, with available context lengths and fine-tuning types (LoRA and full fine-tuning), grouped by provider. - [Overview](https://docs.tokenfactory.nebius.com/post-training/overview.md): Post-training on Nebius Token Factory - [Quickstart](https://docs.tokenfactory.nebius.com/quickstart.md): Welcome to Nebius Token Factory - [Switch to Token Factory](https://docs.tokenfactory.nebius.com/switch.md) - [Groups & Access management](https://docs.tokenfactory.nebius.com/team-access/groups.md): Learn about access management on different levels - [User Invitations](https://docs.tokenfactory.nebius.com/team-access/invitations.md): Learn how to invite new users to Organization and add users to Projects - [Organizations and Projects](https://docs.tokenfactory.nebius.com/team-access/org-projects.md): Learn how to work with Organizations and Projects in Nebius Token Factory - [Overview](https://docs.tokenfactory.nebius.com/team-access/overview.md): Collaborate securely with Team Management & Role-Based Access - [Configure Single Sign-On](https://docs.tokenfactory.nebius.com/team-access/sso.md): Learn how to configure single sign-on - [Prompt presets](https://docs.tokenfactory.nebius.com/utilities/prompt-presets.md): Learn how to save your prompts and reuse them ## OpenAPI Specs - [openapi](https://api.tokenfactory.nebius.com/openapi.json) ## Optional - [LinkedIn](https://www.linkedin.com/company/nebius/) - [Discord Community](https://discord.com/invite/WJ2DUQRz4m) - [X](https://x.com/nebiustf)