Upcoming Advanced post-training options including:
- Speculative Decoding optimization (private beta [sign up])
- Reinforcement Fine-tuning (limited professional service [request access])
You can currently run long-context supervised fine-tuning on full model weights or LoRA adapters.
How to Fine-tune
Step-by-step guide to preparing data, launching a job, and evaluating results.
Models
Supported base models for training and inference.
Datasets
Create and manage datasets for training and validation.
Deploy Custom LoRA
Serve LoRA-adapted models serverlessly with per-token billing.