Unsloth: Easily run and train models locally
Unsloth is a tool that lets users run and train AI models 100% offline on local Mac and Windows devices. It supports GGUF and Safetensors models with tool-calling, web search, and an OpenAI-compatible API. Users can train models without coding, compare models side by side, import documents to create datasets, and export models. Unsloth offers a free open-source version as well as paid Pro and Enterprise tiers.
Unsloth - Train and Run Models Locally
unsloth
ModelsBlogUnsloth Studio✨Docs
New
✨Introducing Unsloth Studio
Easily run & train models locally.
Join our DiscordStart for free
Latest News
Gemma 4 12B and QAT is here!Jun 5, 2026
Unsloth joins PyTorch ecosystemMay 11, 2026
Unsloth API endpointMay 5, 2026
Qwen3.6 is out now!Apr 22, 2026
View more news
Run models locally
Unsloth Studio runs 100% offline on your Mac and Windows device. Run GGUF and Safetensors models with tool-calling, web search, and OpenAI compatible API.
Compare models side by side and upload images, docs, audio, code files and more.
Learn more
No-code training
Auto-create datasets from PDF, CSV, JSON docs and start training with real-time observability.
Unsloth's custom kernels supports optimized training for LoRA, FP8, FFT, PT and 500+ models including text, vision, audio and embeddings.
QuickstartLearn more
Model Arena
Chat with and compare 2 different models, such as a base model and a fine-tuned one, to see how their outputs differ.
Just load your first GGUF/model, then the second, and voilà!
Learn more
Data Recipes
Data Recipes transforms your docs into useable datasets via graph-node workflow. Upload unstructured or structured files like PDFs, CSV and JSON. Unsloth Data Recipes auto turns documents into your desired formats.
QuickstartLearn more
Export models
Export any model, including your fine-tuned models, to safetensors, or GGUF for use with llama.cpp, vLLM, Ollama, and more.
Learn more
Don’t believe us?
Why not try our fully free open source version? Finetune 2X faster on a single NVIDIA GPU for free on Google Colab or Kaggle Notebooks.
Get access now
Sign up to our newsletter
We'll share monthly updates!
Subscribe now
Train your own custom model in 24 hrs, not 30 days.
30x faster than FA2 + 30% accuracy
90% less memory usage than FA2
audio, embedding, vision support
The details
We're making AI more accessible to everyone
Find out more
Unsloth makes everything greener
As hardware costs rise and performance gains plateau, we use our math and coding skills to make models train and run smarter + faster.
Want lightning fast inference? We’re working on it!
Contact us
Don't forget to join our newsletter!
By registering you agree to unsloth's Terms of Service and Privacy Policy,
Subscribe now
MultiGPU Docs
Even better multiGPU in the works!
Don't forget to join our newsletter!
Subscribe
Pricing
Free
Freeware of our standard version of unsloth
Get started
Open-source
Supports Mistral, Gemma
Supports LLama 1, 2, 3
MultiGPU - coming soon
Supports 4 bit, 16 bit LoRA
unsloth Pro
2.5x faster training + 20% less VRAM
Contact us
2.5x number of GPUs faster than FA2
20% less memory than OSS
Enhanced MultiGPU support
Up to 8 GPUS support
For any usecase
unsloth Enterprise
Unlock 30x faster training + multi-node support + 30% accuracy
Contact us
32x number of GPUs faster than FA2
up to +30% accuracy
5x faster inference
Supports full training
All Pro plan features
Multi-node support
Customer support
Ready to use unsloth?
Get started for free
Company
About📰 NewsletterPrivacy PolicyTerms of Service
Product
Introduction🐋 DockerDownloadDocumentation🦥 Models
Community
Twitter (X)
Hugging Face
Discord
unsloth
[email protected]
© 2026 unsloth. All rights reserved.
Join Our Discord