From$0.115per 1M tokens
Only pay for what you use. Deploy models 70% cheaper than alternatives with our J-FACTOR compression.
Pay for what you use
No monthly commitments. Start with 10GB free inference every month.
Free
Perfect for prototyping and small projects
Always free
Pro
For production apps and growing teams
After 10GB free tier
Team
For teams needing dedicated resources
Volume discounts
Start Building in 2 Minutes
Get 10GB free monthly inference. No credit card required.
Developer Questions
How does the free tier work?
You get 10GB of free inference every month across all models. That's approximately 8.7M tokens based on average usage. No credit card required to start.
When do I start paying?
You only pay when you exceed the 10GB free monthly limit. We'll show you real-time usage in your dashboard and send alerts before you hit the limit.
How is billing calculated?
We charge per token used (input + output). For example, 1M input tokens + 500K output tokens = 1.5M tokens billed at $0.115 per 1M tokens = $0.1725.
Can I use multiple models?
Yes! All plans include access to every model in our catalog. You only pay for the tokens you use, regardless of which models you deploy.
What's JOBIM-JFactor compression?
Our proprietary compression technology that reduces model size by 70% while maintaining 99% of original performance. This lets us offer much lower prices than competitors.
How do I get started?
Just sign up and get your API key. You can start making requests immediately with 10GB free monthly inference. Check our documentation for code examples in Python, JavaScript, and more.