About Deep Infra
Deep Infra offers cost-effective, scalable, easy-to-deploy, production-ready infrastructure for running machine-learning and deep-learning models. Its platform runs top AI models through a simple API, with pay-per-use pricing and low-latency inference. Users can deploy custom LLMs on dedicated GPUs and access models for text generation, text-to-speech, text-to-image, and automatic speech recognition.
Top use cases
- Running text generation models like Llama and Qwen
- Generating speech from text using models like Kokoro and Dia
- Creating images from text prompts using Stable Diffusion and FLUX models
- Transcribing audio using Whisper for automatic speech recognition
- Deploying custom large language models on dedicated GPUs
Key features
- Fast ML inference with a simple API
- Scalable and production-ready infrastructure
- Pay-per-use pricing
- Support for various ML model types (text generation, text-to-speech, text-to-image, ASR)
- Custom LLM deployment on dedicated GPUs
- Auto-scaling
Pros & cons
Pros
- Cost-effective pay-per-use pricing
- Scalable infrastructure
- Easy deployment process
- Low-latency inference
- Wide range of supported models
- Dedicated GPUs for custom LLMs
Cons
- Requires adding a card or pre-paying to use services
- Accounts are subject to usage tiers with invoicing thresholds
- Concurrent requests are limited to 200 per account
- Billing varies by model: some are billed by inference execution time, others per token
Company information
- Support and customer service email: [email protected]
- Company name: Deep Infra
- About us: https://deepinfra.com/about_us
- Login: https://deepinfra.com/login?from=%2Fdash
- Pricing: https://deepinfra.com/pricing
- LinkedIn: https://linkedin.com/company/deep-infra
- Twitter: https://twitter.com/DeepInfra
- GitHub: https://github.com/DeepInfra
Frequently asked questions
What pricing models does Deep Infra offer?
Deep Infra offers per-token pricing for some language models and pricing based on inference execution time for most other models. There are no long-term contracts or upfront costs.
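To make the two billing modes concrete, here is a minimal cost-estimation sketch. The class names and every rate in it are hypothetical placeholders chosen for easy arithmetic, not Deep Infra's actual prices; check the pricing page for real figures.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class TokenPricing:
    """Per-token billing. The rate is a hypothetical USD price per million tokens."""
    usd_per_million_tokens: float

    def cost(self, tokens: int) -> float:
        return tokens / 1_000_000 * self.usd_per_million_tokens


@dataclass(frozen=True)
class TimePricing:
    """Execution-time billing. The rate is a hypothetical USD price per second."""
    usd_per_second: float

    def cost(self, seconds: float) -> float:
        return seconds * self.usd_per_second


# Hypothetical rates, for illustration only.
llm = TokenPricing(usd_per_million_tokens=0.50)
asr = TimePricing(usd_per_second=0.0005)

llm_cost = llm.cost(tokens=2_000_000)  # a text-generation workload
asr_cost = asr.cost(seconds=600)       # ten minutes of inference time

print(f"LLM: ${llm_cost:.2f}, ASR: ${asr_cost:.2f}")
```

Keeping the two modes behind the same `cost` method lets a budgeting script treat every model uniformly, whichever way it is billed.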
What GPUs are used to run the models?
All models run on H100 or A100 GPUs, optimized for inference performance and low latency.
How does auto-scaling work?
The system automatically scales the model to more hardware based on your needs. Each account is limited to 200 concurrent requests.
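Because of the 200-request cap, a client that fans out many calls may want to throttle itself rather than rely on the service to reject the excess. The sketch below shows one common way to do that with an asyncio semaphore. The `call_model` coroutine is a hypothetical stand-in for a real inference request, and how the service responds when the cap is exceeded is not stated here.

```python
import asyncio

MAX_CONCURRENT = 200  # per-account limit stated in the FAQ above


async def call_model(i: int) -> int:
    """Hypothetical stand-in for a real inference request."""
    await asyncio.sleep(0.001)
    return i


async def run_all(n_requests: int, limit: int = MAX_CONCURRENT) -> tuple[list[int], int]:
    """Run n_requests calls with at most `limit` in flight; return results and the observed peak."""
    semaphore = asyncio.Semaphore(limit)
    in_flight = 0
    peak = 0

    async def guarded(i: int) -> int:
        nonlocal in_flight, peak
        async with semaphore:  # waits here once `limit` calls are already running
            in_flight += 1
            peak = max(peak, in_flight)
            try:
                return await call_model(i)
            finally:
                in_flight -= 1

    results = await asyncio.gather(*(guarded(i) for i in range(n_requests)))
    return list(results), peak


if __name__ == "__main__":
    results, peak = asyncio.run(run_all(500))
    print(len(results), "requests completed; peak concurrency was", peak)
```

Tracking the peak is only there to make the limit observable; in real use the semaphore alone is enough.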
What are the usage tiers?
Every user is part of a usage tier. As usage and spending increase, users are automatically moved to the next tier, each with an invoicing threshold.
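As a rough illustration of how spend-based tiers can work, the sketch below maps cumulative spend to a tier and its invoicing threshold. The tier boundaries, threshold amounts, and the `tier_for` helper are all hypothetical; the actual tiers and the rules for moving between them are defined by Deep Infra.

```python
from bisect import bisect_right

# Hypothetical tiers: (minimum cumulative spend in USD, invoicing threshold in USD).
TIERS = [
    (0, 20),
    (100, 100),
    (500, 500),
]
TIER_FLOORS = [floor for floor, _ in TIERS]


def tier_for(spend_usd: float) -> tuple[int, int]:
    """Return (tier number, invoicing threshold) for a cumulative spend."""
    if spend_usd < 0:
        raise ValueError("spend cannot be negative")
    # Index of the highest tier whose floor is at or below the spend.
    idx = bisect_right(TIER_FLOORS, spend_usd) - 1
    return idx + 1, TIERS[idx][1]


for spend in (0, 99.99, 100, 10_000):
    tier, threshold = tier_for(spend)
    print(f"spend ${spend}: tier {tier}, invoiced at ${threshold}")
```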
Can I deploy my own custom LLMs?
Yes. You can deploy your own model on Deep Infra's hardware and pay for uptime. Deployments get dedicated SXM-connected GPUs and automatic scaling.