FreeInference Documentation
Free LLM inference for coding agents and IDEs
FreeInference provides free access to state-of-the-art language models specifically designed for coding agents like Cursor, Codex, Roo Code, and other AI-powered IDEs.
Quick Links
Quick Start - Get started in 5 minutes
IDE & Coding Agent Integrations - Configure with Cursor, Codex, and other coding agents
Available Models - View available models
Key Features
- Free Access
Free inference for coding agents and development tools
- Multiple Models
Access Qwen, GLM, DeepSeek, and other powerful models
- IDE Integration
Easy setup with Cursor, Codex, Roo Code, Kilo Code, and more
Getting Started
Get your API key - Register at https://freeinference.org and create your API key
Choose your IDE:
Cursor - AI-powered code editor
Codex - Terminal-based coding assistant
Roo Code / Kilo Code - VS Code extensions
Configure and start coding!
See the Quick Start guide for detailed setup instructions.
Available Models
Model |
Context Length |
Best For |
|---|---|---|
GLM-4.6 |
200K tokens |
Long context, bilingual |
MiniMax M2 |
196K tokens |
Very large codebases |
Llama 3.3 70B |
131K tokens |
General coding tasks |
Llama 4 Maverick |
128K tokens |
Multimodal support |
DeepSeek R1 |
64K tokens |
Complex reasoning |
Qwen3 Coder 30B |
32K tokens |
Code generation |
See the complete Available Models list for all available models.
Support
Need help? Check out:
IDE & Coding Agent Integrations - IDE setup guides
Available Models - Available models
GitHub Issues - Report bugs or request features