FreeInference Documentation

Free LLM inference for coding agents and IDEs

FreeInference provides free access to state-of-the-art language models specifically designed for coding agents like Cursor, Codex, Roo Code, and other AI-powered IDEs.

Key Features

Free Access

Free inference for coding agents and development tools

Multiple Models

Access Qwen, GLM, DeepSeek, and other powerful models

IDE Integration

Easy setup with Cursor, Codex, Roo Code, Kilo Code, and more

Getting Started

  1. Get your API key - Register at https://freeinference.org and create your API key

  2. Choose your IDE:

  3. Configure and start coding!

See the Quick Start guide for detailed setup instructions.

Available Models

Model

Context Length

Best For

GLM-4.6

200K tokens

Long context, bilingual

MiniMax M2

196K tokens

Very large codebases

Llama 3.3 70B

131K tokens

General coding tasks

Llama 4 Maverick

128K tokens

Multimodal support

DeepSeek R1

64K tokens

Complex reasoning

Qwen3 Coder 30B

32K tokens

Code generation

See the complete Available Models list for all available models.

Support

Need help? Check out: