Skip to content

AI Free Member Models

  1. June 2026

The following free models are available through the Gnoppix AI API. Each model includes its vendor and primary use case.

Vendor: Nex AGI
Use case: Agentic mixture-of-experts model (397B total, 17B active) built on Qwen3.5. Designed for coding, tool use, deep research, and long-horizon agentic workflows. Supports reasoning, function calling, and structured outputs. Accepts text and image input.

Vendor: OpenRouter
Use case: High-performance foundation model designed for agentic workloads. Natively supports tool use and long-context tasks (1M context). Strong in code generation, automated workflows, and complex instruction execution. Compatible with Claude Code and other productivity tools.

Vendor: NVIDIA
Use case: Content safety moderator (4B parameters, fine-tuned Gemma-3-4B-it). Evaluates prompts, images, and responses for safety. Supports custom policy enforcement with reasoning traces, multilingual moderation (12 languages), and multimodal inputs.

Vendor: NVIDIA
Use case: Large-scale reasoning model (550B total, 55B active). Designed for complex reasoning, analysis, and high-quality text generation across enterprise workloads.

nvidia/nemotron-3-nano-omni-30b-a3b-reasoning:free

Section titled “nvidia/nemotron-3-nano-omni-30b-a3b-reasoning:free”

Vendor: NVIDIA
Use case: Lightweight reasoning model (30B total, 3B active). Optimized for efficient inference with strong reasoning capabilities for resource-constrained environments.

Vendor: NVIDIA
Use case: General-purpose nano model (30B total, 3B active). Compact MoE model for efficient text generation and instruction following.

Vendor: NVIDIA
Use case: Large MoE model (120B total, 12B active). Designed for high-quality text generation, reasoning, and complex task completion.

Vendor: NVIDIA
Use case: Vision-language model (12B). Processes both text and images for multimodal understanding tasks.

Vendor: NVIDIA
Use case: Compact general-purpose model (9B). Efficient text generation and instruction following for lightweight deployments.

Vendor: Poolside
Use case: Efficient coding agent model (33B total, 3B active MoE). Second-generation open-weight model under Apache 2.0. Designed for agentic coding workflows with tool calling and reasoning. Runs on a single GPU.

Vendor: Poolside
Use case: Flagship coding agent model (225B total, 23B active MoE). Optimized for complex software engineering tasks. Supports tool calling and reasoning with 128K context. Quantized to fp8 for efficient inference.

Vendor: Google
Use case: Instruction-tuned model (26B total, 4B active). Lightweight MoE model for general-purpose chat, instruction following, and text generation tasks.

Vendor: Google
Use case: Instruction-tuned model (31B dense). General-purpose chat and instruction following with strong reasoning capabilities.

Vendor: Liquid AI
Use case: On-device reasoning model (1.2B). Optimized for math, logic, and multi-step problem-solving with chain-of-thought. Runs under 1GB memory — ideal for edge deployment.

Vendor: Liquid AI
Use case: Instruction-tuned model (1.2B). Designed for chat, instruction following, and tool calling on edge devices. Fast inference on CPU and mobile NPU.

Vendor: Alibaba Cloud (Qwen team)
Use case: Next-generation instruction model (80B total, 3B active MoE). General-purpose chat and instruction following with efficient inference.

Vendor: Alibaba Cloud (Qwen team)
Use case: Code-specialized model. Designed for code generation, debugging, and software engineering tasks.

Vendor: OpenAI
Use case: Open-weight reasoning model (117B total, 5.1B active MoE, Apache 2.0). Strong reasoning, tool use, and agentic capabilities. Fits into a single H100 GPU. Configurable reasoning effort.

Vendor: OpenAI
Use case: Compact open-weight reasoning model (21B total, 3.6B active MoE, Apache 2.0). Runs within 16GB memory — ideal for local deployment and edge devices. Configurable reasoning effort.

Vendor: Meta
Use case: Large instruction-tuned model (70B). General-purpose chat, reasoning, and text generation. One of the most widely adopted open-source LLMs.

Vendor: Meta
Use case: Small instruction-tuned model (3B). Efficient text generation and chat for lightweight and on-device use cases.

Vendor: Nous Research
Use case: Very large instruction-tuned model (405B). Built on Llama-3.1-405B, fine-tuned for high-quality reasoning, instruction following, and complex task completion.