OpenClaw with free models from nvidia

ByConan February 14, 2026February 14, 2026

If you are using OpenClaw quite intensively, you will find that the API cost might be a huge part of the bot. If you are trying to use top frontier models like Opus 4.6, GPT 5.3 Codex, you can easily burn hundreds/thousands of dollars per month. I tried to reduce operating cost by using some top-tier Chinese models, such as Kimi 2.5, Minimax 2.5, GLM 5 but it is hard to use the lowest subscription coding tier. For example, my Kimi Moderato account can only “live” for 2-3 days, and it burned all the weekly usage limit.

Another option is pushing some money to OpenRouter and using cheaper models, and limiting OpenClaw to run less frequently. But we have another option to try various models before paying: using NVIDIA Build API key. So this article will cover some notes to utilize OpenClaw with free models from Nvidia Build program.

First, register to https://build.nvidia.com/ to create an account
Then, generate API key at https://build.nvidia.com/settings/api-keys
Then, go to the Models menu and select the model that you are going to use. For example, I am using GLM5 from Z.ai, which was released some days ago.
Clicking on the “View Code” and we can see the base_url and model ID that we are going to use when configuring OpenClaw model.
Next, open your OpenClaw model configuration menu (openclaw configure > Models), choose “Custom Provider”:
- In “API Base URL”, input base_url you get from NVIDIA model card: “https://integrate.api.nvidia.com/v1“
- In “API Key (leave blank if not required)”, input YOUR_API_KEY
- In “Endpoint compatibility”, choose “OpenAI-compatible (Uses /chat/completions)”
- In “Model ID”, input the model ID you get from NVIDIA model card, e.g. “z-ai/glm5“
- Wait for openclaw to verify and you are done
Next, we will need to change the context window for the model to work:
- Open “~/.openclaw/openclaw.json“, search for the configuration of nvidia model (e.g. nvidia-glm5)
- Change the proper value for “contextWindow“. This value depends on each model. For example with kimi-k2.5, it is “200000”
- Change proper value for “maxTokens”, e.g. 8192
Done, restart gateway “openclaw gateway restart” and enjoy

I also attached a part of configured model in openclaw.json as follows.

"models": {
    "mode": "merge",
    "providers": {
      
      "nvidia-kimi": {
        "baseUrl": "https://integrate.api.nvidia.com/v1/",
        "apiKey": "nvapi-YOUR_API_KEY_HERE",
        "api": "openai-completions",
        "models": [
          {
            "id": "moonshotai/kimi-k2.5",
            "name": "NVIDIA Kimi K2.5",
            "reasoning": false,
            "input": [
              "text"
            ],
            "cost": {
              "input": 0,
              "output": 0,
              "cacheRead": 0,
              "cacheWrite": 0
            },
            "contextWindow": 256000,
            "maxTokens": 8192
          }
        ]
      }
    }
  },
  "agents": {
    "defaults": {
      "model": {
        "primary": "nvidia-kimi/moonshotai/kimi-k2.5",
        "fallbacks": [
          "kimi-coding/k2p5"
        ]
      },
      "models": {
        "kimi-coding/k2p5": {
          "alias": "Kimi K2.5"
        },
        "nvidia-kimi/moonshotai/kimi-k2.5": {}
      },
    }
  },

AI
From Vibe Coding to Spec-Driven Development
ByConan November 5, 2025January 20, 2026
Nowsadays, Modern AI coding tools like GitHub Copilot / Claude Code or IDE like Cursor / Windsurf have accelerated how we write software—but they’ve also introduced a new habit: vibe coding. Developers often jump straight into implementation based on intuition or incomplete prompts. The result? Misaligned intent, inconsistent architecture, and fragile systems. Spec-Driven Development (SDD)…
Read More From Vibe Coding to Spec-Driven Development
AI
Claude Code Task Management System
ByConan January 23, 2026January 25, 2026
These days, the speed of release from Claude Code is quite … super sonic. As the CC creator tweeted last month, they use Claude Code to code for their releases, and tbh, they just released Claude Cowork this month. Early today, Claude Code 2.1.16 was released, and they introduced Claude Code Task Management System. If…
Read More Claude Code Task Management System
AI
The purpose of a software engineer is to solve problems
ByConan January 16, 2026January 16, 2026
The purpose of a software engineer is to solve known problems, and to find new problems to solve The above quote from Jensen Huang – President and CEO of NVIDIA – is the whole content of the blog entry that I want to post today :-). I am lazy, so here is the short video…
Read More The purpose of a software engineer is to solve problems
AI | General
NVidia CEO Jansen Huang wants employees to stop coding
ByConan February 14, 2026February 14, 2026
AI is significantly changing entire jobs, forever. A recent news report revealed that NVIDIA is deploying Codex for all its employees Imho, SWE jobs are changing forever. A new era of agentic coding is coming, and hopefully we can adapt to survive quickly. Another blog worth reading at https://x.com/mattshumer_/status/2021256989876109403?s=46 . This is a bit exaggerated…
Read More NVidia CEO Jansen Huang wants employees to stop coding
AI
Claude Code Notes & Workflow
ByConan January 12, 2026February 24, 2026
Today I moved my Claude Code notes & Workflow notes to this blog entry so that it can be located easier! In this post we’ll walk through all the important Claude Code commands, workflows, tools, tips, and productivity notes you need to get the most out of your AI coding assistant environment. Getting Started with…
Read More Claude Code Notes & Workflow
AI
The Evolution of RAG: From Traditional RAG to Agentic RAG
ByConan December 5, 2025December 5, 2025
In this post, we’ll walk through what traditional RAG is, why it’s hitting its limits, and the evolution from traditional RAG to Agentic RAG for production-grade LLM applications.
Read More The Evolution of RAG: From Traditional RAG to Agentic RAG

Similar Posts

Leave a Reply Cancel reply