Install
Back to Blog

GPT-5.4 is now the default model in Augment and free for a limited time. Here’s why.

Mar 5, 2026
John Edstrom
John Edstrom
GPT-5.4 is now the default model in Augment and free for a limited time. Here’s why.

GPT-5.4 is OpenAI’s newest model, and it’s the first one we’ve used that feels built for agent workflows: planning cleanly, delegating well, and consistently following through without getting lost halfway. That matters for how we build and ship in Augment, so we’ve made GPT-5.4 our new default model.

For a limited time, it’s also free to use in Augment, across Intent, the Auggie CLI, or our IDE extensions. Your existing threads won’t change unless you choose to switch models.

Post image

GPT 5.4 is free for a limited time

5.4 is built for this moment in AI

Agent orchestration is the new bottleneck. Model capability has made execution cheap. Agents can generate a lot of code quickly, but coordination is where workflows still collapse: specs drift, tool state gets lost, parallel efforts conflict, and the system slowly diverges from “what we meant” to “what happened.” The next leap isn’t raw codegen. It’s models that can manage long-horizon work: keep a plan coherent, update it when constraints change, and use tools and intermediate artifacts predictably.

GPT-5.4 feels tuned for that reality. In our testing it stays anchored longer, makes fewer “start over” turns, and behaves more like a reliable orchestrator. This is exactly the behavior you want inside Intent, where coordination is the product. But it’s not a specialist model you only reach for on big, multi-agent jobs. It’s a strong daily driver because the same traits that make orchestration work, like token efficiency on complex tasks and reliable tool use, also make day-to-day development smoother.

Since launching GPT-5, we've focused on training models that can take on longer-horizon coding work. GPT-5.4 feels like the first model to cross the threshold on intelligence, communicativeness, and reliability, so developers can ship more with confidence in an agentic environment like Intent, where Augment's context engine helps keep the model anchored in the right code and context.
Carter McClellan

Carter McClellan

OpenAI Applied Evals

What’s improved

Better token efficiency on hard tasks. On harder tasks like complex refactors, multi-file reasoning, and architectural planning, GPT‑5.4 uses ~18–20% fewer tokens in our internal testing. Fewer tokens means more work done for the same budget, and less “restart the whole approach” overhead on long runs.

More reliable tool use. GPT‑5.4 does a better job tracking where it is in a workflow, recovering from small mistakes, and completing multi-step sequences without repeating work.

Stronger orchestration. Multi-agent orchestration is one of the biggest step-ups. GPT‑5.4 is better at breaking a spec into clear tasks, respecting inter-agent dependencies, and keeping agents aligned on the same definition of done.

Try it now

Open Augment, pick GPT‑5.4, and try it on something real:

  • Run a refactor.
  • Ask for an architectural plan with clear steps.
  • Spin up an Intent workflow and let it orchestrate the work end to end.

GPT‑5.4 is free to try in Augment for a limited time. Sign up today and start building.

Written by

John Edstrom

John Edstrom

Director of Engineering

John is a seasoned engineering leader currently redefining how engineering teams work with complex codebases in his role as Director of Engineering at Augment Code. With deep expertise in scaling developer tools and infrastructure, John previously held leadership roles at Patreon, Instagram, and Facebook, where he focused on developer productivity and platform engineering. He holds advanced degrees in computer science and brings a blend of technical leadership and product vision to his writing and work in the engineering community.

Get Started

Give your codebase the agents it deserves

Install Augment to get started. Works with codebases of any size, from side projects to enterprise monorepos.