The systematic approach to reducing cyclomatic complexity is to identify high-complexity functions using measurable thresholds first, then apply named structural refactoring techniques, because untargeted simplification often redistributes decision points without eliminating them.
TL;DR
Reduce cyclomatic complexity by measuring per-function scores, refactoring the worst hotspots with guard clauses or Extract Method, and enforcing thresholds in CI/CD. McCabe's metric is a useful risk signal for testing and maintenance, but thresholds should be treated as team policy rather than universal law.
Why Cyclomatic Complexity Still Matters
Cyclomatic complexity, introduced by Thomas McCabe in his 1976 paper, measures the number of linearly independent paths through a function's control flow graph. Many tools and coding standards use thresholds in the 10-25 range, but those values come from tool defaults or team policy. ESLint defaults to 20, Microsoft's CA1502 rule defaults to 25, and Radon uses letter grades tied to numeric bands.
SEI guidance also describes higher complexity as a risk signal for testing and maintenance, especially when considered alongside module size and structure rather than in isolation.
Two related metrics create a practical tension:
- Cyclomatic complexity measures execution-path growth and test surface area.
- Cognitive complexity measures how difficult branching is to understand.
Optimizing one while ignoring the other can produce premature polymorphism, artificial flattening, and metric gaming without genuine improvement. SonarQube's metrics definitions document both metrics and their differences.
That tension matters even more in large codebases, where reducing a single function's branching often means understanding the dependencies around it first. For teams planning multi-file refactors across services, reviewing dependency mapping before edits spread across callers and shared modules is a critical first step.
See how Augment Code's Context Engine maps cross-service dependencies before refactors ripple through your codebase.
Free tier available · VS Code extension · Takes 2 minutes
Prerequisites
Before starting complexity reduction work, confirm the following:
- A test suite covering existing behavior is strongly recommended. Without it, refactoring high-complexity code carries high risk regardless of technique.
- Install the appropriate analyzer for your language: Python uses `pip install radon`; JavaScript and TypeScript use ESLint with the `complexity` rule; Go uses `go install github.com/fzipp/gocyclo/cmd/gocyclo@latest`; Java often uses SonarQube or PMD via Maven or Gradle plugins.
- Familiarity with basic refactoring patterns and CI pipelines helps because the workflow relies on measurement, targeted edits, and repeatable enforcement.
With those inputs ready, baseline measurement becomes reliable and repeatable.
Step 1: Measure Baseline Complexity Before Changing Anything
Measuring baseline complexity before any edits creates the evidence needed for prioritization and validation. Without a per-function baseline, a team cannot tell whether a refactoring reduced complexity or simply moved decision points into other methods.
Radon grades map to: A (1-5), B (6-10), C (11-20), D (21-30), E (31-40), F (41+). A practical target is grade B or better for new code, with grade C or worse flagged for review. That target is a team policy choice rather than a documented language-wide standard.
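To make the number concrete, here is a minimal sketch of a per-function counter built on Python's standard `ast` module. It approximates McCabe's metric as 1 plus the number of decision points, which is the same idea radon applies; the node list and the `risky` sample are illustrative, not radon's exact implementation, and nested functions would be double-counted here.

```python
import ast

def complexity_by_function(source: str) -> dict:
    """Approximate cyclomatic complexity: 1 + decision points per function."""
    scores = {}
    for node in ast.walk(ast.parse(source)):
        if isinstance(node, (ast.FunctionDef, ast.AsyncFunctionDef)):
            score = 1
            for child in ast.walk(node):
                if isinstance(child, ast.BoolOp):
                    # 'a and b and c' contributes len(values) - 1 decisions
                    score += len(child.values) - 1
                elif isinstance(child, (ast.If, ast.For, ast.While,
                                        ast.ExceptHandler, ast.IfExp)):
                    score += 1
            scores[node.name] = score
    return scores

sample = '''
def risky(a, b, c):
    if a:
        if b and c:
            return 1
    for i in range(3):
        if i % 2:
            continue
    return 0
'''
print(complexity_by_function(sample))  # {'risky': 6}
```

Running radon over the same file would land `risky` in grade B, just inside the suggested target for new code.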
Once baseline scores exist, the team can turn raw measurements into an explicit policy instead of relying on tool defaults.
Step 2: Set Complexity Thresholds Before Refactoring
Baseline data alone does not tell a team what action to take. Setting thresholds before refactoring keeps the workflow objective and prevents scope creep. The widely used McCabe-style risk table organizes scores into bands that correlate with testing and maintenance burden:
| Cyclomatic Complexity | Risk Level |
|---|---|
| 1-10 | Simple, low risk |
| 11-20 | More complex, moderate risk |
| 21-50 | Complex, high risk |
| >50 | Very high risk |
That table is commonly reproduced in tooling and training material derived from McCabe's metric, but exact labels vary by source and organization. In practice, many teams treat 20 as a practical maximum, while stricter teams lower the ceiling for new code. Tool defaults vary, so the threshold should be treated as an engineering policy.
The following table summarizes what each major tool uses as its documented default or guidance:
| Tool | Documented Default or Guidance | Recommended Adjustment |
|---|---|---|
| ESLint complexity | Default maximum 20 | Many teams lower to 10-15 |
| SonarQube cognitive complexity | Default rule thresholds often start at 15 for most languages and 25 for C, C++, and Objective-C | Keep as-is for cognitive complexity |
| NIST SP 500-235 | Uses 10 as a reference point in discussion of cyclomatic complexity | Teams may choose to gate here |
| Microsoft CA1502 | Default threshold 25 | Some teams lower to 10-15 |
| gocyclo or golangci-lint | Team-configured in practice; gocyclo reports scores but does not impose a universal default | Choose based on codebase |
A practical policy is to set the quality gate around 10-15 for new code, flag anything above 20 for refactoring review, and treat anything above 50 as a high-priority technical debt item. For legacy codebases, use a ratchet: no new function exceeds 15, and existing hotspots are reduced incrementally over multiple cycles.
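For JavaScript and TypeScript codebases, the "new code" half of that policy is a one-line ESLint override. The value 12 below is an arbitrary illustration of a team choice, not a recommendation from ESLint itself:

```json
{
  "rules": {
    "complexity": ["error", { "max": 12 }]
  }
}
```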
With thresholds defined, the refactoring work can start with the lowest-risk structural changes first. Teams rolling this into delivery workflows can also align the gate with broader CI/CD enforcement so warnings, failures, and exemptions stay consistent across repositories.
Step 3: Apply Guard Clauses to Flatten Nested Conditionals
Guard clauses are a useful first refactoring for deeply nested functions because they reduce nesting by handling exceptional paths early. This makes the main execution path easier to read, test, and review.
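As a sketch of the transformation, consider a hypothetical order-shipping function (the `Order` model is invented for illustration):

```python
from dataclasses import dataclass, field

@dataclass
class Order:  # hypothetical model, for illustration only
    paid: bool = False
    items: list = field(default_factory=list)

# Before: the happy path is buried three levels deep.
def ship_nested(order):
    if order is not None:
        if order.paid:
            if order.items:
                return f"shipping {len(order.items)} items"
            else:
                return "empty order"
        else:
            return "unpaid"
    else:
        return "no order"

# After: guard clauses handle exceptional paths early and exit.
def ship(order):
    if order is None:
        return "no order"
    if not order.paid:
        return "unpaid"
    if not order.items:
        return "empty order"
    return f"shipping {len(order.items)} items"
```

Note that the cyclomatic complexity is identical in both versions; what drops is nesting depth, which is what cognitive-complexity scoring rewards.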
Guard clauses are especially useful in service code, validation layers, and controller logic where nesting builds up around edge cases. In larger systems, the change is safer when teams can inspect affected branches and call paths across files instead of reviewing a single method in isolation. After obvious nesting is flattened, the next opportunity is usually to isolate decision-heavy blocks into smaller units with clearer names.
Step 4: Extract Methods to Isolate Decision Points
Once guard clauses have simplified the control flow shape, Extract Method reduces local complexity by moving coherent blocks into named helpers. It works best when the extracted code has a single responsibility and a name that clarifies why the branch exists.
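A minimal sketch, using an invented pricing function: the discount decision moves into a helper whose name explains why the branch exists.

```python
# Before: arithmetic and a nested discount decision share one function.
def quote_total_before(price, qty, customer):
    subtotal = price * qty
    if customer.get("vip"):
        rate = 0.15
    else:
        if subtotal > 1000:
            rate = 0.10
        else:
            rate = 0.0
    return subtotal * (1 - rate)

# After: the decision points live in a named, separately testable helper.
def discount_rate(customer, subtotal):
    if customer.get("vip"):
        return 0.15
    if subtotal > 1000:
        return 0.10
    return 0.0

def quote_total(price, qty, customer):
    subtotal = price * qty
    return subtotal * (1 - discount_rate(customer, subtotal))
```

The helper can now be covered by three small unit tests instead of driving every rate through the full quote calculation.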
This technique also improves test design because the helper methods can be tested at narrower boundaries. When refactors start crossing modules and files, teams benefit from broader dependency visibility before changing call sites.
See how Augment Code traces cross-file call paths to keep multi-module refactors safe.
Free tier available · VS Code extension · Takes 2 minutes
Step 5: Consolidate Duplicate Exit Conditions
After extracting named helpers, consolidating duplicate exit conditions removes repeated branches when several checks return the same outcome. The calling function becomes shorter, and the rule behind the exits gets a reusable name.
Total module complexity across both functions may be equal to or slightly above the original, because the chained `or` operators in the extracted helper carry their own cyclomatic cost. The gain is per-function clarity and testability rather than an absolute reduction in branching.
This pattern is most effective when the grouped conditions express one business rule rather than a random collection of checks. If the helper name is hard to write, the conditions may not belong together. Once repeated exits are grouped, the remaining hotspots often come from behavior that changes by type or algorithm choice.
Step 6: Replace Conditional With Polymorphism for Type Dispatch
When a hotspot still branches on type after simpler refactors, replacing conditional logic with polymorphism reduces branching if behavior depends on type rather than state. This is most useful when repeated instanceof checks or type flags indicate that behavior belongs on the object itself.
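A minimal sketch with invented shape types: the type flag and its branch chain disappear because each class carries its own behavior.

```python
import math

# Before: dispatch on a type flag — every new shape adds a branch.
def area_before(shape):
    if shape["kind"] == "circle":
        return math.pi * shape["radius"] ** 2
    elif shape["kind"] == "rect":
        return shape["width"] * shape["height"]
    raise ValueError(f"unknown shape: {shape['kind']}")

# After: the caller has no branches; adding a shape means adding a class.
class Circle:
    def __init__(self, radius):
        self.radius = radius

    def area(self):
        return math.pi * self.radius ** 2

class Rect:
    def __init__(self, width, height):
        self.width = width
        self.height = height

    def area(self):
        return self.width * self.height
```

Callers simply invoke `shape.area()`, and the "unknown shape" failure mode vanishes because only valid types can be constructed.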
This refactoring should be applied selectively. Converting a short and stable conditional into a class hierarchy can reduce one metric while increasing navigation cost for the reader. If the variation is algorithmic rather than type-based, strategy selection is often a better fit than inheritance.
Step 7: Apply the Strategy Pattern to Algorithm Selection
When one method chooses among several interchangeable algorithms, the Strategy Pattern reduces complexity by moving that selection outside the method body. Instead of encoding that choice in a switch or if/else chain, the caller selects an implementation and delegates execution.
For Python codebases, dictionary dispatch often produces the same effect with less ceremony:
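For example, with invented shipping-cost strategies (the names and rates are illustrative):

```python
# Each strategy is a plain callable sharing one signature.
def flat_rate(order_total):
    return 5.0

def percent_of_total(order_total):
    return order_total * 0.1

def free_over_100(order_total):
    return 0.0 if order_total >= 100 else 7.0

SHIPPING_STRATEGIES = {
    "flat": flat_rate,
    "percent": percent_of_total,
    "promo": free_over_100,
}

def shipping_cost(order_total, strategy):
    # One dictionary lookup replaces the if/elif chain,
    # so this function's cyclomatic complexity stays at 1.
    return SHIPPING_STRATEGIES[strategy](order_total)
```

New strategies are added by registering a function, not by editing the dispatcher.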
After these structural refactors, the workflow needs one final step: verify the result and make the policy enforceable.
Step 8: Re-measure, Validate, and Gate in CI/CD
Re-measuring after refactoring confirms whether complexity actually went down and whether behavior remained intact. If tests fail after the change, the edit was not purely structural and needs another pass.
For sustained control, teams should run complexity checks before merging, ideally both in local hooks and in CI/CD. Complexity limits become durable only when the toolchain enforces them consistently. For teams tracking code quality metrics across multiple dimensions, the complexity gate becomes one signal among several rather than a standalone target.
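One hedged sketch of that gate as a CI job, assuming a Python codebase and radon's build-failing companion tool xenon; adapt the install and gate commands to your own stack and chosen threshold:

```yaml
# .github/workflows/complexity.yml — illustrative, not prescriptive
name: complexity-gate
on: [pull_request]
jobs:
  complexity:
    runs-on: ubuntu-latest
    steps:
      - uses: actions/checkout@v4
      - uses: actions/setup-python@v5
        with:
          python-version: "3.12"
      - run: pip install xenon
      # Fail the build if any function scores worse than radon grade B.
      - run: xenon --max-absolute B src/
```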
Augment Code's automated code review can also flag complexity regressions during the PR stage, catching hotspots before they reach the CI gate.
Common Mistakes and Pitfalls
Even with measurement and gating in place, complexity work is easy to game and hard to validate by intuition alone. The following mistakes show where teams often reduce a score without reducing maintenance costs.
Moving Complexity Rather Than Eliminating It
Extracting one large function into many small ones can reduce per-function scores while leaving total logical branching unchanged. Splitting only improves the codebase when the resulting methods have clearer responsibilities and lower review burden.
Gaming the Metric Through Artificial Flattening
Quality gates can incentivize cosmetic changes such as giant boolean expressions or arbitrary method splits. The metric is a signal; readability and test design still need human review.
Over-Abstraction and Premature Polymorphism
Converting a short if/else into an abstract hierarchy can improve a metric while making the code harder to navigate. Abstraction works best when it reflects stable variation, not when it exists only to satisfy a threshold.
Misinterpreting What the Metric Measures
Cyclomatic complexity does not measure all maintenance cost. Teams should avoid applying the same strict threshold to test helpers, parsers, generated code, and business-critical algorithms without context.
Ignoring Readability While Optimizing Structural Complexity
A lower score does not always mean code is easier to understand. When structural simplification worsens names, indirection, or navigation, engineering judgment should override the metric.
How Augment Code Supports Multi-File Refactors
Once teams begin reducing complexity across modules and services, dependency visibility becomes part of the refactoring workflow.
In practice, that matters most in three situations:
- Multi-file refactors where a branch change can affect distant callers
- Shared validation or policy code reused across services
- Review workflows where teams need to inspect downstream impact before merge
Augment Code's Context Engine analyzes codebases across 400,000+ files through semantic dependency graphs. Teams can inspect downstream consumers and review regression risk before and after a refactor lands, rather than discovering breakage post-merge.
Set a Complexity Gate This Sprint
A lower per-function score can still leave a system hard to review if the branching was only redistributed. The concrete next step: set one team-selected CI threshold this sprint, measure the current hotspots against it, and refactor only the worst offenders with tests in place.
For teams working across interdependent codebases, Augment Code is most useful when a local simplification can affect distant callers, shared rules, or validation paths. Context Engine provides architectural visibility into those downstream impacts across 400,000+ file codebases, so teams can verify that a refactor actually reduced total complexity before merging.
Get dependency-aware refactoring across your entire codebase.
Free tier available · VS Code extension · Takes 2 minutes
Written by

Molisha Shah
GTM and Customer Champion
