The Code Review Queue From Hell: Automate the Boring Checks Without Shipping Garbage
A paved-road approach to code review automation that keeps quality high, cycle time low, and engineers out of approval purgatory.
Automate what’s deterministic so humans can review what’s consequential.
The PR that waited three days (and still shipped a bug)
I’ve seen this movie too many times: a startup moves fast, the PR queue grows teeth, and suddenly code review becomes the bottleneck. Engineers start batching changes, reviewers skim to survive, and the team ships the exact kind of production bug that “three approvals” was supposed to prevent.
The irony is the queue is usually full of low-signal work:
- “Please run `prettier`.”
- “Lint is failing on line 842.”
- “You forgot to update the migration.”
- “Why is the lockfile changed?”
Humans are spending their best attention on things a machine can enforce deterministically.
The goal of code review automation isn’t to eliminate reviews. It’s to protect reviewer attention so humans focus on what humans are good at: intent, correctness, architecture, and risk.
Automate the boring checks; keep humans on the sharp edges
A good rule: if the feedback is predictable and repeatable, it belongs in automation.
Machines should enforce:
- Formatting (`prettier`, `gofmt`, `black`)
- Linting (`eslint`, `golangci-lint`)
- Unit tests + coverage thresholds (carefully)
- Buildability and packaging
- Dependency risk (`Dependabot`, `npm audit`, `pip-audit`)
- Secret scanning
- Basic SAST (`CodeQL` or `Semgrep`)
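Most of these deterministic checks can also run locally before CI ever sees the PR. A minimal `.pre-commit-config.yaml` sketch, assuming you use the pre-commit framework; the hook repos and `rev` pins here are illustrative, not recommendations:

```yaml
# .pre-commit-config.yaml -- requires the pre-commit framework (pip install pre-commit)
repos:
  - repo: https://github.com/pre-commit/pre-commit-hooks
    rev: v4.5.0  # illustrative pin; use your own
    hooks:
      - id: trailing-whitespace
      - id: check-merge-conflict
  - repo: https://github.com/pre-commit/mirrors-prettier
    rev: v3.1.0  # illustrative pin
    hooks:
      - id: prettier
```

This doesn’t replace the same checks in CI — it just shortens the feedback loop from minutes to seconds.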
Humans should review:
- API contracts, data model changes, and migrations
- Concurrency, idempotency, and failure modes
- Backwards compatibility and rollout plan
- Security-sensitive logic (authz/authn, crypto, multi-tenant boundaries)
- Observability (logs/metrics/traces), SLO impact, and on-call blast radius
If you’re using AI-assisted coding (or doing “vibe coding”), this separation matters even more. AI-generated code often passes lint and still fails at domain correctness or introduces subtle security and reliability issues. Automation should catch the obvious stuff; humans must interrogate behavior.
The paved-road default: GitHub-native gates + one CI workflow
Every team gets tempted to build a bespoke “policy engine” with custom bots, YAML gymnastics, and a dozen exceptions. I’ve seen that fail: the bot becomes a second product, and the rules drift until nobody trusts it.
Start with paved-road defaults that GitHub already supports:
- Branch protection rules on `main`/`master`
- Required status checks (CI must be green)
- Required reviews (1–2, not 5)
- `CODEOWNERS` for routing, not bureaucracy
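If you’d rather keep branch protection in code than click through the repo settings UI, apps like Probot’s Settings let you declare it in `.github/settings.yml`. A sketch under that assumption — the app must be installed on the repo, and the field names follow that app’s schema:

```yaml
# .github/settings.yml -- requires the Probot "Settings" GitHub App
branches:
  - name: main
    protection:
      required_pull_request_reviews:
        required_approving_review_count: 1
      required_status_checks:
        strict: true          # branch must be up to date before merging
        contexts: ["test"]    # job names that must pass
      enforce_admins: false
```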
Here’s a minimal CODEOWNERS that actually reduces cycle time by sending PRs to the right people:
```text
# .github/CODEOWNERS

# Default
* @platform-team

# High-risk areas
/apps/api/auth/* @security-champions @backend-leads
/infra/terraform/* @platform-team
/db/migrations/* @backend-leads

# Frontend ownership
/apps/web/* @frontend-team
```

And here’s a single GitHub Actions workflow that covers most repos without turning CI into a Rube Goldberg machine:
```yaml
# .github/workflows/ci.yaml
name: CI
on:
  pull_request:
  push:
    branches: [ main ]
jobs:
  test:
    runs-on: ubuntu-latest
    steps:
      - uses: actions/checkout@v4
      - name: Set up Node
        uses: actions/setup-node@v4
        with:
          node-version: 20
          cache: npm
      - name: Install
        run: npm ci
      - name: Lint
        run: npm run lint
      - name: Unit tests
        run: npm test -- --ci
  codeql:
    runs-on: ubuntu-latest
    permissions:
      actions: read
      contents: read
      security-events: write
    steps:
      - uses: actions/checkout@v4
      - uses: github/codeql-action/init@v3
        with:
          languages: javascript-typescript
      - uses: github/codeql-action/analyze@v3
```

Trade-off: running CodeQL on every PR can be slow on large repos. If it adds more than ~5–8 minutes and developers start “checking out” mentally, run it on `main` pushes and nightly, but keep fast lint + unit tests on every PR.
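If you take that route, the only change is the triggers: move the CodeQL job into its own workflow that runs on `main` pushes plus a nightly cron. A sketch (the cron time is arbitrary):

```yaml
# .github/workflows/codeql.yaml -- same codeql job as before, different triggers
name: CodeQL
on:
  push:
    branches: [ main ]
  schedule:
    - cron: "0 3 * * *"  # nightly at 03:00 UTC
```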
Before/after: cycle time and failure rate (what changes in practice)
A concrete example I’ve seen multiple times:
Before automation (typical):
- PR requires 2–3 approvals regardless of change type
- CI is inconsistent; developers “run tests locally” (sometimes)
- Review comments are dominated by nits
- Average PR time-to-merge: 36–72 hours
- Production incidents: 1–2/month tied to “should’ve been caught” issues
After paved-road automation (typical in 2–4 weeks):
- PR requires 1 approval by default; `CODEOWNERS` enforces extra review for high-risk paths
- Required checks: lint + unit tests + build
- Code scanning runs predictably (PR or nightly)
- Average PR time-to-merge: 6–18 hours (often less for small PRs)
- Incidents: fewer “obvious” failures; when failures happen, they’re more often real complexity, not process gaps
The business translation founders care about:
- Faster merge cycle = faster customer-facing iteration (and less context switching)
- Fewer regressions = fewer churn-y “your product broke us” calls
- More reliable SDLC = easier investor diligence (you can explain your controls)
The hidden win: code review becomes a design conversation again.
Risk-based gates: don’t treat a README change like a payments refactor
One-size-fits-all review policy is how you get bureaucracy. The trick is to apply friction proportional to risk.
Practical heuristics that work:
- Low risk: docs, comments, internal refactors with no behavior change
- Medium risk: feature work behind a flag, non-critical endpoints
- High risk: auth, payments, multi-tenant boundaries, migrations, infra, deploy pipelines
You can implement risk-based behavior without bespoke tooling using labels and path filters.
Example: require an extra approval only when high-risk directories change. GitHub branch protection can’t express “if path then approvals,” but you can approximate paved-road behavior by:
- Using `CODEOWNERS` for sensitive paths (best default)
- Keeping required approvals low (1–2)
- Enforcing required checks for everyone
- Adding a lightweight PR checklist template for high-risk work
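One way to approximate path-aware behavior in CI is a filter step that flags high-risk changes and gates extra steps on the result. A sketch using the third-party `dorny/paths-filter` action; the paths are illustrative:

```yaml
- uses: dorny/paths-filter@v2
  id: filter
  with:
    filters: |
      high_risk:
        - 'apps/api/auth/**'
        - 'db/migrations/**'
# Later steps can gate on the output, e.g. post a reminder or apply a label
- if: steps.filter.outputs.high_risk == 'true'
  run: echo "High-risk paths changed; apply the risk checklist"
```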
Here’s a PR template that avoids performative checkboxes and focuses on operational risk:
```markdown
<!-- .github/pull_request_template.md -->

## What changed

## Why

## Risk
- [ ] Migration included (if schema changed)
- [ ] Rollback plan considered
- [ ] Observability updated (logs/metrics) for new behavior
- [ ] Feature flag / gradual rollout (if user-facing)

## How to test

## Screenshots (if UI)
```

If you want one extra layer without building a “bot platform,” tools like reviewdog can annotate lint results inline so developers fix issues before humans even look:
```yaml
- name: ESLint with reviewdog
  uses: reviewdog/action-eslint@v1
  with:
    github_token: ${{ secrets.GITHUB_TOKEN }}
    reporter: github-pr-review
    eslint_flags: "src"
```

Trade-off: annotation tools add setup cost and occasional noise. If the team isn’t already green on “lint/test in CI,” don’t add to reviewers’ cognitive load yet.
Keep automation from becoming the new bottleneck
I’ve also seen “automation” slow teams down because it’s noisy, flaky, or too slow. Common failure modes:
- Flaky tests block merges → people rerun CI until it “turns green” (garbage signal)
- Slow pipelines encourage giant PRs and late-night merges
- Too many required checks create a whack-a-mole experience
- Security tooling spam leads to alert fatigue
Guardrails that actually work:
- Keep the PR pipeline under 10 minutes whenever possible
- Split fast vs slow checks:
  - Fast (every PR): format, lint, unit tests, build
  - Slow (nightly or `main`): deeper SAST, container scanning, dependency graph audits
- Quarantine flaky tests:
  - Tag and separate them; don’t block merges on “known flaky” suites
  - Track flake rate and burn it down like real debt
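A sketch of the quarantine split in CI: the blocking job skips quarantined specs, and a non-blocking job keeps running them so the flake rate stays visible. The `quarantine` directory name and the Jest flags are assumptions about your test layout:

```yaml
jobs:
  test:
    runs-on: ubuntu-latest
    steps:
      - uses: actions/checkout@v4
      - run: npm ci
      # Blocking: everything except the quarantine directory
      - run: npm test -- --ci --testPathIgnorePatterns=quarantine
  flaky:
    runs-on: ubuntu-latest
    continue-on-error: true  # visible in the UI, but never blocks merges
    steps:
      - uses: actions/checkout@v4
      - run: npm ci
      # Non-blocking: quarantined suites only
      - run: npm test -- --ci --testPathPattern=quarantine
```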
A simple pattern for controlling CI runtime is to cache dependencies and avoid reinstalling the world:
```yaml
- uses: actions/setup-node@v4
  with:
    node-version: 20
    cache: npm
- run: npm ci
```

If your repo is polyglot or you’ve got legacy build systems, this is where teams start inventing bespoke pipelines. Don’t. Standardize a single entrypoint per repo (`make test`, `make lint`) and let the workflow stay boring.
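With a single entrypoint, the workflow stops caring what’s inside the repo. A sketch, assuming the repo ships a Makefile with `lint` and `test` targets:

```yaml
jobs:
  ci:
    runs-on: ubuntu-latest
    steps:
      - uses: actions/checkout@v4
      # The Makefile owns toolchain details; CI just calls the entrypoints
      - run: make lint
      - run: make test
```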
When you need specialists (and why “just hire a platform engineer” is slow)
Once you add automation, it will surface uncomfortable truths:
- The test suite is brittle and untrusted
- The module boundaries are spaghetti
- The deployment pipeline has no real rollback story
- “Security findings” are everywhere because the codebase is structurally unsafe
This is the point where I’ve watched teams waste months trying to staff up reactively. Hiring is slow, and the wrong hire (or an overconfident generalist) can turn “fix CI” into a quarter-long rewrite.
This is exactly where a curated specialist network helps. GitPlumbers shows up in three concrete ways:
- Run Automated Insights to get a fast, repo-wide read on structural issues, security gaps, and reliability risks—directly from GitHub, without a six-week platform project.
- Book a code audit (pre-scale, pre-funding, pre-hire) to get a prioritized remediation plan: what to fix now, what to defer, and what’s secretly a rebuild.
- Assemble a fractional team for remediation: a CI/SRE specialist to de-flake pipelines, an AppSec person to tune `CodeQL`/`Semgrep`, and a senior backend engineer to untangle the hot path—without committing to a full hiring spree.
I’ve seen this approach cut “time to stable CI” from months to a few weeks, because the work is scoped, owned, and executed by people who’ve done the exact failure recovery before.
The win isn’t more tooling. The win is restoring trust: in CI, in reviews, and in the release process.
A practical rollout plan you can execute this month
Week 1: Establish the paved road
- Add branch protection + required checks
- Add `CODEOWNERS`
- Make CI run `lint` + `test` reliably
Week 2: Reduce review friction
- Lower default required approvals to 1
- Use `CODEOWNERS` for high-risk paths
- Introduce a PR template focused on risk and testing
Week 3: Add security signal (without derailing delivery)
- Turn on `Dependabot`
- Add `CodeQL` (PR or nightly depending on repo size)
- Enable secret scanning
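Dependabot’s config lives in the repo. A minimal `.github/dependabot.yml` sketch; the ecosystem and cadence depend on your stack:

```yaml
# .github/dependabot.yml
version: 2
updates:
  - package-ecosystem: "npm"
    directory: "/"
    schedule:
      interval: "weekly"
```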
Week 4: Measure outcomes and fix the loudest pain
- Track PR cycle time, CI duration, and flake rate
- Remove or tune any check that produces noise
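You don’t need a metrics platform to get a baseline. A sketch of a scheduled workflow that dumps merged-PR timestamps with the GitHub CLI; turning the JSON into cycle-time stats is left to whatever script you prefer, and the cron time is arbitrary:

```yaml
name: pr-metrics
on:
  schedule:
    - cron: "0 6 * * 1"  # weekly, Monday 06:00 UTC
jobs:
  cycle-time:
    runs-on: ubuntu-latest
    steps:
      # createdAt/mergedAt are enough to compute time-to-merge per PR
      - run: gh pr list --state merged --limit 100 --json number,createdAt,mergedAt
        env:
          GH_TOKEN: ${{ secrets.GITHUB_TOKEN }}
          GH_REPO: ${{ github.repository }}
```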
If you want a fast, objective baseline before you tweak knobs: run GitPlumbers Automated Insights. It’ll tell you where automation will help immediately and where you’re about to discover deeper structural debt. From there, either book a code audit for a board-ready plan, or assemble a fractional team for remediation to get the delivery pipeline back under control.
Key takeaways
- Automate low-signal checks (formatting, lint, tests, dependency risk) so humans focus on design, correctness, and user impact.
- Prefer paved-road defaults (GitHub-native protections + one CI workflow) over bespoke bots and homegrown policy engines.
- Use risk-based gates: not every repo, PR, or change deserves the same friction.
- Make automation results actionable: inline annotations, short feedback loops, and a clear “what to do next.”
- When automation uncovers systemic issues (flaky tests, tangled modules, unsafe deployments), bring in targeted specialists to fix root causes fast.
Implementation checklist
- Define what humans must review (architecture, correctness, product behavior) vs what machines must enforce (style, tests, SAST, secrets).
- Standardize a single CI entrypoint (one command per language: `make test`, `npm test`, `go test ./...`).
- Turn on branch protection: required checks, required reviews, linear history (optional), and admin enforcement (case-by-case).
- Add `CODEOWNERS` to route reviews and prevent “random reviewer roulette.”
- Add automated annotations: lint + unit tests on every PR; heavier scans (SAST/Scorecard) on default branch or nightly if needed.
- Introduce risk-based rules (paths, labels, or diff size) to reduce friction for low-risk changes.
- Track outcomes: PR cycle time, review turnaround, change failure rate, and rollback frequency.
- If checks are noisy or flaky, stop and fix that before adding more automation.
Questions we hear from teams
- How many approvals should we require on pull requests?
- Default to **1 approval** for most repos, then use `CODEOWNERS` to force the right reviewers on high-risk areas (auth, payments, infra, migrations). Requiring 3+ approvals everywhere usually increases cycle time without reducing real risk.
- Should we run CodeQL/SAST on every PR?
- If it keeps PR CI under ~10 minutes and the findings are actionable, yes. If it makes CI slow or noisy, run SAST on `main` pushes and nightly, and keep fast checks (lint/unit tests/build) on every PR.
- What’s the biggest mistake teams make with code review automation?
- Adding more checks before fixing trust issues. Flaky tests and noisy linters turn automation into a bottleneck, and then people either bypass it or stop paying attention.
- When does it make sense to bring in GitPlumbers?
- When automation exposes systemic problems (flaky CI, unsafe dependency sprawl, tangled architecture) and you need fast, targeted fixes. Start by **running Automated Insights**, then **book a code audit** for prioritization, or **assemble a fractional team for remediation** to execute.
Ready to modernize your codebase?
Let GitPlumbers help you transform AI-generated chaos into clean, scalable applications.
