What’s a realistic innovation budget for an enterprise team?

Start with 10% for core product teams and 15% for platform. If your SLOs are green for two consecutive quarters, you can experiment with +5%. If error budget burn exceeds 30% for a service, pause its innovation time until stability returns.

How do I prevent “stealth” projects from consuming time?

Require an RFC‑lite, a ticket with an `exp` label, and a branch named `exp/ / `. CI should block unlabelled PRs and the PMO should exclude any time not attached to an `exp` ticket from capacity accounting.

Won’t this kill creativity?

No. Constraints focus creativity. The kill/continue cadence and small blast radii lead to faster learning and less zombie work. The conversion rate typically lands around 20–30%—that’s healthy.

How do we handle compliance and security reviews?

Pre‑approve sandbox patterns with security: no PII in sandboxes, strict egress, OPA cost caps, and ArgoCD‑managed infra with TTL. Graduate behind a `feature_flag`, then run a lightweight threat model before expanding exposure.

How do we avoid innovation time getting raided for incidents?

Make the policy explicit: only services breaching SLOs pause. Other teams continue. Announce pauses and resume criteria in the weekly ops note so the default isn’t “cancel innovation for everyone.”

Culture · Oct 26, 2025 · 10 minute read

Stop Wishing for “20% Time.” Make Innovation a Budget You Can Ship Against.

Innovation without guardrails is politely sanctioned scope creep. Here’s how to fund exploration without tanking delivery, with rituals, leadership behaviors, and metrics that actually work in enterprises.

Alex Mercer

Principal Consultant, GitPlumbers

20 years shipping and rescuing systems at scale. Led SRE and platform teams through monolith-to-microservices migrations, AI-assist rollouts, and reliability turnarounds. Now at GitPlumbers, I help enterprises make innovation safe and boring enough to ship.

Innovation without guardrails is just politely sanctioned scope creep.

Back to all posts

You don’t have 20% time. You have a budget problem.

If you’ve tried “Google-style 20% time” inside a bank, insurer, or Fortune 500, you already know the punchline. The calendar says 20%, the sprint board says 140% WIP. Ops calls about a Sev2, finance wants a reforecast, and your auditors want evidence you didn’t let an engineer SSH into prod. I’ve watched three different orgs announce 20% time and end up with developers doing experiments nights and weekends, followed by a reorg that “re-centers on delivery.”

Here’s what actually works in enterprise land: treat innovation like a budget with hard caps, calendarized windows, and measurable outcomes. No slides about “innovation culture.” Just constraints you can automate and inspect.

Allocate on a calendar, not in a slide deck

Make innovation a scheduling primitive, not a slogan. What’s worked for teams I’ve led and rescued at GitPlumbers:

Budget ranges: 10% for core product teams, 15% for platform/infrastructure, 5% for compliance‑heavy units. Adjust quarterly.
Cadence: a weekly 90‑minute micro‑slot for exploration tasks + a 4‑day spike week every 6 sprints. Friday reserved for demos/retros.
Guardrails:
- Tie eligibility to SLO health: if a service is burning >30% of error budget, its innovation budget pauses until recovery.
- Max concurrent experiments per team: 2. Everything else queues.
- No stealth work: experiments must have a ticket, label, and one‑page RFC.

Example budget file checked into the repo so CI can enforce it:

# .org/innovation_budget.yaml
version: 1
quarter: 2025Q4
teams:
  checkout:
    innovation_percent: 10
    max_concurrent: 2
    slo_gate:
      error_budget_burn_threshold: 0.3
  platform:
    innovation_percent: 15
    max_concurrent: 3
    slo_gate:
      error_budget_burn_threshold: 0.4

This isn’t theater: CI reads this file, checks current burn, and blocks over‑budget PRs labeled exp.

Rituals that keep it honest

You don’t need a big‑bang “Innovation Day.” You need small, boring, repeatable rituals tied to the work.

Friday Demos (25 min, time‑boxed)
- 5 min per experiment, max five slides or a terminal. Goal: show learning, not polish.
- Attendees: product, engineering, security (rotating), support (optional).
Monday Kill/Continue (15 min)
- Every experiment gets one of: kill, continue, graduate (ship behind a flag), or archive (document + shelve).
- Decisions recorded in a public decision log.
RFC‑lite (one page)
- Context, hypothesis, success metric, blast radius, rollback, budget ask.
- Lives in the repo as rfc/exp-<slug>.md.

GitHub issue template to standardize the ask:

---
name: Experiment RFC-Lite
about: One-page experiment request
labels: [exp, rfc]
---

## Context

## Hypothesis

## Success Metric
- Primary (e.g., +10% search success, -15% MTTR)
- Guardrail (no >2% latency regression on p95)

## Blast Radius
- Env: `sandbox` | `staging` | `limited-prod` (flagged)
- Max users: <1% behind `feature_flag`

## Budget Ask
- Time: 4 days
- People: 2

## Rollback Plan

## Decision Log
- 2025-10-12: Continue for 1 week (reason...)

Slack reminder for demos (works with an incoming webhook):

curl -X POST -H 'Content-type: application/json' \
  --data '{"text":"Friday Demos in 15 min. Post links to RFCs in #exp-reviews. Agenda auto-generated from `exp` tickets opened this week."}' \
  https://hooks.slack.com/services/T000/B000/XXXX

Leadership behaviors: what you tolerate is what you get

When leaders waffle, innovation time becomes a guilt‑ridden side quest. Behaviors that have worked for me and clients:

Protect the calendar publicly. If legal demands the spike week, you move legal, not the spike. Reschedule in the open, never silently cancel.
Tie to business bets. Every quarter, publish 3–5 investment areas (e.g., “faster cold starts,” “privacy‑preserving analytics”). Experiments must map to a bet.
Kill rate target. Aim to kill 50–70% of experiments. If everything continues, you’re not picking hard problems or you’re afraid to say no.
No pet projects. Directors and above pitch like everyone else; the same RFC‑lite and demo rules apply.
Incident gate. If Sev1/Sev2 rates spike, innovation pauses for affected teams until SLOs return to green. Announce the pause and the resume criteria.
Promote visible learnings, not heroic hacks. Reward the team that killed a doomed idea after 3 days with a crisp write‑up.

Make it measurable (and boringly inspectable)

Skip vanity metrics like “number of ideas.” Track things you can query across tools.

Conversion rate: graduated_experiments / total_experiments rolling 90 days.
Time to decision: median days from RFC opened → kill/continue decision.
Adoption: % of target services/users behind a flag after 30/60/90 days.
SLO impact: error budget burn, MTTR deltas for services touched by exp.
Cost guardrails: sandbox spend per experiment with a fixed cap.

Jira JQL to pull the last 30 days of experiments shipped:

project in (PLATFORM, CHECKOUT) AND labels in (exp, spike) AND statusCategory = Done AND resolved >= -30d

Prometheus to watch error budget burn for experimental traffic (assumes label track="exp"):

sum(rate(http_requests_total{track="exp",status=~"5.."}[30d]))
/
sum(rate(http_requests_total{track="exp"}[30d]))

A simple SQL model for conversion and time to decision if you log RFC events:

-- table: innovation_log (id, rfc_id, event, ts)
-- events: opened, continued, killed, graduated

WITH decisions AS (
  SELECT rfc_id,
         MIN(CASE WHEN event IN ('killed','graduated') THEN ts END) AS decided_at,
         MIN(CASE WHEN event = 'opened' THEN ts END) AS opened_at,
         MAX(CASE WHEN event = 'graduated' THEN 1 ELSE 0 END) AS graduated
  FROM innovation_log
  GROUP BY rfc_id
)
SELECT
  COUNT(*) AS total,
  SUM(graduated) AS graduated,
  ROUND(100.0 * SUM(graduated) / NULLIF(COUNT(*),0), 1) AS conversion_pct,
  PERCENTILE_CONT(0.5) WITHIN GROUP (ORDER BY EXTRACT(EPOCH FROM (decided_at - opened_at))/86400) AS median_days_to_decision
FROM decisions;

OKR sketch that doesn’t game the system:

O: Improve innovation throughput without hurting reliability.
- KR: Convert 20–30% of experiments to shipped features with <2% p95 latency regression.
- KR: Reduce MTTR by 15% on services touched by experiments in quarter.
- KR: Kill/continue decisions within 5 business days (median).

Tooling: bake it into GitOps and CI/CD

Operationalize, don’t sermonize.

Label and branch conventions

# .github/labels.yml
- name: exp
  color: 0e8a16
  description: Experimental/spike work
- name: graduate
  color: 5319e7
  description: Ready to graduate behind a feature flag

Branches: exp/<team>/<short-slug>; PRs must reference a ticket and RFC.

CI budget gate (GitHub Actions):

# .github/workflows/innovation-budget-gate.yml
name: Innovation Budget Gate
on:
  pull_request:
    types: [opened, synchronize, reopened]
jobs:
  check:
    runs-on: ubuntu-latest
    steps:
      - uses: actions/checkout@v4
      - name: Determine team budget
        id: budget
        run: |
          TEAM=$(jq -r '.pull_request.base.repo.name' <(echo "${{ toJson(github) }}") | cut -d'-' -f1)
          PCT=$(yq ".teams.${TEAM}.innovation_percent" .org/innovation_budget.yaml)
          echo "pct=$PCT" >> $GITHUB_OUTPUT
      - name: Enforce experiment labeling and SLO gate
        run: |
          # Require 'exp' label and check SLO burn via an internal API
          LABELS=$(gh pr view ${{ github.event.number }} --json labels -q '.labels[].name')
          if ! echo "$LABELS" | grep -q "exp"; then
            echo "Missing 'exp' label for experimental PR"; exit 1; fi
          BURN=$(curl -sf https://slo-api.internal/burn?team=${TEAM})
          THRESH=$(yq ".teams.${TEAM}.slo_gate.error_budget_burn_threshold" .org/innovation_budget.yaml)
          awk -v b=$BURN -v t=$THRESH 'BEGIN{if(b>t){exit 1}}' || { echo "Error budget burn too high ($BURN > $THRESH)"; exit 1; }

Sandbox deploys with TTL (ArgoCD ApplicationSet):

apiVersion: argoproj.io/v1alpha1
kind: ApplicationSet
metadata:
  name: exp-sandboxes
spec:
  generators:
    - git:
        repoURL: https://github.com/acme/experiments.git
        revision: main
        directories:
          - path: sandboxes/*
  template:
    spec:
      project: experiments
      source:
        repoURL: https://github.com/acme/experiments.git
        targetRevision: main
        path: "{{path}}"
      destination:
        server: https://kubernetes.default.svc
        namespace: "{{path.basename}}"
      syncPolicy:
        automated: {}
        syncOptions:
          - CreateNamespace=true
      info:
      - name: ttl
        value: "168h"  # auto-cleanup in 7 days

Cost cap via OPA (Rego policy evaluated in CI or admission controller):

package budget

deny[msg] {
  input.labels.env == "sandbox"
  cost := input.estimated_cost
  cost > 100
  msg := sprintf("Sandbox cost cap exceeded: %v > $100", [cost])
}

Tag everything in Terraform

resource "aws_instance" "exp" {
  ami           = var.ami
  instance_type = "t3.small"
  tags = {
    owner    = var.engineer
    env      = "sandbox"
    ttl      = "168h"
    purpose  = "exp:${var.slug}"
  }
}

Case study: the spike week that paid for itself

A fintech client asked us to “bring back innovation” after freezing all spikes during an incident‑heavy year. We implemented the playbook above: 10% budget for product squads, 15% for platform, weekly micro‑slots, and a 4‑day spike week every 6 sprints. We wired SLO gates into their GitHub Actions and set up an exp-sandboxes cluster via ArgoCD with a $100 cost cap per experiment.

Three quarters later:

Throughput: 41 experiments; 11 graduated (27% conversion).
Reliability: MTTR down 22% on services touched by experiments; error budget burn unchanged (within ±2%).
Business impact: Two spikes became roadmap items: a feature‑flagged query cache that cut p95 latency by 18% on a noisy endpoint, and a partial replay tool for Kafka topics that shaved on‑call toil by ~4 hours/week.
Costs: Sandbox spend capped at $2.4k/quarter; no zombie infra thanks to TTL.

The CFO liked the monthly scorecard enough to let us keep the budget through year‑end. The VP Eng liked that kill decisions were public and fast. ICs liked not doing “secret weekend hacks.”

What I’d change next time

Start with a smaller spike cadence (every 8 sprints) if your incident rate is volatile; earn your way to every 6.
Add a product counterpart as co‑owner of the Friday demos so adoption doesn’t stall.
Push harder on dogfooding: graduating behind a feature_flag only counts when two teams adopt it.
Automate the decision log ingestion; humans forget to update spreadsheets. Pipe RFC events to a table and publish a dashboard.

Innovation is not the opposite of delivery. It’s delivery with smaller blast radii and the courage to kill fast.

Related Resources

Key takeaways

Treat innovation as a budget with hard caps, not a mythic 20% time promise.
Calendar it: fixed cadence windows and weekly micro-slots beat ad‑hoc “when there’s time.”
Make experiments traceable via labels, branches, and sandboxes wired into GitOps.
Ritualize decisions: demo, kill/continue, and publish a one‑page RFC with a decision log.
Measure outcomes: conversion rate to shipped, SLO impact, MTTR reduction, and adoption.
Gate with tooling: CI checks, cost caps, TTL sandboxes, and OPA policies.
Leaders must protect time in public, kill pet projects, and tie experiments to business bets.

Implementation checklist

Define innovation budget per team as a percentage of capacity with a hard cap (10–15%).
Block recurring calendar time: one weekly 90‑min micro‑slot + one 4‑day spike per 6 sprints.
Create a `labels.yml` and branch naming scheme for experiments (`exp/*`).
Stand up a sandbox cluster with TTL and cost caps; deploy via ArgoCD ApplicationSets.
Adopt a one‑page RFC template and a Friday demo → Monday kill/continue ritual.
Instrument conversion, adoption, and SLO impact; publish a monthly scorecard.
Add CI gates to enforce budget and require `exp` labels on PRs and Jira tickets.

Questions we hear from teams

What’s a realistic innovation budget for an enterprise team?: Start with 10% for core product teams and 15% for platform. If your SLOs are green for two consecutive quarters, you can experiment with +5%. If error budget burn exceeds 30% for a service, pause its innovation time until stability returns.
How do I prevent “stealth” projects from consuming time?: Require an RFC‑lite, a ticket with an `exp` label, and a branch named `exp/<team>/<slug>`. CI should block unlabelled PRs and the PMO should exclude any time not attached to an `exp` ticket from capacity accounting.
Won’t this kill creativity?: No. Constraints focus creativity. The kill/continue cadence and small blast radii lead to faster learning and less zombie work. The conversion rate typically lands around 20–30%—that’s healthy.
How do we handle compliance and security reviews?: Pre‑approve sandbox patterns with security: no PII in sandboxes, strict egress, OPA cost caps, and ArgoCD‑managed infra with TTL. Graduate behind a `feature_flag`, then run a lightweight threat model before expanding exposure.
How do we avoid innovation time getting raided for incidents?: Make the policy explicit: only services breaching SLOs pause. Other teams continue. Announce pauses and resume criteria in the weekly ops note so the default isn’t “cancel innovation for everyone.”

Ready to modernize your codebase?

Let GitPlumbers help you transform AI-generated chaos into clean, scalable applications.

Talk to GitPlumbers about operationalizing your innovation program Download the Innovation Budget Playbook (templates + policies)