Tools · People analytics
OKR Designer
Turn strategy context into a quarter-ready OKR set that measures outcomes, not activity.
The method
Objectives and Key Results (Grove's Intel iMBO lineage, popularized by Doerr), disciplined by the strategy-kernel literature
It is the first week of the quarter and the OKR sheet is full. Every objective is inspiring, every key result is an activity — 'launch the dashboard,' 'run twelve enablement sessions' — and every target was quietly negotiated down to what the team already expected to hit. The VP reads the whole sheet and cannot find the strategy in it, because it is not there.
OKRs descend from Drucker's management by objectives through Andy Grove's practice at Intel, and John Doerr's Measure What Matters carried Grove's format to everyone else. Doerr's own telling is more demanding than the template that spread: OKRs work paired with continuous performance conversations, feedback, and recognition — the CFRs — because the quarterly ritual alone aligns paperwork, not people. The strategy literature explains why most OKR sets fail anyway. Richard Rumelt's Good Strategy / Bad Strategy names the core failure: mistaking goals for strategy. A stack of ambitious objectives with no diagnosis of the challenge and no guiding policy is what Rumelt calls bad strategy — wishful thinking, buzzwords, and ambitious goals in a template. His alternative is the proximate objective: a target close enough that the organization can see how to hit it, chosen because hitting it resolves the diagnosed challenge.
Lafley and Martin's Playing to Win supplies the test each objective has to pass. Strategy, they argue, is five integrated choices — winning aspiration, where to play, how to win, capabilities, management systems — and an objective that cannot be traced to a where-to-play or how-to-win choice is ambition wearing a format. Notice where OKRs actually sit in that cascade: they are a management system, the fifth choice, the cadence that keeps the other four honest. They are not a substitute for making them.
The honest limit comes from Phil Rosenzweig's The Halo Effect: outcomes generate attributions. Teams that hit their numbers get described as focused and disciplined; teams that miss lack execution — when the competitive environment and plain luck moved the number too. So grading needs humility, and the outcome-versus-output distinction needs teeth: a key result that counts shipped activity will score green while the business stands still.
The books hand you the format and a warning list, then leave you to draft against both. Here the drafting is live: objectives tied to your stated strategy, every key result typed outcome-versus-output honestly, and the sandbagged targets and relabeled KPIs called out by name.
The books behind this tool
- Measure What Matters — John Doerr
- Good Strategy / Bad Strategy — Richard P. Rumelt
- Playing to Win: How Strategy Really Works — A.G. Lafley & Roger L. Martin
- The Halo Effect — Phil Rosenzweig
How it works
Corpus-grounded draft (strategy cluster) at company/team/individual level: objectives tied to the stated strategy; 3–5 KRs each with outcome-vs-output typing (outputs flagged honestly, never disguised), measurement source, target logic, and baseline flags; OKR anti-pattern detection (sandbagging, KPI relabeling, cascade-by-dictation); prescribes the check-in/scoring/grading cadence. Cites the canonical constructs it leans on. Performix-surface tool (2026-07-04 routing: the goal-setting family — smart-goals · mbo-designer · balanced-scorecard · okr-designer — lives on the Performix surface, syndicated via REST/MCP).
You bring
{ context, level?: company|team|individual, objectives?, cluster? }
You get
{ level, context_summary, okr_sets[] (objective · rationale · key_results · grounded_in · strength), anti_patterns[], cadence, grounded_in, provenance }
Use it for
- →Quarter kickoff: strategy memo in → disciplined draft OKRs out, with the sandbagged targets called out
- →Pair with smart-goals for the individual layer beneath a team OKR set
- →Feed the balanced-scorecard service the same context for the measures-side complement
See it work
example outputCompany-level OKRs for a 40-person compliance-SaaS moving upmarket: Q3 = onboarding overhaul + first two enterprise logos.
O1: Turn the first 90 days into the reason customers stay, not the reason they leave
Churn concentrates in the first 90 days, which caps NRR at 96%. Before selling upmarket, the company must fix the leaky bucket — enterprise buyers scrutinize onboarding and references. This objective targets the pivotal obstacle identified in the diagnosis.
- KR: Reduce first-90-day logo churn from its current rate to under 5% of new cohorts
- Measure: % of accounts that churn within 90 days of contract start, cohort-tracked in the CRM/billing system, measured monthly by signup cohort (baseline needed)
- Target logic: Churn is the stated concentration point; a sub-5% target is a stretch because current early churn is the primary drag on 96% NRR — halving it (not shaving a point) is what breaks the retention ceiling. Baseline needed because current early-churn rate isn't quantified in context.
- KR: Increase share of new customers reaching first activated value milestone within 14 days from baseline to 70%
- Measure: % of new accounts hitting the defined activation event (e.g. first compliance workflow live) within 14 days, product analytics, weekly (baseline needed)
- Target logic: Time-to-value is the mechanism behind early churn; 14 days/70% forces a genuine speedup rather than a soft target. Baseline needed because no activation metric is defined yet — defining it is part of the work.
- KR: Raise new-customer onboarding NPS/CSAT to 45+ (NPS) across Q3 cohorts
- Measure: Post-onboarding survey score sent at day 45, tracked per cohort monthly (baseline needed)
- Target logic: Sentiment predicts renewal and reference-ability needed for enterprise selling. 45+ is a stretch for a company still fixing onboarding. Baseline needed as no survey is described.
- KR (output): Ship the redesigned onboarding flow (structured milestones + assigned onboarding owner) to 100% of new accounts by end of Q3
- Measure: % of new accounts routed through the new flow, product/ops tracking, biweekly
- Target logic: Honestly an output/milestone — the overhaul must exist before outcomes move. Marked output rather than disguised; it enables the three outcome KRs above.
Grounded in: Quality of Strategic Diagnosis, Customer Jobs and Strategic Segmentation, Disciplined Execution and Coordinated Action, Flywheel Momentum and Strategic Compounding, Stakeholder Buy-In, Trust and Engagement
O2: Prove the enterprise motion is real by landing our first lighthouse logos
The year's strategy is moving upmarket; Q3 explicitly targets the first two enterprise logos. These early wins are proof points that build the flywheel and validate whether the distinctive capability and coherence extend to enterprise.
- KR: Sign 2 enterprise logos (>$X ACV / defined seat or spend threshold) with executed contracts in Q3
- Measure: Count of closed-won enterprise-tier deals meeting the pre-agreed definition, CRM, weekly (baseline needed)
- Target logic: Directly the stated Q3 goal. Two is a stretch from zero enterprise logos for a mid-market-native team. Baseline needed to lock the enterprise-tier ACV/threshold definition so the count isn't gamed.
- KR: Build a qualified enterprise pipeline of at least 8 opportunities past first discovery to de-risk the 2 closes
- Measure: Count of enterprise-qualified opps past a defined stage gate, CRM, weekly (baseline needed)
- Target logic: Two closes require a multiple in pipeline; 8 forces real demand generation, not one lucky inbound. A ~4:1 coverage ratio is disciplined but ambitious for a new segment.
- KR: Validate enterprise-readiness by clearing security/compliance review (e.g. SOC 2 questionnaire, SSO, DPA) in at least 2 enterprise evaluations without a blocking gap
- Measure: % of enterprise deals reaching security review that pass without a deal-blocking requirement gap, deal notes, per-deal (baseline needed)
- Target logic: Enterprise deals die in procurement/security, not sales. Passing without a blocker proves the capability system actually fits enterprise — a genuine stretch for a mid-market product.
Grounded in: Strategic Focus and Distinctive Choice, Industry Structure and Competitive Forces, Distinctive Capabilities System, Coherence and Fit Among Activities
Anti-patterns flagged
- Risk of KPI-relabeling: NRR is an existing health metric — it is deliberately NOT used as a KR; the KRs target the change drivers (early churn, activation) instead. Flagging so the team resists demoting NRR into a KR.
- Output-shaped KR present and honestly labeled (onboarding flow shipment) — acceptable as an enabler but should not become the whole objective's proof.
- Watch for sandbagging on '2 logos' — with a fresh enterprise motion, 2 is genuinely ambitious, but ensure the ACV threshold is set high enough that tiny deals don't count.
Cadence
- Check-in: Weekly 30-min ops check-in: onboarding cohort metrics and enterprise pipeline movement reviewed side by side (they are coupled). First 2-3 weeks focus on establishing the missing baselines (early-churn rate, activation event, survey, enterprise-tier definition) before confidence scoring means anything.
- Scoring: Biweekly confidence scoring (red/amber/green or 1-10) per KR, owned by onboarding lead and enterprise sales lead respectively; escalate any KR red two check-ins running to leadership since Q3 has no slack.
- Grading: End-of-cycle 0.0-1.0 grade per KR with 0.7 as ambitious-success norm. Given fresh baselines, grade the outcome KRs on delta-from-established-baseline rather than absolute hit, and treat the onboarding-shipment output as pass/fail gate rather than fractional.
Run it on your data
Call it on your own inputs — over the API, or hand it to your AI agent via MCP. Discovery is open; running it is metered.