SKILL·C92866

steve-jobs-design-review

wondelai
Updated 8 days ago
1,407
148
1,407
View on GitHub
Testingexcelaitestingdesign

About

This skill critiques designs and products through the lens of Steve Jobs' principles, focusing on ruthless simplicity, essential features, and the complete end-to-end user experience. It provides a structured review protocol for cutting scope, pressure-testing workflows, and delivering binary verdicts. Use it when you need to audit for focus, simplify a product, or enforce a high standard of integrated excellence.

Quick Install

Claude Code

Recommended
Primary
npx skills add wondelai/skills -a claude-code
Plugin CommandAlternative
/plugin add https://github.com/wondelai/skills
Git CloneAlternative
git clone https://github.com/wondelai/skills.git ~/.claude/skills/steve-jobs-design-review

Copy and paste this command in Claude Code to install this skill

Documentation

Steve Jobs Design Review

Run design and product reviews the way Steve Jobs ran them: start from the customer experience, subtract until only the essential remains, and refuse to call anything done that isn't insanely great.

Core Principle

"You've got to start with the customer experience and work backwards to the technology." Review every product from what a customer sees, feels, and accomplishes — never from the feature list, the org chart, or the technology that happened to be available. And remember the standard: "Design is not just what it looks like and feels like. Design is how it works."

Scoring

Goal: 10/10. When reviewing a design, product, feature, or roadmap, rate it 0-10 against the principles below. State the current score, exactly what fails, and the specific cuts or fixes required to reach 10/10. There is no "pretty good" — anything below 10 is not done yet.

Framework

1. Simplicity Is the Ultimate Sophistication

Core concept: Simplicity is not the absence of features — it is complexity conquered. Keep subtracting until removing one more thing would break the product's purpose.

Why it works: Every element a user must perceive, parse, or decide about taxes attention and erodes confidence. Simplicity that survives deep understanding of the problem feels inevitable; simplicity achieved by hiding things feels broken.

Key insights:

  • "It takes a lot of hard work to make something simple, to truly understand the underlying challenges and come up with elegant solutions"
  • The iPod shipped with no on/off switch — the need was designed away, not the button hidden
  • Measure steps-to-value: Jobs demanded any song in three presses; the original iDVD pitch was one window, drag video in, click "Burn"
  • Prefer one good default over a setting; every preference is a decision you failed to make
  • If you must explain it, redesign it — instructions are apologies

Review applications:

ContextApplicationExample
Feature auditCount steps to core value; cut anything off the main pathSignup → first value in 3 steps, not 9
UI critiqueRemove elements until the screen states one intentOne primary button per screen
Settings reviewReplace options with opinionated defaultsAuto-save always on; no toggle

Review prompts:

  • "What can we remove and have this still work better?"
  • "Why is this here? Who asked for it, and does the core user need it?"
  • "Explain this screen in one sentence. Can't? It's two screens — or none."

Ethical boundary: Simplify by solving complexity for the user, never by burying necessary controls or costs (pricing, privacy, cancellation) where they can't be found.

See: references/simplicity-and-focus.md

2. Focus Means Saying No

Core concept: "Focusing is about saying no." Deciding what not to build is as important as deciding what to build — innovation is saying no to 1,000 things.

Why it works: Effort spread across many decent things produces nothing great. Killing good ideas concentrates the team's best people and attention on the few products that matter, and protects the product from becoming a committee's wish list.

Key insights:

  • In 1997 Jobs cut dozens of Apple products to a 2×2 matrix: consumer/pro × desktop/portable — focus saved the company
  • At retreats, the team's top-10 priority list got cut to three: "We can only do three"
  • "I'm as proud of the things we haven't done as the things we have done"
  • A roadmap with no recently killed items isn't focused, it's unexamined
  • Saying no includes features already shipped — deletion is a feature

Review applications:

ContextApplicationExample
Roadmap reviewForce-rank, then cut everything below #3Q3 plan: 3 bets, not 14 backlog items
Scope creepRequire a kill for every addNew dashboard widget = retire one
Product lineCollapse overlapping SKUs/tiersOne plan per customer type

Review prompts:

  • "If we could ship only one thing this quarter, which — and why isn't the rest cut?"
  • "What is this product deliberately bad at?"
  • "What did we say no to this cycle? Nothing? Then we said yes to mediocrity."

Ethical boundary: Say no to scope, never to evidence — killing a feature is strategy; ignoring user pain that contradicts your vision is vanity.

See: references/review-protocol.md

3. Design Is How It Works

Core concept: Design is not a veneer applied at the end — it is the architecture of how the product behaves. Judge flows, speed, and failure states, not just the mockup's beauty.

Why it works: Users don't experience screenshots; they experience latency, errors, interruptions, and sequences. A beautiful product that stutters, loses work, or confuses on failure is badly designed no matter how it looks.

Key insights:

  • The iPhone keyboard succeeded through behavior (aggressive autocorrect), not visuals — engineering and design are one discipline
  • Review the slowest moment, not the happy path: cold start, empty state, offline, error recovery
  • "It just works" is a design spec: zero configuration, zero manual, zero ceremony
  • Beauty that fights function is decoration; reject it
  • Latency is a design property — a 2-second wait is a design flaw, wherever it lives in the stack

Review applications:

ContextApplicationExample
Mockup reviewDemand the interaction, not the stillClick through states, not slides
PerformanceSet experience budgets in the reviewFirst screen < 1s or it fails review
Failure designWalk error/empty/offline pathsPayment fails → user knows exactly what next

Review prompts:

  • "Show me what happens when it fails."
  • "How does this feel after the 100th use, not the demo?"
  • "Where does the user wait, and what did we do about it?"

Ethical boundary: "How it works" includes how it respects the user — dark patterns that work for the business but against the user fail this review by definition.

See: references/end-to-end-experience.md

4. Own the Whole Experience

Core concept: The product is every touchpoint: discovery, purchase, unboxing or first run, onboarding, daily use, failure, support, billing, and leaving. Review the whole widget, not the app in isolation.

Why it works: Customers judge the experience as one thing. Apple built unboxing rituals, its own stores, and the Genius Bar because a great device sold badly or supported rudely becomes a bad product in memory.

Key insights:

  • Packaging got design-lab treatment at Apple — first impressions are part of the product
  • The first run is your unboxing: what users see at minute zero deserves hero-screen care
  • Support tickets, invoices, and cancellation flows are product surfaces — usually nobody designed them
  • Every handoff between teams (marketing → onboarding → product → support) is where experience seams show
  • Map the journey end to end; the worst touchpoint sets the perceived quality

Review applications:

ContextApplicationExample
Launch reviewAudit every touchpoint as one journeyAd promise matches first-run reality
OnboardingTreat first session as theaterFirst 60 seconds rehearsed like a keynote
LifecycleReview billing, support, offboardingCancellation takes one screen, keeps dignity

Review prompts:

  • "Walk me from hearing about this to recommending it — where does it crack?"
  • "Who designed the invoice? The error email? The cancel flow?"
  • "Does the experience keep its promise after the sale?"

Ethical boundary: Owning the whole experience means owning failures too — never design a polished entrance and a hostile exit.

See: references/end-to-end-experience.md

5. Demo or It Doesn't Exist

Core concept: Review working artifacts, not specs or slideware. Concrete demos expose truth that documents hide; decisions are made by a decider reacting to the real thing.

Why it works: Abstractions let everyone imagine a different product and agree on nothing. A demo at real size on the real device forces specific feedback, surfaces dealbreakers early, and converges by decision rather than committee drift.

Key insights:

  • Apple's software culture (Kocienda's "creative selection"): build a demo, show a decision-maker, get direct feedback, iterate — that loop is the process
  • The iPhone keyboard was chosen by a derby of competing working demos, not a requirements doc
  • Review on the target device at target data scale — a phone UI judged on a projector lies
  • Prototype the riskiest moment first; a demo of the easy 80% proves nothing
  • "Real artists ship": demos exist to force decisions, not to delay them

Review applications:

ContextApplicationExample
Design reviewBan slide-only reviewsFigma prototype or build, never static deck
Competing ideasRun a demo derby, pick oneTwo nav models built, one verdict
Stakeholder alignmentDemo to the decider weekly30-min demo replaces 3 status docs

Review prompts:

  • "Don't tell me — show me. On the device."
  • "Which of these two demos wins? Pick one; we're not shipping a compromise of both."
  • "What's the riskiest assumption, and where's the demo that tests it?"

Ethical boundary: Demos must show honest state — a staged demo that hides known breakage is a lie with a UI.

See: references/demo-culture.md

6. Taste and the Back of the Fence

Core concept: A great carpenter doesn't use plywood on the back of the cabinet, even though nobody will see it. Care invested in unseen surfaces — and the taste of the people applying it — is what quality actually is.

Why it works: Users sense craft subliminally: aligned pixels, coherent copy, graceful edge cases add up to trust. Teams that cut corners where "nobody looks" train themselves to cut corners everywhere; excellence is a habit enforced by standards, not inspections.

Key insights:

  • The original Mac team signed the inside of the case; Jobs made engineers redo the circuit board layout for beauty no customer would see
  • "Technology alone is not enough" — products live at the intersection of technology and the liberal arts
  • Audit the back-of-fence surfaces: empty states, error copy, settings pages, loading screens, emails
  • "Be a yardstick of quality" — A-players raise each other; tolerated mediocrity compounds
  • Taste is trainable: study great products, articulate why they're great, apply the standard ruthlessly

Review applications:

ContextApplicationExample
Detail auditReview the screens nobody demos404 page held to homepage standard
Copy reviewRead every string aloudError messages sound human, specific
Team standardCritique to the best work, not the average"Is this the best you've ever done?"

Review prompts:

  • "Show me the ugliest screen in the product — that's our real quality bar."
  • "Would you sign your name inside this?"
  • "Where did we use plywood?"

Ethical boundary: High standards apply to the work, never to a person's worth — demand excellence without demeaning people.

See: references/case-studies.md

7. Running the Review

Core concept: Structure the review: experience the product cold as a customer, name the One Thing it must do, audit against principles 1-6, then deliver a binary verdict — insanely great, or not done — with a specific cut list and fix list.

Why it works: Reviews fail through vagueness and politeness. A fixed walkthrough order, brutal specificity, and a binary verdict prevent "good enough" from shipping while giving the team an exact path to 10/10. Products get judged against their own promise — "What is this supposed to do? Then why doesn't it do that?"

Key insights:

  • Always experience the product cold before the meeting — first impressions can't be re-run
  • Open with the promise: state what the product claims, then test only that
  • Feedback must be specific and actionable: "this is confusing" fails review too — say what, where, why, and the fix direction
  • End binary: ship-worthy or a ranked fix list; never "polish it a bit"
  • One decider owns the verdict; input is wide, decision is narrow

ALWAYS output reviews in this format:

# Design Review: [Product/Feature]
**Verdict:** INSANELY GREAT / NOT DONE (score X/10)
**The One Thing:** [what this must do]
**Keeps its promise?** [yes/no — evidence]
**Cut list:** [what to remove]
**Fix list:** [ranked, specific, with fix direction]
**Back of the fence:** [unseen surfaces that fail the bar]

Ethical boundary: Channel Jobs' standards, not his cruelty — total candor about the work, zero contempt for the people who made it.

See: references/review-protocol.md

Common Mistakes

MistakeWhy It FailsFix
Reviewing only aestheticsDesign is how it works; pretty-but-clunky still fails usersWalk flows, latency, and failure states
Fixing problems by addingEach addition taxes attention and breeds more complexitySubtract first; additions need a kill
Consensus verdictsCommittees average ideas into mushOne decider, wide input, narrow decision
Reviewing specs and slidesAbstractions hide dealbreakers; everyone imagines a different productDemand working demos on the real device
"Good enough" verdictsMediocrity compounds into brand damageBinary: insanely great or not done
Skipping unseen surfacesUsers sense plywood; teams learn to cut cornersAudit empty/error/settings/email states
Cosplaying crueltyFear stops demos and candor, killing the feedback loopBe brutal about work, decent to people

Quick Diagnostic

QuestionIf NoAction
Can you state the One Thing this product must do in one sentence?No focus — everything is the priorityWrite it; cut what doesn't serve it
Does a new user reach core value in ≤3 steps?Complexity is unconqueredMap steps-to-value; remove, don't reorder
Did the reviewer experience it cold, as a customer?You reviewed the team's story, not the productUse it before the meeting, no walkthrough
Is there a working demo on the real device?You're approving an imagined productReschedule until there's a demo
Was anything removed this cycle?Roadmap is accreting, not focusingAdd a cut list to every review
Do error, empty, and edge states match hero-screen quality?Back of the fence is plywoodAudit and fix unseen surfaces
Would the team proudly use it daily and sign it?The bar is "acceptable", not "insanely great"Hold the binary verdict until pride is real

Reference Files

  • review-protocol.md: Full Jobs-style review session — agenda, walkthrough order, verdict format, saying-no rituals
  • simplicity-and-focus.md: Simplicity audit method, steps-to-value, the 2×2 product matrix, the no list
  • end-to-end-experience.md: Whole-widget touchpoint audit from discovery to offboarding, first-run theater
  • demo-culture.md: Creative selection loop, demo derbies, decider roles, honest-demo rules
  • case-studies.md: iMac, iPod, iPhone keyboard, Apple Stores, MobileMe failure review — what each teaches reviewers

About the Author

Steve Jobs (1955-2011) co-founded Apple and led it to create the Mac, iPod, iPhone, and iPad, building the most valuable company in the world on design-led product development. This skill distills his documented review practices and standards from Walter Isaacson's authorized biography, Ken Segall's Insanely Simple, and Ken Kocienda's Creative Selection.

Further Reading

This skill is based on documented accounts of Steve Jobs' product and design review practices:

GitHub Repository

wondelai/skills
Path: steve-jobs-design-review
0
agent-skillsai-skillsclaude-codeclaude-code-marketplaceclaude-code-pluginclaude-code-skills
FAQ

Frequently asked questions

What is the steve-jobs-design-review skill?

steve-jobs-design-review is a Claude Skill by wondelai. Skills package instructions and resources that Claude loads on demand, so Claude can perform steve-jobs-design-review-related tasks without extra prompting.

How do I install steve-jobs-design-review?

Use the install commands on this page: add steve-jobs-design-review to Claude Code as a plugin, or clone its repository into your skills directory, then restart Claude so it picks up the skill.

What category does steve-jobs-design-review belong to?

steve-jobs-design-review is in the Testing category, tagged excel, ai, testing and design.

Is steve-jobs-design-review free to use?

Yes. steve-jobs-design-review is listed on AIMCP and free to install. It runs inside Claude, so no separate service account is required to use the skill itself.

Related Skills

evaluating-llms-harness
Testing

This Claude Skill runs the lm-evaluation-harness to benchmark LLMs across 60+ standardized academic tasks like MMLU and GSM8K. It's designed for developers to compare model quality, track training progress, or report academic results. The tool supports various backends including HuggingFace and vLLM models.

View skill
cloudflare-cron-triggers
Testing

This skill provides comprehensive knowledge for implementing Cloudflare Cron Triggers to schedule Workers using cron expressions. It covers setting up periodic tasks, maintenance jobs, and automated workflows while handling common issues like invalid cron expressions and timezone problems. Developers can use it for configuring scheduled handlers, testing cron triggers, and integrating with Workflows and Green Compute.

View skill
webapp-testing
Testing

This Claude Skill provides a Playwright-based toolkit for testing local web applications through Python scripts. It enables frontend verification, UI debugging, screenshot capture, and log viewing while managing server lifecycles. Use it for browser automation tasks but run scripts directly rather than reading their source code to avoid context pollution.

View skill
finishing-a-development-branch
Testing

This skill helps developers complete finished work by verifying tests pass and then presenting structured integration options. It guides the workflow for merging, creating PRs, or cleaning up branches after implementation is done. Use it when your code is ready and tested to systematically finalize the development process.

View skill