context-hierarchy-design
About
This skill helps developers design progressive context loading systems to optimize Claude's memory usage. It provides a three-tier hierarchy strategy for organizing CLAUDE.md imports, implementing just-in-time loading, and creating priming hierarchies. Use it when setting up context infrastructure, refactoring bloated files, or optimizing agent startup time.
Quick Install
Claude Code
Recommended/plugin add https://github.com/majiayu000/claude-skill-registrygit clone https://github.com/majiayu000/claude-skill-registry.git ~/.claude/skills/context-hierarchy-designCopy and paste this command in Claude Code to install this skill
Documentation
Context Hierarchy Design Skill
Design a memory hierarchy that loads context progressively based on task needs.
Purpose
Not all context is needed all the time. A well-designed hierarchy ensures agents get exactly the context they need without bloat.
When to Use
- Setting up a new project's context infrastructure
- Refactoring bloated CLAUDE.md files
- Creating task-specific context loading
- Optimizing agent startup time
- Scaling context for multiple task types
The Three-Tier Memory Strategy
Tier 1: CLAUDE.md (minimal, always loaded)
|
v
Tier 2: Priming Commands (task-specific, on-demand)
|
v
Tier 3: File Reads (just-in-time, as needed)
Design Process
Step 1: Audit Current State
Analyze existing context infrastructure:
- Measure CLAUDE.md size
- Count imports and their sizes
- Review command coverage
- Identify task types
Checklist:
- [ ] CLAUDE.md token count
- [ ] Import count and total
- [ ] Task types identified
- [ ] Command coverage mapped
Step 2: Categorize Context
For each piece of existing context, ask:
| Question | If Yes -> |
|---|---|
| Needed for EVERY task? | Tier 1 (CLAUDE.md) |
| Needed for task TYPE? | Tier 2 (Priming) |
| Needed for specific work? | Tier 3 (On-demand) |
Step 3: Design Tier 1 (CLAUDE.md)
Minimal essentials only:
# Project Name
## Context
One-sentence project description.
## Tooling
- Language: X
- Runtime: Y
- Package Manager: Z
## Key Commands
- `cmd test` - Run tests
- `cmd build` - Build project
## Critical Rules
1. Most important rule
2. Second most important
3. Third most important (max 5)
Target size: less than 2KB.
Step 4: Design Tier 2 (Priming Commands)
Create commands for each task type:
| Task Type | Command | Context Loaded |
|---|---|---|
| General | /prime | README, git status |
| Bug Fix | /prime-bug | Recent commits, test files |
| Feature | /prime-feature | Architecture, API patterns |
| Review | /prime-review | Style guide, PR diff |
| Docs | /prime-docs | Existing docs, API surface |
Priming Command Template:
# Prime: [Task Type]
## Run
[Discovery commands for current state]
## Read
[Essential files for this task type]
## Report
[Summary prompt for understanding]
Step 5: Design Tier 3 (On-Demand)
Identify files loaded during work:
| Category | Loading Strategy |
|---|---|
| Source files | Read specific file when editing |
| Test files | Read when running tests |
| Configs | Read when configuring |
| Dependencies | Read when debugging deps |
Principle: Never pre-load what can be loaded on-demand.
Output Format
Design document structure:
# Context Hierarchy Design: [Project]
## Tier 1: CLAUDE.md (Always Loaded)
**Target Size:** <2KB
**Content:**
- Project context (1 sentence)
- Tooling (language, runtime, PM)
- Key commands (3-5)
- Critical rules (3-5)
## Tier 2: Priming Commands (Task-Specific)
### /prime
**Purpose:** General codebase understanding
**Loads:** README, git status, project structure
### /prime-bug
**Purpose:** Bug fixing context
**Loads:** Recent commits, test patterns, error logs
### /prime-feature
**Purpose:** Feature development context
**Loads:** Architecture, API patterns, similar features
## Tier 3: On-Demand (Just-in-Time)
**Strategy:** Load during execution, not pre-emptively
- Source files: Read when editing
- Tests: Read when testing
- Configs: Read when configuring
## Migration Plan
1. [ ] Backup current CLAUDE.md
2. [ ] Create minimal CLAUDE.md
3. [ ] Create priming commands
4. [ ] Test with common task types
5. [ ] Iterate based on usage
Design Patterns
Pattern: Import Hierarchy
CLAUDE.md (always)
-> imports/core.md (always, if large)
-> imports/tooling.md (conditional)
-> imports/workflows.md (conditional)
Pattern: Directory-Scoped Memory
/project
CLAUDE.md (project-wide)
/frontend
CLAUDE.md (frontend-specific)
/backend
CLAUDE.md (backend-specific)
Pattern: Task-Type Priming
/prime -> Base context
/prime-{type} -> Type-specific context
Validation Criteria
After design, verify:
- CLAUDE.md under 2KB
- Every task type has priming command
- No "just in case" loading
- Progressive disclosure works
- Common workflows smooth
Anti-Patterns to Avoid
| Anti-Pattern | Problem | Solution |
|---|---|---|
| Everything in CLAUDE.md | Bloat | Move to priming |
| No priming commands | Static context | Add task-type priming |
| Import everything | Wasted tokens | Conditional imports |
| Directory CLAUDE.md bloat | Compounding | Keep scoped minimal |
Key Quote
"Use context priming over CLAUDE.md or similar auto-loading memory files. Engineering work constantly changes, but memory files only grow."
Cross-References
- @context-priming-patterns.md - Priming command patterns
- @rd-framework.md - Reduce unnecessary context
- @context-layers.md - What loads into context
- @minimum-context-principle.md - Include only what's necessary
Version History
- v1.0.0 (2025-12-26): Initial release
Last Updated
Date: 2025-12-26 Model: claude-opus-4-5-20251101
GitHub Repository
Related Skills
content-collections
MetaThis skill provides a production-tested setup for Content Collections, a TypeScript-first tool that transforms Markdown/MDX files into type-safe data collections with Zod validation. Use it when building blogs, documentation sites, or content-heavy Vite + React applications to ensure type safety and automatic content validation. It covers everything from Vite plugin configuration and MDX compilation to deployment optimization and schema validation.
creating-opencode-plugins
MetaThis skill provides the structure and API specifications for creating OpenCode plugins that hook into 25+ event types like commands, files, and LSP operations. It offers implementation patterns for JavaScript/TypeScript modules that intercept and extend the AI assistant's lifecycle. Use it when you need to build event-driven plugins for monitoring, custom handling, or extending OpenCode's capabilities.
sglang
MetaSGLang is a high-performance LLM serving framework that specializes in fast, structured generation for JSON, regex, and agentic workflows using its RadixAttention prefix caching. It delivers significantly faster inference, especially for tasks with repeated prefixes, making it ideal for complex, structured outputs and multi-turn conversations. Choose SGLang over alternatives like vLLM when you need constrained decoding or are building applications with extensive prefix sharing.
evaluating-llms-harness
TestingThis Claude Skill runs the lm-evaluation-harness to benchmark LLMs across 60+ standardized academic tasks like MMLU and GSM8K. It's designed for developers to compare model quality, track training progress, or report academic results. The tool supports various backends including HuggingFace and vLLM models.
