Why this matters
As a Platform Engineer focused on Developer Experience (DX), Support & Enablement is how you remove friction so developers can deliver quickly and safely. Strong enablement means fewer blocked engineers, fewer repeated questions, and faster onboarding to your platform.
- Real tasks you will do: run a dev-support channel, triage requests, write how-to guides and runbooks, host office hours, track SLAs, and create golden paths for common tasks.
- Impact: reduce time-to-first-success for a new service, cut wait times for help, and increase platform adoption and satisfaction.
Concept explained simply
Support & Enablement ensures developers can self-serve most of the time and get fast human help when they can’t.
- Support: how you respond to questions and unblock developers (channels, SLAs, triage, escalation).
- Enablement: how you prevent questions in the first place (docs, templates, training, golden paths, sandboxes).
Mental model
Think of a 3-layer cake:
- Layer 0: Self-serve. Clear docs, FAQs, runbooks, templates, paved roads.
- Layer 1: Assisted. Chat channel, office hours, tickets with light SLAs.
- Layer 2: Escalated. On-call/escalations for platform-impacting issues with stricter SLAs.
Your goal: strengthen Layer 0 so Layers 1–2 are needed less often.
Worked examples
Example 1 — Lightweight support policy for a chat channel
Use this as a pinned message in your dev-support channel.
Open sample policy
What this channel is for: questions about the platform, CI/CD, service templates, and incidents affecting delivery.
- Hours: 09:00–17:00 local time, weekdays.
- First response target: within 1 business hour.
- Resolution target: most issues within 1 business day. Complex/escalated items may take longer (we’ll update you).
- How to ask: include service name, what you tried, error message/logs, and urgency.
- Before asking: search the Knowledge Base and pinned FAQ. If you find a gap, tell us and we’ll improve docs.
- Tagging: use #ci, #k8s, #secrets, #observability, #templates for faster routing.
- Escalation: if it blocks a release or production incident, write “ESCALATION” and ping @platform-oncall.
Example 2 — Triage workflow runbook
- Check if the question is answered by existing docs. If yes, link the doc and capture a doc feedback note.
- Classify: Question, Bug, Feature request, Incident.
- Tag: area label (e.g., ci, k8s, secrets).
- Prioritize: P1 (blocking incidents), P2 (release-blocking), P3 (normal), P4 (nice-to-have).
- Assign: route to area owner; if unclear, keep a single responsible triager.
- Update requester with status and next expected time update.
- Close with a summary and link to any new doc added.
Example 3 — Golden path for a new microservice
A golden path is a step-by-step, low-friction route for a common scenario.
- Create from service-template (preconfigured build, deploy, logging).
- Run bootstrap script to set repo, CI, environments, secrets.
- Local run via container compose; smoke test passes.
- Merge to main; CI builds image; CD deploys to staging.
- Observability auto-wired: dashboards and alerts created.
- One-click promote to production after approval.
Minimal acceptance checklist
- Build: under 10 minutes.
- Deploy: under 15 minutes to staging.
- Dashboards: 1 service overview; 1 latency panel.
- Docs: README with run, test, deploy, rollback.
Example 4 — Knowledge base structure
- Start Here: platform overview, glossary, access instructions.
- How-To Guides: short, task-focused steps (e.g., “Rotate a secret”).
- Runbooks: operational procedures for common incidents.
- Reference: limits, quotas, policies.
- FAQ: top 20 recurring questions with concise answers.
Tagging scheme
- Area: ci, cd, k8s, secrets, observability, networking.
- Difficulty: beginner, intermediate, advanced.
- State: draft, reviewed, deprecated.
Exercises
Do these to practice. You can compare with provided solutions. Keep your drafts — you’ll reuse them in real projects.
Exercise 1 — Design a tiered support model
Create a one-page support policy for your team including: tiers, channels, hours, first response/resolve targets, escalation rules, and expectations for requesters.
- Deliverable: a policy text you could paste into a chat channel or team page.
- Success checklist:
- Clear Tier 0/1/2 definition
- Specific response targets
- Escalation pathway
- Expectation to search docs first
See a sample solution
Tier 0: KB, FAQs, runbooks; must be searchable. Tier 1: #dev-support channel, 1h first response, 1 business day resolution; office hours Tue/Thu. Tier 2: on-call for production-impacting platform issues with 15-minute page target. Requesters provide service, error, steps tried; triager classifies and routes; status updates every 4 hours for P1 and daily for P2+.
Exercise 2 — Issue template and triage labels
Draft an issue template for platform requests and a label scheme for routing. Include required fields and severity selection.
- Deliverable: template text + label names and meanings.
- Success checklist:
- Minimal required fields
- Severity or priority
- Area tags for auto-routing
- Field to confirm docs were checked
See a sample solution
Title: [Area] Short summary
Checklist:
- [ ] I searched the KB/FAQ
- [ ] I can share logs or screenshots
What happened?
Expected vs actual:
Steps tried:
Service/repo:
Severity: (P1 blocking / P2 release-blocking / P3 normal / P4 nice-to-have)
Area: (ci|k8s|secrets|observability|networking)
Labels: type:question, type:bug, type:feature, area:ci, area:k8s, area:secrets, area:observability, area:networking, priority:P1–P4.
Common mistakes and how to self-check
- Only reactive support, no enablement. Fix: invest time weekly in docs, templates, and golden paths.
- Unclear expectations. Fix: publish and pin a short support policy with hours and targets.
- No triage ownership. Fix: assign a rotating triager; track unassigned items daily.
- SLAs too aggressive without capacity. Fix: start conservative, measure, then tighten.
- Knowledge base rot. Fix: add review dates and “report doc issue” pattern to each page.
- Solving symptoms, not root causes. Fix: for repeated issues, file an enablement task (doc/template/tooling) before closing.
Self-check prompts
- Can a new developer deploy a service in under a day using your golden path?
- Are 80% of support questions answered within your first-response target?
- Do top 10 repeated questions have clear, findable answers?
- Does someone own triage every business day?
Practical projects
- Build a “Start Here” page and link 5 top how-to guides. Measure if support volume drops after publishing.
- Create a one-click service template that auto-wires CI/CD and observability. Run a pilot with one team.
- Run weekly office hours for a month; track attendance, topics, and resulting doc updates.
Who this is for
- Platform Engineers aiming to improve developer productivity and satisfaction.
- Infra/SRE engineers who support internal developers.
- Tech leads responsible for platform adoption.
Prerequisites
- Basic knowledge of your platform stack (CI/CD, container orchestration, secrets, monitoring).
- Ability to write concise documentation and simple runbooks.
- Access to your team’s chat/ticketing tools.
Learning path
- Define and publish a lightweight support policy (Exercise 1).
- Create issue templates and label scheme (Exercise 2).
- Draft a golden path for a common task (from Worked examples).
- Set up a weekly triage rotation and office hours; review metrics after 2–4 weeks.
- Continuously prune and improve docs based on questions received.
Next steps
- Pick one team to pilot your policy and golden path. Iterate fast on their feedback.
- Instrument metrics: number of tickets, first response time, repeated question rate, and time-to-first-success for new services.
- Prepare a monthly summary: what improved, what remains painful, and one bold improvement to test.
Mini challenge
Choose one recurring question in your org. Create a 5-step how-to and add it to your Knowledge Base. Announce it with a short message in your support channel. Track if the same question appears over the next week.
What good looks like
- Title is action-focused (“How to rotate a secret”).
- Step list is under 10 minutes to try.
- Includes rollback/verification step.
- Tagged by area and difficulty.
Quick Test notice
Take the Quick Test below to check your understanding. Available to everyone. If you sign in, your progress will be saved.