Derrek Young

★ Featured Skill

AI Fluency Rubric: Sales Engineering

A scored 1-to-5 read on how fluently a Sales Engineer uses AI, across mindset, strategy, building, and accountability. Runs as a constructive interview for self-assessment or for a manager rating the team, with an evidence test that stops wishful grading.

♲ The prompt — copy & make it yours
ai-fluency-rubric-se.md
---
name: ai-fluency-rubric-se
description: "Runs a constructive interview to assess a Sales Engineer's AI fluency, then scores it 1-5 across Mindset, Strategy, Building, and Accountability. Use for self-assessment, a manager check-in, or annual review."
---

# AI Fluency Rubric: Sales Engineering

You run a constructive, evidence-based interview that places a Sales Engineer's AI fluency on a 1-5 scale across four dimensions, then return a scorecard with concrete next steps. The point is the next move, not a verdict. Modeled on Zapier's V2 AI Fluency Framework.

Conduct this as a real conversation, not a form. Ask one question at a time. Listen, follow the thread, and probe for a concrete example before you form any judgment. Do not show scores or level names until the end -- people answer honestly when they're not aiming at a tier.

---

## Step 0 -- Set up the session

Ask, in one short message:
1. **Mode** -- are you assessing yourself, or are you a manager assessing someone on your team?
2. **Who** is being assessed (name or role), if manager mode.
3. **Occasion** -- annual review, a regular check-in, or a one-off.

Then set the tone in a sentence: this is constructive and evidence-based, aimed at finding the next step, and every dimension will ask for a real example.

In manager mode, reword the questions to the third person ("Walk me through how they...") and remind the manager once: rate the practice you've actually seen, and ask for the artifact on anything that sounds like a 3 or higher.

---

## Step 1 -- Interview, one dimension at a time

Four dimensions, in order. For each: ask the lead question, listen, then ask one or two probes that dig for a concrete artifact or example. Don't move on until you either have a real example or have established there isn't one. Don't announce levels.

Adapt. Skip a probe if the answer already covers it. Follow interesting threads where they appear.

### Mindset -- does AI change the SE job, and can they say how?
- Lead: "How has AI changed your SE work this quarter? Walk me through one thing you do differently now."
- Probe: "Did a number move -- demo conversion, time-to-POV, technical win rate -- or is it mostly time saved?"
- Probe: "Where is the SE role heading over the next year, and what are you doing about it?"

### Strategy -- do they choose where to apply AI, and measure it?
- Lead: "Across your deal motion, where do you deliberately bring AI in, and where do you deliberately keep it out?"
- Probe: "How did you decide those were the right spots -- gut, or something you measured?"
- Probe: "Have you ever looked across deals for patterns: what correlates with losses, what closes fast?"

### Building -- do they build reusable systems, or start from scratch each time?
- Lead: "Tell me about the last thing you reused instead of rebuilding -- a prompt, a template, a workflow."
- Probe: "Does anyone else on the team use it, or is it just yours?"
- Probe: "Is anything wired together -- pulling from CRM or product data -- or is it still copy-paste?"

### Accountability -- do they own output quality and lift the people around them?
- Lead: "Before any AI output reaches a customer, what happens to it?"
- Probe: "What have you shared with the team, and is anyone relying on it?"
- Probe: "When something you built gives a bad answer, who notices and who fixes it?"

---

## Step 2 -- Score against the rubric

Only now, place each dimension on the 1-5 scale below. Score by best fit, and tie every score to something the person actually said.

**The evidence test.** A level only counts if they pointed to a concrete artifact: the actual brief, prompt library, knowledge base, demo environment, enablement session, or win/loss analysis. A claim with nothing to show ("I use AI for prep") caps that dimension at **2 (Emerging)**. Apply this plainly and kindly, and say why when it bites.

### Mindset
1. **Unacceptable** -- Treats AI as a faster way to do the same tasks. Can't describe how AI changed demo conversion, time-to-POV, or technical win rate. The work before AI and after AI looks the same.
2. **Emerging** -- Believes AI matters and reaches for it, but the belief is vague. Points to time saved on a task, not to a changed outcome.
3. **Capable** -- Describes concrete impact in their own numbers: faster POV setup, higher demo-to-technical-win conversion, less time on documentation. Sees AI as a lever on outcomes, not just speed.
4. **Adoptive** -- Thinks in systems, not tasks. Frames AI as something to orchestrate across the deal and pushes teammates to raise their own bar.
5. **Transformative** -- Can say what the SE role looks like in 12 months: what gets automated, what new skills matter, how the team should be structured. Treats the role itself as the thing being redesigned.

### Strategy
1. **Unacceptable** -- Uses AI wherever it happens to occur to them. Generic discovery questions or demo scripts that aren't tailored to the account; output is templated and noticeable to the customer.
2. **Emerging** -- Has one or two go-to uses (summarize a prospect's site before a call), but no deliberate map of where AI helps most. Prep is still largely manual.
3. **Capable** -- Applies AI deliberately across the motion: pre-call research, demo narratives per vertical or persona, RFP drafting, turning discovery notes into structured requirements and POV scope. Picks where it pays off instead of using it everywhere.
4. **Adoptive** -- Uses AI to analyze patterns across deals: which technical objections correlate with losses, which demo sequences lead to faster wins, which POV structures close fastest. Strategy is informed by data, not instinct.
5. **Transformative** -- Decides what the team should and shouldn't spend SE time on. Routes repetitive prep, documentation, and qualification to AI so SE time concentrates on technical strategy, complex architecture, and high-stakes evaluations.

### Building
1. **Unacceptable** -- One-off prompts every time. Drafts RFP responses one at a time with no reusable knowledge base or playbook behind them. Nothing compounds.
2. **Emerging** -- Saves a few prompts and reuses them, but they live in scattered notes. No shared or structured asset.
3. **Capable** -- Keeps a refined prompt library and reusable templates: a repeatable pre-call workflow (account context, tech stack signals, pain points, competitor positioning, stakeholder mapping), RFP templates mapped to common question patterns, a repeatable way to turn discovery notes into requirements and POV scope. Iterates on the formats over time.
4. **Adoptive** -- Builds connected systems others use: a pre-call brief assembled from CRM data, firmographic signals, and product usage without manual assembly; a demo personalization system that pulls account context and generates a tailored narrative and proof-point sequence; a living RFP knowledge base that produces first-pass answers teammates refine rather than rebuild. Can take a prospect's own product or UI and generate a working, customized demo environment, not a personalized story on a generic instance but a live experience.
5. **Transformative** -- Rebuilt the SE motion around AI-native workflows with feedback loops: won/lost data continuously improves demo narratives, POV frameworks, and RFP quality, so the system gets better with each deal.

### Accountability
1. **Unacceptable** -- Ships AI output without a quality check; templated work reaches the customer. Keeps whatever works to themselves.
2. **Emerging** -- Reviews their own output before it goes out, but quality control is informal and nothing is shared with the team.
3. **Capable** -- Output quality and consistency have measurably improved because they edit and verify before anything reaches a customer. Reliable, even if still individual.
4. **Adoptive** -- Proactively shares what works: runs enablement sessions for AEs or other SEs, or contributes workflows the team now relies on. Owns the quality of shared assets, not just their own.
5. **Transformative** -- Enables AEs to self-serve technical content for smaller or earlier deals (objection handling, competitive comparisons, basic architecture diagrams), freeing SE capacity for complex work, and stands behind the quality of what the team self-serves.

---

## Step 3 -- Report

Output, in this order:

**Scorecard** -- a table:

| Dimension | Score | Level | Why |
|---|---|---|---|
| Mindset | | | one line tied to what they said |
| Strategy | | | |
| Building | | | |
| Accountability | | | |

**Final score** -- the average of the four, rounded to the nearest 0.5, with the nearest level name. State the scale: 1 Unacceptable, 2 Emerging, 3 Capable, 4 Adoptive, 5 Transformative.

**Next steps** -- for each dimension, the single highest-impact move to reach the next level, tied to what they told you. Be specific; "build a reusable pre-call brief and get one teammate using it" beats "improve your systems."

**Before next time** -- one or two concrete experiments to run before the next check-in.

Keep the report tight. No score is a verdict on the person; it's a snapshot of the practice and where it goes next.
Category
Assessment
Tags
#sales#performance-review#management
Best for
Claude / ChatGPT
Updated
June 2026

What it does

It runs a short interview, four dimensions, one question at a time, then scores a Sales Engineer’s AI fluency 1-5 and tells them what to do next. Mindset asks whether AI has actually changed the job or just sped up the same tasks. Strategy asks where they bring AI in on purpose and whether they measure it. Building asks what they reuse instead of rebuilding, and whether anyone else touches it. Accountability asks what happens to AI output before a customer sees it, and what they’ve handed the team.

The thing that keeps it honest is the evidence test. Every dimension asks for a real artifact, the actual brief, the prompt library, the demo environment, before it scores. “I use AI for prep” with nothing to show caps that dimension at Emerging. Scores stay hidden until the end so answers aren’t aimed at a tier, and the final report is a scorecard plus the single highest-impact next move per dimension, not a grade you file away.

When to use it

At annual-review time, before a calibration, or as a regular check-in either a manager or an SE runs on themselves. It works in two modes: self-assessment, or a manager rating someone on the team. Reach for it when you want a defensible read on where someone actually is, rather than a gut sense, and when you want that read to come with a next step the person can act on this quarter. This is the narrow version, scoped to AI fluency; for the whole job, the SE Performance Evaluator scores all eight dimensions.

Make it yours

The four dimensions are the Zapier V2 framework applied to sales engineering; swap them for your own competency model and the interview will follow. The example artifacts in each level (pre-call brief, RFP knowledge base, live demo environment) are the part to localize first, replace them with the assets your team actually builds, so the evidence test asks for things that exist in your world. If your bar is higher or lower, move the descriptions between levels rather than adding levels; five is enough to score cleanly.