How to Run an ESL Demo Lesson: A School Rubric and Scoring Guide
Why the Demo Lesson Is Your Most Valuable Hiring Tool
A polished CV and a confident interview answer tell you what a candidate says they can do. A demo lesson tells you what they can actually do — in front of learners, under mild pressure, in real time.
For ESL teacher hiring, the demo lesson is the highest-signal evaluation tool available. Schools that skip it in favour of interview alone are making a hiring decision with significantly less information than they could have. Schools that run demos without a consistent scoring framework are making subjective judgements that are hard to defend, hard to compare across candidates, and easy to get wrong.
This guide provides a structured approach to ESL demo lesson assessment — covering what to observe, a four-point scoring rubric across five criteria, debrief methodology, and the red flags that should give any assessor pause.
Setting Up the Demo Lesson for Success
A poorly designed demo brief produces poor signal. Before the rubric matters, the setup must be right.
Give a Structured Brief — 24 to 48 Hours in Advance
Provide the candidate with:
- The student level (e.g., B1 intermediate, Young Learners ages 9–11, adult business English)
- The lesson focus (e.g., introducing present perfect, a reading comprehension on a given topic, a speaking activity around a vocabulary set)
- The time available (e.g., 20 minutes)
- Classroom setup details (available materials, projector, whiteboard)
- Any specific expectation (e.g., include a communicative activity)
A 24–48 hour window tests planning ability, preparation habits, and how a teacher interprets a brief — all relevant signals.
Use Real Students Where Possible
Observing a teacher with real learners reveals how they handle the unpredictability of genuine classroom dynamics: a student who doesn't understand, an off-topic question, two students chatting. This is the core of what you need to see.
Assign the Same Brief to All Candidates
Consistency in briefs makes cross-candidate comparison meaningful. If Candidate A taught an upper-intermediate adult group and Candidate B taught beginner young learners, your rubric scores are not directly comparable.
The Five Assessment Criteria
Effective ESL demo lesson evaluation covers five distinct dimensions of teaching quality. Together they provide a comprehensive picture of competence without being so granular as to be impractical.
1. Lesson Planning and Structure
Does the lesson have a clear beginning, middle, and end? Is there an identifiable objective? Does the candidate build from simpler to more complex tasks? Is time managed sensibly across the lesson arc?
2. Language Use and Instruction Clarity
Is teacher talk appropriate for the level? Are instructions given clearly and in manageable steps? Does the teacher check comprehension of instructions before releasing students to a task? Is metalanguage (grammar terms, correction language) used purposefully rather than confusingly?
3. Student Engagement and Interaction
Does the teacher involve all students, or do some disappear? Is there meaningful student-to-student interaction as well as teacher-to-student? Does the teacher generate genuine communicative demand, or is the lesson a series of teacher presentations?
4. Classroom Management and Presence
Does the teacher maintain a calm, positive environment? Can they manage minor disruptions without losing momentum? Is their physical presence in the room purposeful — do they move, make eye contact, use space effectively?
5. Adaptability and Real-Time Responsiveness
This is the criterion that separates good teachers from great ones. When a student doesn't understand, does the teacher re-explain using a different approach? When the plan encounters unexpected difficulty (a task falls flat, a concept confuses the group), does the teacher adapt or push through regardless?
Scoring Rubric
Use the rubric below to score each criterion from 1 to 4. Record scores per criterion during or immediately after the lesson, before the debrief, to preserve an uncontaminated observation.
| Criterion | 1 — Poor | 2 — Acceptable | 3 — Good | 4 — Excellent |
|---|---|---|---|---|
| Lesson Planning & Structure | No clear structure; no identifiable objective; time management absent | Basic structure visible; objective loosely implied; transitions awkward | Clear objective stated or evident; structured progression; reasonable pacing | Tight, purposeful structure; objective explicitly staged; excellent time management; logical lesson arc |
| Language Use & Clarity | Instructions unclear or at wrong level; teacher talk dominates and confuses | Instructions mostly clear; occasional level mismatches; some unnecessary complexity | Level-appropriate language throughout; clear instructions with comprehension checks | Precise, differentiated language use; instructions modelled as well as stated; metalanguage used masterfully |
| Student Engagement & Interaction | Students passive throughout; teacher talks at group; no student-to-student interaction | Some student involvement; mostly teacher-fronted; limited interaction variety | Students regularly engaged; mix of interaction patterns (T→S, S→S); genuine participation | All students visibly engaged; sustained communicative activity; high student-talk ratio; creative interaction design |
| Classroom Management & Presence | Environment disorganised; candidate appears unsettled or unable to maintain focus | Basic control; minor disruptions handled slowly or ineffectively | Calm, positive environment maintained; disruptions managed without losing momentum | Exceptional presence; proactive management; room feels purposeful and energised; high trust with learners |
| Adaptability & Responsiveness | Pushes through regardless of comprehension; no adjustment observed | Some responsiveness but defaults to repetition; limited range of repair strategies | Clear real-time adjustment when needed; at least one repair strategy used effectively | Multiple adaptive moves observed; flexible re-sequencing; genuinely reads the room; adjusts to individual learner needs |
Scoring guide:
- 18–20: Exceptional — strong hire signal
- 14–17: Good — hire with standard onboarding
- 10–13: Moderate — consider for junior roles or re-evaluate with additional evidence
- Below 10: Significant concerns — proceed with caution
What to Observe at Each Stage
Warm-Up (first 3–5 minutes)
- Does the candidate settle students and establish a positive register?
- Is there a meaningful activation of prior knowledge or topic introduction?
- Is the pace of the warm-up calibrated to the class?
Main Teaching/Activity Phase (10–15 minutes)
- How are instructions delivered? (Watch for: reading from notes, unclear sequencing, no check for understanding)
- Is the task genuinely communicative or primarily mechanical?
- How does the teacher handle errors? (Watch for: overcorrection that kills fluency, or zero correction that signals low expectations)
- Does the teacher circulate during student activities, or stand at the front?
Closure and Review (last 3–5 minutes)
- Is there a genuine close, or does the lesson simply stop when time runs out?
- Does the teacher elicit what was learned, rather than just summarising?
- Is homework or a follow-up activity suggested in a motivating way?
Red Flags to Watch For
The following behaviours, regardless of overall rubric score, warrant serious discussion before proceeding to offer:
- Excessive L1 use with monolingual classes — understandable in extremis, but habitual L1 reliance suggests the teacher lacks confidence in English-medium delivery
- Ignoring student confusion — continuing when it's visually clear students are lost, without any repair attempt, signals low learner-awareness
- Talking down to students — condescending tone, excessive simplification that infantilises adult learners, or dismissive responses to incorrect answers
- Reading from notes throughout — suggests low lesson ownership and will not translate to dynamic real-world delivery
- Inability to manage even mild disruption — in a demo setting, students are typically on their best behaviour; if management fails here, it will fail under normal conditions
- No flexibility whatsoever — delivering a scripted lesson regardless of student response suggests a candidate who cannot function outside a controlled environment
A candidate with one red flag alongside strong rubric scores can still be a good hire; use the debrief to probe the specific area. Multiple red flags alongside moderate scores is a clear signal to pass.
Running an Effective Debrief
The post-demo debrief is a second evaluation in its own right. How a teacher reflects on their own performance tells you about their professional maturity, self-awareness, and capacity for growth — all strong predictors of long-term effectiveness.
Recommended debrief structure (15 minutes):
- Open with a positive observation — name one specific thing that worked well. This establishes a constructive tone and signals that you were genuinely paying attention.
- Ask a self-assessment question: "How do you feel it went? Was there a moment you'd handle differently?" Listen for specificity and honesty versus vagueness or defensiveness.
- Probe one area from your rubric: "I noticed you moved on quite quickly after the reading task — what was your thinking there?" This surfaces reasoning, not just outcome.
- Ask a forward-thinking question: "If you were to teach this group again next week, what would you do with the same students as a follow-up?" This tests lesson coherence thinking.
- Allow the candidate to ask questions — what they ask about your school, your students, or your curriculum signals what they care about.
Standardising Across Multiple Assessors
When multiple stakeholders observe the same demo (academic director, department head, HR), independent scoring before a joint discussion produces better decisions than a group consensus formed during the observation. Run the following process:
- All observers complete their individual rubric scores during or immediately after the lesson.
- Observers debrief privately (without the candidate) and compare scores.
- Significant discrepancies (2+ points on any criterion) are discussed — these often surface genuine interpretive differences worth resolving.
- A joint decision is made based on combined evidence.
This approach also produces a written record of the evaluation, which is useful both for internal accountability and for responding to candidate queries about decision-making.
Integrating Demo Lessons into Your Hiring Workflow
The demo lesson works best as the second or third step in a structured process:
- Profile screening — review qualifications, experience, and certifications
- Initial interview — assess communication style, motivation, cultural fit
- Demo lesson — assess teaching competence directly
- Reference check — verify claims from steps 1–3
- Offer
Platforms like JobRovers let school HR managers browse vetted teachers on JobRovers with structured profiles that accelerate step one — reducing the time you spend on profile screening so more resource goes to steps two and three where quality decisions are actually made.
For guidance on attracting candidates who are worth progressing to a demo stage, see how to attract better ESL teachers. For what to do with the teacher you hire after the demo, see onboarding international teachers successfully.
Hiring great teachers?
Browse vetted, ready-to-hire teachers on JobRovers — and reach out directly.
Browse teachersFrequently asked
How long should an ESL demo lesson be?
The standard range is 20–30 minutes for a demo lesson, which is enough time to observe lesson structure, instruction delivery, and real-time responsiveness without disrupting a full class period. Some international schools use a 45-minute format that includes a 15-minute teaching segment, a 15-minute observed student activity, and a 15-minute debrief with the assessor. Avoid demos under 15 minutes — they are too short to assess anything beyond surface presentation skills.
Should the demo lesson use real students or adult role players?
Real students produce the most authentic and useful observation data — you see how the teacher actually responds to children's energy, confusion, and unpredictability. Where using real students isn't practical, a small group of staff playing the role of students at an appropriate level is the preferred alternative. Avoid purely theoretical demos (presenting a lesson plan without actually teaching) — they assess planning, not teaching.
What level and topic should schools assign for the demo lesson?
Assign a level and topic that matches the actual class the teacher would be hired to teach — this produces the most relevant signal. Provide the brief 24–48 hours in advance, which tests planning ability alongside delivery. If you need to compare multiple candidates on the same criteria, using an identical brief for all candidates makes scoring more consistent and comparable.
What are the biggest red flags in an ESL demo lesson?
The most serious red flags include: talking at students rather than engaging them (lecture-mode for the full demo); inability to adjust explanation when students don't understand; overuse of L1 or complex English with low-level learners; no clear lesson structure (no warm-up, no clear objective, no closure); inability to manage even minor classroom disruption; and patronising or dismissive responses to student errors. A candidate who presents a beautiful lesson plan but cannot execute flexibly in front of real students is a significant hire risk.
How should schools debrief candidates after a demo lesson?
The debrief serves two purposes: gathering additional signal (how a teacher reflects on their own performance is itself a strong indicator of professional maturity) and treating the candidate with respect. Effective debrief questions include: What do you think went well? What would you do differently? How did you decide on that approach for that level? Was there a moment where you adjusted your plan — what triggered that? Avoid leading questions that telegraph the 'correct' answer.
Can a candidate score well on a rubric but still be a wrong hire?
Yes — this is why the rubric should be combined with cultural fit and team dynamics assessment, not used as the sole decision tool. A candidate who scores strongly on lesson craft but whose communication style creates friction with colleagues or whose values don't align with the school's ethos will still be a problematic hire. The rubric surfaces teaching competence; cultural alignment requires different assessment methods, including stakeholder interviews and school visit observations.

