CELPIP Speaking Task 3: Describe a Scene That Impresses Examiners
Task 3 shows you a picture and asks you to describe what is happening. Most test-takers list random objects. Below: the spatial organization method that scores CLB 9, real sample answers at three levels, and the habits that keep your descriptions underdeveloped.
What Is CELPIP Speaking Task 3? (Describing a Scene)
In Task 3, you see a picture and must describe everything in it. You have 30 seconds to prepare and 60 seconds to respond. The challenge is not identifying objects — it is organizing your description so it sounds complete and structured.
Preparation time
30 seconds with the picture visible. Do not memorize a word list — plan a spatial route (left to right, foreground to background) so your description has flow.
Response time
60 seconds. You need to fill the entire time with meaningful description. If you finish in 30 seconds, you missed details — and your task fulfillment score drops.
What examiners score
Coverage (did you describe enough of the picture?), organization (spatial or logical flow), vocabulary precision, and natural fluency. Random lists of objects get low coherence scores.
How to Use Your 30 Seconds (Prep Strategy)
Your 30 seconds should be spent planning a route through the picture, not memorizing words. The CLB 9 strategy guide covers how this spatial scanning skill applies across all speaking tasks.
Seconds 1–10: Identify the setting
What kind of place is this? A park, an office, a restaurant? Identifying the overall setting first gives your description a natural opening line: "This picture shows a busy farmers' market."
Seconds 10–20: Plan a spatial route
Decide how you will move through the image: left to right, foreground to background, or center outward. Pick one approach and commit to it. This prevents the random jumping that costs coherence points.
Seconds 20–30: Spot 2–3 interesting details
Find small details like facial expressions, unusual objects, or actions. These are what separate a CLB 7 description ("a man is sitting") from CLB 9 ("a man in the corner is reading a newspaper with his coffee getting cold").
The Spatial Description Structure That Scores CLB 9
This structure works for any scene description prompt. It keeps your response organized and prevents the random listing examiners penalize.
1. Open with the overall scene (10 sec)
Name the setting and the general mood or activity. "This picture shows a lively outdoor café on what appears to be a sunny afternoon." This tells the examiner you understand the big picture before diving into details.
2. Describe the main area (20 sec)
Move through the center or most prominent part of the image. Describe people, their actions, and objects. Use present continuous: "A woman is pouring coffee while a child next to her is drawing on a placemat."
3. Move to secondary areas (15 sec)
Shift to the edges or background. "In the background, there are several trees, and on the right side, two men appear to be having a conversation." Use spatial markers: "on the left," "in the background," "near the edge."
4. Add interpretation or detail (10 sec)
Comment on mood, weather, or speculate naturally: "The atmosphere seems relaxed — everyone looks like they are enjoying their Saturday." This shows higher-order language and fills remaining time.
CELPIP Speaking Task 3 Sample Answers at Three CLB Levels
Same picture prompt — notice how the description becomes more organized and detailed at higher levels.
Prompt: Describe the picture of a busy farmers' market.
"I see a market. There are people. Some fruits and vegetables. A man is selling things. There is a woman buying something. The weather is sunny. Yeah, that is the picture."
"This picture shows a farmers' market. There are several vendors with tables set up, selling fruits and vegetables. In the center, a woman is examining some tomatoes while a vendor is talking to her. On the left side, there is a man carrying a bag full of produce. There are also some children near a table with baked goods. The weather looks sunny and warm, and there are a lot of people walking around."
"This picture captures a vibrant outdoor farmers' market on what looks like a warm summer morning. In the foreground, there is a large produce stand overflowing with colorful fruits — I can see baskets of strawberries, some peaches, and what appear to be bunches of fresh herbs. A woman in a sun hat is carefully selecting tomatoes while chatting with the vendor, who seems to be recommending something to her. Moving to the right side, there is a smaller stand with artisan bread and pastries, where two children are pointing excitedly at something behind the glass display. In the background, several people are strolling between the stalls with reusable bags, and there is a musician sitting on a folding chair playing guitar near the entrance. The whole scene has a relaxed, community feel to it — the kind of Saturday morning market where people come as much for the atmosphere as for the shopping."
Examiner-Level Score Analysis: Why Each Response Gets Its Score
Description tasks are scored on coverage, organization, vocabulary precision, and fluency. Here is what separates each level.
CLB 4–5: Why This Description Fails
Coverage
Barely scratches the surface. "There are people. Some fruits and vegetables." — this could describe any market picture. The speaker notices almost nothing specific and finishes far too early.
Organization
No spatial route, no logical flow. Objects are listed randomly: people, fruits, a man, a woman, weather. There is no sense of moving through the picture — just a disconnected list.
Vocabulary
"Selling things," "buying something" — the vaguest possible descriptions. No specific produce names, no actions beyond "selling" and "buying," and no descriptive adjectives at all.
Fluency
Very short, choppy sentences. The response ends in about 20 seconds with "Yeah, that is the picture" — a hallmark of unprepared CLB 4–5 responses.
CLB 7–8: Solid Coverage, Mechanical Organization
This response covers the picture but feels like a checklist rather than a natural description. The CLB 9 strategy guide shows how to add the observational depth that pushes scores higher.
Coverage
Covers most major elements: vendors, customers, children, weather. But every detail is surface-level — "examining some tomatoes," "a man carrying a bag." There are no small, observant details that show genuine engagement with the image.
Organization
Uses some spatial markers ("in the center," "on the left side") — but the route feels mechanical rather than natural. Acceptable for CLB 7, but CLB 9 descriptions flow as though the speaker is walking through the scene.
Vocabulary
"Examining," "produce," "baked goods" — functional vocabulary. But there is nothing vivid or unexpected. Every word choice is the first that comes to mind, which is exactly the CLB 7–8 vocabulary ceiling.
Fluency
Adequate pacing and fills most of the time. But the delivery sounds like a report — sentence after sentence with no variation in tone or engagement.
CLB 9–12: What Top Descriptions Actually Sound Like
Coverage
Thorough — from produce specifics ("baskets of strawberries," "bunches of fresh herbs") to small details (musician on a folding chair, children pointing at pastries). The speaker notices things a casual observer might miss.
Organization
Natural spatial flow: foreground → right side → background. Transitions are organic ("Moving to the right side," "In the background"). The description feels like a camera panning across the scene.
Vocabulary
"Vibrant," "overflowing," "artisan bread," "strolling," "reusable bags" — precise, varied, and contextually appropriate. The final sentence ("come as much for the atmosphere as for the shopping") demonstrates genuine language control.
Fluency
Fills the full 60 seconds with natural pacing and genuine engagement. The speaker adds personal observations ("I can see," "what appear to be") that make the description conversational rather than robotic.
Common CELPIP Task 3 Mistakes That Kill Your Score
Most Task 3 mistakes are about description strategy, not language ability. These are the patterns that cost the most points.
Random listing without spatial flow
"There is a man. There is a table. There are some trees." Random lists destroy coherence scores. Always describe the scene by moving through it spatially — left to right, front to back, or center outward.
Only describing the obvious
Saying "people are at a market" when the picture clearly shows a market is stating the obvious. Examiners want to hear what is happening in the market, what specific things you notice, and how the scene feels.
Ignoring actions and interactions
Static descriptions ("There is a woman. There is a child.") miss the most scoreable content. Describe what people are doing and how they are interacting: "A woman is chatting with the vendor while her child tugs at her sleeve." If you tend to describe objects but not actions, a speaking coaching session can retrain this habit.
Using the same sentence structure repeatedly
"There is a man. There is a woman. There is a table." Repeating "There is" kills your vocabulary and fluency scores. Vary your structures: "A man is standing…" "On the left, I can see…" "Near the entrance, there appears to be…"
Finishing in 30 seconds
If you run out of things to say, you missed details. Use the remaining time to describe mood, weather, colors, or speculate about what people might be thinking. "The whole scene feels very relaxed" is better than silence.
How to Move from CLB 7 to CLB 9 in Task 3
If you can identify objects in the picture but your scores stay at CLB 7, the problem is depth of observation, not language level.
Stop doing this
Listing objects without describing actions or interactions
Using "There is" to start every sentence
Describing only large, obvious elements
Finishing before 60 seconds and having nothing more to say
Start doing this
Plan a spatial route and announce it naturally ("In the foreground…")
Describe people's actions, expressions, and interactions — not just their existence
Notice small details: colors, textures, unusual objects, body language
End with mood or atmosphere: "The whole scene feels…"
The key insight: CLB 9 descriptions do not use harder words — they notice more. Practice by describing any picture for 60 seconds and challenging yourself to include at least three details a casual observer would miss.
Continue Your CELPIP Speaking Preparation
← Task 2: Personal Experience
Learn the storytelling structure that keeps personal narratives on track.
Task 4: Making Predictions →
Master speculative language and organized future-oriented responses.
Full CLB 9 Strategy Guide
The complete system for reaching CLB 9 across all CELPIP sections.
All Speaking Tasks
Return to the speaking hub for all 8 task pages.
CELPIP Overview
See the full preparation system with all skill hubs and resources.
Private Speaking Coaching
Get one-on-one feedback on your descriptions, fluency, and structure.
Still Struggling with Task 3? Get Targeted Help
If your descriptions keep scoring CLB 7 despite good vocabulary, the problem is usually observation depth and organization — not grammar.
Private Speaking Coaching
Your tutor walks through picture descriptions with you in real time, showing exactly what details you miss and how to organize your spatial route.
Book a Speaking SessionWriting Correction
Descriptive skills transfer between speaking and writing. Improving your ability to describe scenes in writing often improves your spoken descriptions too.
Submit Your WritingWeekly CELPIP Webinar
Practice picture descriptions with other students and hear examiner-perspective feedback on what CLB 9 descriptions sound like.
Join This Week