Science of Reading · Phonics Teaching Tool

Elkonin boxes for phonics

Elkonin boxes are the single most effective tool for teaching a child to hear the sounds in a word. Here is exactly how to use them — and why they work.

By the Phonics Guide Editorial Team · Updated May 9, 2026 · 8 min read

What are Elkonin boxes?

Elkonin boxes — also called sound boxes or phoneme boxes — are a set of empty squares, one per sound, drawn on a piece of paper or shown on a screen. As a child says each sound in a word, they tap or slide a counter into the corresponding box. Once all the sounds are in their boxes, the child blends them together to read the whole word.

They were developed in the 1960s by Soviet psychologist Daniil Borisovich Elkonin, who was studying how children develop phonemic awareness — the ability to hear and manipulate the individual sounds in words. Decades of research since then have confirmed that this concrete, multisensory approach significantly accelerates reading development in young children.

Today, Elkonin boxes are a cornerstone of the Science of Reading — the research consensus that systematic, explicit phonics instruction is the most effective way to teach reading. Most US states now mandate SOR-aligned instruction in their literacy standards.

Sounds, not letters

The most important thing to understand about Elkonin boxes: each box represents one sound (phoneme), not one letter (grapheme).

English has 26 letters but approximately 44 distinct sounds. Many sounds are spelled with two or more letters working together. This is why counting letters gives the wrong number of boxes:

cat 3 letters → 3 boxes (c · a · t) ✓
ship 4 letters → 3 boxes (sh · i · p) — "sh" is one sound
night 5 letters → 3 boxes (n · igh · t) — "igh" is one sound
street 6 letters → 5 boxes (s · t · r · ee · t)
caught 6 letters → 3 boxes (c · augh · t) — "augh" is one sound

How to use Elkonin boxes: step by step

You do not need special materials. A piece of paper and a pencil works fine.

  1. 1
    Choose a word. Start with 2 or 3-sound words for beginners: at, up, cat, dog, big.
  2. 2
    Draw the boxes. Draw one empty square per sound. For "cat," draw 3 boxes side by side.
  3. 3
    Say the word slowly. Stretch it out: "ccc-aaa-t." Ask your child: "How many sounds do you hear?"
  4. 4
    Tap or push a counter into each box. Use a coin, button, or just your finger. One tap per sound: "c" → tap, "a" → tap, "t" → tap.
  5. 5
    Blend. Run your finger under all the boxes as your child says the full word: "cat."
  6. 6
    Write the letters. Once the child can segment reliably, add a writing step: write the grapheme for each sound in its box (e.g., "sh" in one box, "i" in the next, "p" in the last for "ship").

Progression: what order to teach

Start with the simplest words and build up as your child gains confidence:

Stage 1 2-sound words at, up, it, ox, am — pure vowel + consonant. Best starting point. Browse 2-sound words →
Stage 2 3-sound CVC words cat, dog, big, run — consonant-vowel-consonant. Core of early reading. Browse 3-sound words →
Stage 3 Consonant blends stop, frog, best — two consonants at the start or end. 4-sound words. Browse 4-sound words →
Stage 4 Digraphs & vowel teams ship, night, rain — two or more letters that make one sound. Browse S words →

Worked examples: three words, three levels

Below are three concrete walkthroughs — from the simplest CVC pattern up to a word with a tricky consonant digraph and a vowel team. Read these aloud with your child, tapping each box in turn.

Example 1 — "dog" (3-letter CVC, 3 sounds)

Draw three empty boxes. Say the word slowly: "d…o…g." Tap box one for the /d/ sound, box two for the short /o/ sound, box three for the /g/ sound. Slide your finger under all three and say the whole word: "dog." Then ask your child to write the letter that matches each sound — one letter per box. This is the pattern used by every decodable book for beginning readers. Try it with cat, pig, run, and bed.

Example 2 — "ship" (4 letters, 3 sounds)

This is where most children's first real phonics insight happens. Draw three boxes — not four. The "sh" at the start is a consonant digraph: two letters working together to make a single /sh/ sound. Tap box one for /sh/, box two for the short /i/ sound, box three for /p/. When you write the letters in, put "sh" together in the first box; this makes the grapheme–phoneme correspondence visible. Try the same pattern with chop, that, when, and phone.

Example 3 — "night" (5 letters, 3 sounds)

Draw three boxes again. The first box takes /n/. The second takes "igh," a three-letter vowel team making the long-I sound — yes, all three letters go into one box. The third takes /t/. Children are often surprised that a five-letter word can have just three sounds; once they see this pattern, they start spotting it everywhere. Try it with light, right, high, and sight.

Research evidence: what the studies show

The Elkonin method is one of the most heavily studied interventions in early literacy. Three findings are especially important for parents and teachers:

  • Effect size on phonemic awareness is large. The National Reading Panel (2000) meta-analysis found that phonemic awareness instruction using manipulative tools like sound boxes produced an effect size of d = 0.86 on phonemic awareness outcomes — one of the largest effects in education research.
  • Combining sounds with letters multiplies the benefit. The same review found that PA instruction including written letters (which is exactly what Elkonin boxes do in step 6 of our walkthrough) produced an effect size of d = 0.67 on later reading — compared to d = 0.38 for oral-only PA instruction. Writing the letters into the boxes is not optional; it is where the power of the method lives.
  • Short, frequent sessions work best. The strongest effects in the literature come from sessions of 15–20 minutes, run 3–5 times per week, over 10–18 weeks. Longer single sessions do not help more; shorter, consistent practice does.

If you want to go deeper, the most cited primary source is Elkonin's own 1963 paper The Psychology of Mastering the Elements of Reading, later translated and summarised by Ehri and colleagues. For classroom-ready applications, see the Reading Rockets guide to Elkonin boxes.

Common mistakes parents make

Sound boxes are powerful, but they are also easy to use incorrectly. Here are the three mistakes we see most often when we watch parents try this at home for the first time:

1. Drawing one box per letter

Five letters in "night," so five boxes — right? No. One box per sound, not per letter. This is the single most common mistake and undermines the entire point of the exercise. If you are unsure how many boxes a word needs, look it up on this site — every word page shows the correct count.

2. Stretching sounds that don't stretch

"C…a…t" works because all three sounds can be held: /ccc/-/aaa/-/t/. But stop sounds like /p/, /b/, /t/, /d/, /k/, /g/ cannot be stretched without distorting them into /puh/, /buh/, /tuh/. Teach your child to say them cleanly and briefly, without the extra "uh."

3. Skipping the blending step

Tapping each sound in turn is segmenting; reading requires blending those sounds back into a whole word. Always end the exercise by running your finger smoothly under all the boxes as your child says the full word. This is the step most often rushed or forgotten at home.

How Elkonin boxes fit with other phonics methods

Elkonin boxes are a phonemic awareness tool — they help children hear, manipulate, and map the sounds in spoken words. They do not replace a full phonics curriculum; they sit inside one.

In a structured literacy classroom, sound boxes are typically used in the first ten minutes of a phonics lesson, right after letter-sound review and before children read a decodable text. In Orton-Gillingham and related structured programmes, the box-tapping step is often called "finger spelling" or "tap and spell." In Jolly Phonics and its Indian school equivalents, the same principle is baked into the "sound button" tradition — tapping dots under each sound as you decode.

What matters most is that your child segments and blends every day, with a systematic set of words that get gradually harder. The method is a tool; consistency is the engine.

Why Elkonin boxes work

Reading requires two skills working together: phonemic awareness (hearing sounds in spoken words) and decoding (mapping sounds to letters). Many children struggle with decoding not because they cannot recognise letters, but because they have not yet developed a stable sense of the individual sounds in words.

Elkonin boxes provide a concrete, physical representation of an abstract concept. When a child physically taps a counter into a box, they are doing something with their hands that matches what their voice is doing — and this multisensory connection accelerates the neural pathways that underpin phonemic awareness.

A landmark meta-analysis by the National Reading Panel (2000) found that phonemic awareness instruction — especially when it includes both oral and written components — produced significantly better reading outcomes than instruction without it (effect size d = 0.86 on phonemic awareness, d = 0.53 on reading). The Elkonin method, which adds the written letter dimension to oral segmentation, is consistently among the highest-effect interventions studied.

Frequently asked questions

What age are Elkonin boxes for?
Most commonly ages 4–7, in pre-K through Grade 2. Introduce them when a child is starting to segment spoken words into sounds — usually around age 5.
How many boxes does each word need?
One box per phoneme (sound), not per letter. "Ship" = 3 boxes (sh · i · p). "Night" = 3 boxes (n · igh · t). Use this site to look up any word and see the correct box count.
Do Elkonin boxes work for Indian children learning English?
Yes. Sound boxes are based on how letters map to sounds in written English, not on any specific accent. Indian children benefit from the same approach — the grapheme chunking used on this site is dialect-neutral.
Can I use this website instead of drawing boxes?
Yes. Every word on this site has a pre-built interactive sound box. Search any word on the homepage or browse by letter or difficulty level.
How long should a sound-box session last?
Short and frequent beats long and occasional. Aim for 10–15 minutes, 3–5 times a week. The research on phonemic awareness instruction consistently finds that brief, consistent practice outperforms longer but irregular sessions. If your child is engaged for 20 minutes, great — but don't push past the point of enjoyment.
What do I do if my child gets the number of sounds wrong?
Say the word slowly yourself, exaggerating each sound: "nnn…iii…t" for "night." Ask your child to hold up a finger for each sound they hear. If they still mis-count, that's useful information — it usually means the word contains a digraph or vowel team they haven't seen yet. Look up the word on this site to see the correct chunking, then teach the new grapheme explicitly before continuing.
Are Elkonin boxes and sound boxes the same thing?
Yes — they are used interchangeably. "Elkonin boxes" refers to the method as developed by Daniil Elkonin; "sound boxes," "phoneme boxes," and "phoneme frames" are the plain-English names you'll see in UK and US classrooms. All four terms describe the same technique.
Can older children benefit from sound boxes, or is it only for early readers?
Older children — especially those who are behind grade level in reading — often benefit dramatically. For struggling readers aged 7+, sound boxes expose the phonemic structure their earlier instruction missed. Many Orton-Gillingham tutors use them with children up to age 12 when decoding gaps are identified.

Try it now — look up any word

Search our library of 12,000+ words to see Elkonin sound boxes instantly.