Listening is the Foundation Everything Else Stands On
Reading builds vocabulary faster. But listening builds everything else. And if you had to choose one activity to do more of, this is the one.
I talk about reading a lot. I recommend LingQ in almost every post. I advocate morning reading sessions. I’ve written about how reading builds vocabulary, installs grammar, develops spelling instinct, and creates the deep implicit knowledge that fluency runs on.
I stand behind all of it. Reading is extraordinary.
But if I look honestly at what contributed most to my own Spanish fluency, and if I look at the learners I work with on iTalki who progress fastest, one activity stands above the rest. Not reading. Listening. Specifically, the sheer volume of listening.
The learners with the best comprehension, the most natural pronunciation, the smoothest speaking rhythm, and the fastest real-time processing are, almost without exception, the ones who listen the most. Not the ones who read the most. Not the ones who study the most. The ones whose ears have absorbed the most hours of real English.
What Listening Gives You That Reading Can’t
Reading and listening are both input. Both feed the implicit system. Both build vocabulary and grammar. But they feed different dimensions of your English, and the dimensions that listening feeds are the ones that conversation depends on most directly.
Processing speed. When you read, you control the pace. You can slow down. Re-read a sentence. Pause on a word. The processing happens at your speed. When you listen, you process at the speaker’s speed. Their pace is the pace. Your brain has to keep up or fall behind. This real-time processing demand, repeated across hundreds of hours, trains your brain to handle English at conversational speed. Reading doesn’t train this. Only listening does.
Connected speech. Written English presents words as separate units with spaces between them. Spoken English connects, blurs, reduces, and links words in ways that bear little resemblance to what appears on the page. “What do you want to do?” becomes “whaddaya wanna do?” in natural speech. A learner who reads extensively but listens minimally will understand the written version and be lost by the spoken one. Only hours of listening teach your brain to decode real spoken English.
Rhythm and stress. English is a stress-timed language. The musicality of it, which syllables carry weight, which words are emphasised, which are swallowed, determines both comprehension and naturalness. This dimension is invisible on the page. The word “comfortable” has four syllables when you read it and three when a native speaker says it. The only way to absorb English rhythm is through your ears.
Intonation. The rise and fall of English sentences carries meaning that the words don’t. A rising pitch turns a statement into a question. A falling pitch signals certainty. A flat delivery where a native speaker would rise sounds robotic. Intonation is absorbed through listening. There is no reading equivalent.
Reduced forms. Native speakers compress unstressed words almost to nothing. “Him” becomes “’im.” “Have” becomes “’ve.” “Can” sounds like “c’n.” A learner who hasn’t heard these reductions for hundreds of hours finds real English bewilderingly fast. It’s not fast. It’s reduced. And the only teacher for reductions is the ear.
Pronunciation templates. As we discussed in our post on how your ears must learn before your mouth can speak, your mouth produces what your ear has learned to hear. Every word you’ll ever pronounce correctly was first heard correctly, many times, through listening. Your ear stores a template. Your mouth aims for it. No listening means no template. No template means your mouth defaults to your native language’s best guess.
Reading’s Superpower (And Its Limitation)
Reading has one clear advantage over listening: vocabulary acquisition density. In twenty minutes of reading on LingQ, you encounter more unique words than in twenty minutes of listening. Written content tends to use a wider vocabulary range than spoken content. And the ability to pause on unknown words, look them up, and see them in written context makes reading the faster vocabulary builder.
Reading also installs spelling, teaches written grammar patterns, and develops a feel for how English is structured on the page. For writing ability specifically, reading is irreplaceable.
But reading has a limitation that matters more than most learners realise: it builds a silent English. An English of the eyes. An English that exists on the page and in the mind but that has no sound attached to it.
A learner who reads extensively but listens minimally develops a curious split. Large vocabulary. Strong grammar intuition. Good comprehension of written text. And a spoken English that sounds nothing like what native speakers produce, because the ear never absorbed the templates that spoken production depends on.
I’ve seen this split in my students. Impressive readers whose spoken English is flat, monotone, and accented in ways that reveal their mouth is guessing rather than reproducing. The vocabulary is there. The grammar is there. The sound is missing. Because the sound only comes from listening.
The Ratio
If you have an hour a day for English practice, here’s how I’d split it based on what I’ve seen work.
Forty minutes listening. Twenty minutes reading.
If you have ninety minutes, sixty listening, thirty reading. If you have thirty minutes, twenty listening, ten reading.
The listening takes the larger share because it’s training more systems simultaneously: comprehension, processing speed, pronunciation templates, rhythm, intonation, connected speech decoding, and reduced form recognition. Reading trains vocabulary and written grammar more efficiently, but the dimensions it doesn’t touch are the dimensions that conversation depends on most.
And listening fits into life more easily. You can listen while walking, commuting, cooking, cleaning, exercising, shopping. Reading demands your eyes and your attention exclusively. Listening rides alongside other activities. This means the total listening hours you can accumulate in a day are naturally higher than reading hours, without requiring any additional dedicated time.
A learner who reads for twenty minutes each morning and listens for an hour throughout the day, spread across a commute, a walk, and some household tasks, is investing eighty minutes total with a sixty-forty split toward listening. Over a year, that’s roughly 365 hours of listening and 120 hours of reading. Both substantial. Both productive. But the listening volume is where the spoken fluency is primarily built.
The Listening Lifestyle
The learners I’ve watched progress fastest didn’t just add listening to their practice. They made it the background texture of their day.
English podcast on from the moment the headphones go in until the moment they come out. Walking to the train: English. On the train: English. Walking from the train: English. Cooking dinner: English. Folding laundry: English. Saturday morning run: English.
None of these require dedicated study time. They require a pair of headphones and the decision to fill silence with English instead of with native-language content or with nothing.
The accumulation is staggering. A learner who listens for ninety minutes a day across these scattered moments accumulates over 500 hours of English listening in a year. At that volume, spoken English stops being something foreign and starts being something familiar. The connected speech that was impenetrable at month two is transparent at month twelve. The speed that felt overwhelming becomes normal. The reductions that confused you become expected.
This is what immersion sounds like for someone who doesn’t live in an English-speaking country. Not a classroom. Not a study session. Just English in your ears, woven through the fabric of your ordinary day, accumulating silently into the comprehension and the pronunciation instinct that fluency requires.
Reading Deepens. Listening Widens.
Here’s how I’ve come to think about the relationship.
Reading deepens your English. It takes you into vocabulary you’d never hear in casual speech. It exposes you to complex sentence structures that conversation rarely produces. It builds the literary, academic, and formal dimensions of the language that sophisticated English requires.
Listening widens your English. It covers more of the language as it’s actually used in the world. The casual. The fast. The messy. The real. The English of arguments and laughter and boredom and excitement. The English that happens between real people at real speed in real life.
You need both. A learner who only reads has deep but narrow English. A learner who only listens has wide but shallow English. The combination produces depth and breadth. But if the balance tips, I’d tip it toward listening, because the wideness that listening builds is the wideness that real-world communication operates in.
Fill Your Ears First
If your English practice currently leans heavily toward reading with listening as an afterthought, consider reversing the ratio. Not abandoning reading. Increasing listening.
Find three or four podcasts you genuinely enjoy. Subscribe. Let them fill your commute, your walks, your kitchen time. Watch a show on Lingopie every evening instead of every other evening. Add an audiobook for longer journeys.
Keep your morning reading session on LingQ. It’s building something that listening can’t replace. But recognise that the twenty minutes of reading, as valuable as they are, contribute less to your spoken fluency than the sixty minutes of listening that fill the gaps in your day.
The ears build the English that the mouth eventually speaks. Reading gives you the words. Listening gives you the sound, the speed, the rhythm, and the feel. Both matter. The ears just need more hours than the eyes.
Fill your ears first. Your mouth will thank you for it when the time comes to speak.
For the reading that builds the deep vocabulary your listening draws from, LingQ is the tool I recommend above all others.
For absorbing natural English through real TV shows and films with interactive subtitles, Lingopie turns your screen time into genuine acquisition.
For training your mouth to produce the English your ears have absorbed, Glossika builds speaking through structured repetition. Available in both British and American English.
If you want compelling, story-driven English listening at just the right level, Olly Richards’ Conversations course is well worth exploring.
If you’re ready to discover what all those listening hours have built, iTalki is where I’d start. And if you’d like to work with me, book a trial lesson here.
Thanks for reading. I’m Richard, a New Zealand-based conversation partner who thinks ears deserve more credit than they get in language learning.
Some links on this blog are affiliate links. I earn a small commission if you sign up, at no cost to you. I recommend these products because I genuinely believe in them.


