Subtle vowel and consonant distinctions
At C1 you learned the broad strokes of AmE phonology — rhoticity, flapping of intervocalic /t/, the cot-caught merger as a general AmE trend. At C2 the work is fine-grained: knowing exactly where each merger is and isn’t, recognizing the conditioning environments for /æ/-tensing in NYC and Philadelphia, distinguishing the three realizations of intervocalic /t/ (aspirated, flap, glottal), and understanding the subtle vowel substitutions that distinguish regional varieties even in otherwise “standard” speech.
These distinctions matter for two reasons. First, comprehension — you encounter speakers from different regions constantly, and the phonetic features differ in patterned ways. Second, production — your own AmE production tends toward a particular regional substrate (often a hybrid GA), and at C2 you should know which features you are producing and whether they cohere into a recognizable variety or read as a mishmash.
Russian L1 speakers face a specific challenge here: Russian phonology lacks the relevant English contrasts (no /æ/ vs /ɛ/, no /ɑ/ vs /ɔ/ as low back vowels, no flap allophone of /t/), so the entire English fine-grain has to be installed from scratch. This lesson maps the major C2-level distinctions and where Russian-trained speakers should focus.
Academic vocabulary pronunciation at C1 Advanced vowel distinctions (B2)1. The cot-caught merger and its geography
The cot-caught merger collapses /ɑ/ and /ɔ/ into a single low-back vowel, usually realized as /ɑ/. Words like cot and caught, Don and Dawn, stock and stalk become homophones.
Merged regions:
- Western US (California, Pacific Northwest, Rockies, much of the Southwest).
- Canadian English (universally merged).
- Western Pennsylvania (Pittsburgh).
- Boston and much of Eastern New England (with /ɔ/-like quality).
- Younger speakers in many traditionally non-merged regions are merging.
Non-merged regions:
- New York City (strongly non-merged, with raised /ɔ/).
- Philadelphia, Baltimore, the Mid-Atlantic.
- Inland North (Chicago, Detroit, Buffalo).
- Much of the Deep South (with characteristic Southern offglides).
- Older Northeastern speakers generally.
For a non-merged speaker, cot is /kɑt/ and caught is /kɔt/ — these are different words, and the distinction is mandatory. For a merged speaker, both are /kɑt/.
At C2 you should:
- Recognize both systems in input and parse correctly.
- Choose one for your own production based on what GA-target you are working with. Most Russian-trained speakers default toward a merged system (since the distinction is hard to acquire) but should be aware they are producing the merged variant.
In IPA, the merged vowel is typically /ɑ/, sometimes /ɔ/ depending on region.
Cot-caught self-test
Read the following minimal pairs aloud and decide whether you produce them as distinct or merged:
- cot / caught
- Don / Dawn
- stock / stalk
- body / bawdy
- collar / caller
- knotty / naughty
If they all sound identical, you produce the merged system (universal in the West and Canada). If they sound different, you produce the non-merged system (NYC, Philly, Inland North, Deep South). Most Russian-trained speakers default to merged, since the distinction is hard to acquire.
2. The pin-pen merger
In Southern AmE and some adjacent areas, /ɪ/ and /ɛ/ merge before nasal consonants (/m, n, ŋ/). Pin and pen, bin and Ben, since and cents become homophones (typically realized as /ɪ/).
Affected regions:
- The entire South (universally merged).
- Much of the Lower Midwest.
- AAE generally.
- Spread to some Western communities through migration.
Unaffected regions:
- The Northeast.
- The Inland North.
- Most of the West.
For a Southern speaker, I need a pen is /aː niːd ə pɪn/ — pen sounds like pin. To disambiguate, Southerners often specify ink pen (vs stick pin).
This merger affects only pre-nasal /ɪ/-/ɛ/; in other environments (pit/pet, sit/set) the distinction is preserved.
Pin-pen disambiguation strategies
For comprehension in pin-pen-merged speech: use context to distinguish. In I need an ink pen, ink is the disambiguator; in send me a text, the context makes the meaning clear. AAE inherits this merger; Southern AmE has it universally.
For production: if your target is General American, distinguish */ɪ/ and /ɛ/ before nasals (pin /pɪn/ vs pen /pɛn/). If targeting Southern, merge them (both /pɪn/).
3. /æ/-tensing in NYC and Philadelphia
In NYC and Philadelphia, the vowel /æ/ has split into two distinct phonemes through a process called /æ/-tensing:
- Tense /æ/ (sometimes notated /eə/ or /ɛə/) — used in some words and environments.
- Lax /æ/ (the standard GA /æ/) — used in others.
The conditioning environments differ between NYC and Philadelphia and constitute one of the most complex sound patterns in AmE.
NYC system (simplified):
- Tense before /m, n, f, θ, s, ʃ/ in monosyllables and word-finally before these consonants: bad, man, half, bath, glass, ash → /bɛəd, mɛən/.
- Lax before /p, t, k, b, d, g, ŋ/ and before vowels: cap, bat, back, bag, sang, marry → /kæp, bæt/.
- Lax before /m, n/ in some specific irregular words (ham, swam, ran, began) — these are lexical exceptions.
Philadelphia system (similar but with different exceptions):
- Tense in many of the same environments as NYC.
- But /æ/ in bad, mad, glad is tense, while in sad it is lax (a famous Philadelphia minimal triple).
- Planet is lax; planning is tense.
The NYC/Philly /æ/ split is one of the most distinctive features of these dialects and a key marker of authentic local speech.
For comprehension: when you hear a NYC speaker say bad, expect /bɛəd/-like quality, not /bæd/. For production: most Russian-trained speakers should target a single GA /æ/ and not attempt to acquire the NYC/Philly split, which is genuinely complex.
/æ/-tensing comprehension test
If you hear a NYC speaker produce bad, mad, glad, sad — listen for whether all four have tense /ɛə/ quality, or whether sad sounds different (lax /æ/). In Philly, sad is lax while the other three are tense, producing a famous minimal triple distinction. NYC tenses all four.
For most Russian-trained C2 speakers, the appropriate target is a single GA /æ/, applied uniformly. The NYC/Philly split is too complex to acquire productively without immersion.
4. Intervocalic /t/: aspirated, flap, glottal
A defining feature of AmE is the variable realization of /t/ between vowels. Three main allophones:
Flap /ɾ/ — the default in unstressed environments after a stressed vowel.
- Better /bɛɾɚ/, city /sɪɾi/, butter /bʌɾɚ/, water /ˈwɔɾɚ/ ~ /ˈwɑɾɚ/ (depending on cot-caught merger status; merged-AmE speakers use /ɑ/), atom /æɾəm/.
- Across word boundaries too: get it /gɛɾɪt/, what about /wʌɾəbaʊt/.
Glottal stop /ʔ/ — used before syllabic /n/ and word-finally before consonants in some varieties.
- Button /bʌʔn/, mountain /maʊʔn/, kitten /kɪʔn/.
- Hit me /hɪʔ mi/, that one /ðæʔ wʌn/.
- In some young AmE speakers, glottal stop is encroaching on flap territory: better /bɛʔɚ/ — but this is still marked.
Aspirated [tʰ] — at the start of stressed syllables and word-initially.
- Top [tʰɑp], time [tʰaɪm], attack [əˈtʰæk] (stressed syllable).
The distinction matters because:
- Russian-trained speakers often produce released, aspirated [tʰ] in every position, including intervocalic — sounding clearly non-American.
- Producing flap requires a specific tongue-tip articulation similar to a quick /d/.
- The glottal stop variant is used in specific environments and overusing it sounds British or affected.
At C2, you should reliably produce the flap in unstressed intervocalic positions. The aspirated [tʰ] should be reserved for stressed-syllable onsets. The glottal stop should appear before /n/ in button, mountain, important.
Intervocalic /t/ drill
Produce each word and phrase with the correct AmE allophone:
| Spelling | AmE /t/ realization | IPA |
|---|---|---|
| city | flap | /sɪɾi/ |
| better | flap | /bɛɾɚ/ |
| bottom | flap | /bɑɾəm/ |
| getting | flap | /gɛɾɪŋ/ |
| button | glottal + syllabic n | /bʌʔn̩/ |
| mountain | glottal + syllabic n | /maʊʔn̩/ |
| kitten | glottal + syllabic n | /kɪʔn̩/ |
| certain | glottal + syllabic n | /sɝʔn̩/ |
| top | aspirated | [tʰɑp] |
| time | aspirated | [tʰaɪm] |
| attain | aspirated (stressed) | /əˈtʰeɪn/ |
Record yourself reading the list. The most common Russian L1 error is producing aspirated [tʰ] for all of them, which sounds British.
5. The Mary-marry-merry merger
In most of AmE, Mary, marry, and merry are all pronounced identically — typically /mɛri/. This is the universal AmE merger of pre-/r/ vowels.
Three-way distinguished: Older NYC, Philadelphia, some New England — /meɪri/ vs /mæri/ vs /mɛri/.
Two-way distinguished: Some traditional Southern speakers.
Fully merged: General American, most regions.
For a Russian-trained speaker, the merged system is easier and recommended. Recognize that traditional NYC and Philly speakers may produce the distinction.
6. The wine-whine merger and /hw/
In nearly all modern AmE, the /hw/ vs /w/ contrast in wine vs whine, witch vs which, weather vs whether is fully merged — both pronounced /w/. The /hw/ pronunciation survives in:
- Older speakers (especially Southern and from formal-speech training).
- Some Scottish-and-Irish-substrate communities.
- Some formal-pulpit speech.
If you produce /hw/ consistently, you sound dated or affected. Drop it for natural AmE. Comprehension-wise, you may encounter it in older speakers and recorded archival material.
/hw/ self-test
Read aloud: whale, which, what, where, when, why, while, whisper, white, wheel. Do you produce these with /hw/ or just /w/? In modern AmE, /w/ dominates. Producing /hw/ consistently sounds dated, formal, or pulpit-influenced. For natural AmE, drop the /h/.
7. Subtle consonant contrasts: yod-dropping and yod-coalescence
In AmE, the /j/ glide before /u/ in stressed syllables has been largely dropped after coronal consonants (/t, d, n, s, z, l/):
- Tune /tuːn/, not BrE /tjuːn/.
- Dew /duː/, not /djuː/.
- News /nuːz/, not /njuːz/.
- Student /stuːdənt/, not /stjuːdənt/.
- Tuesday /tuːzdeɪ/, not /tjuːzdeɪ/.
But /j/ is retained after non-coronals: cute /kjuːt/, mute /mjuːt/, few /fjuː/.
In unstressed environments, /tj, dj/ may coalesce to /tʃ, dʒ/ (yod-coalescence): natural /nætʃərəl/, educate /ɛdʒəkeɪt/, did you /dɪdʒuː/, got you /gɑtʃuː/.
Russian-trained speakers often produce BrE-style /tjuːn, djuː, njuːz/. Drop the /j/ after coronals for AmE-native pronunciation.
Yod-dropping drill
Produce each word with AmE pronunciation (no /j/ after coronals):
- tune /tuːn/, tube /tuːb/, Tuesday /tuːzdeɪ/
- due /duː/, dew /duː/, duty /duːɾi/
- new /nuː/, news /nuːz/, neutral /nuːtrəl/
- suit /suːt/, super /suːpɚ/, suitable /suːɾəbəl/
- student /stuːdənt/, stupid /stuːpɪd/
- lewd /luːd/, lucid /luːsɪd/
But retain /j/ after non-coronals:
- cute /kjuːt/, cube /kjuːb/
- music /mjuːzɪk/, mute /mjuːt/
- few /fjuː/, fume /fjuːm/
- huge /hjuːdʒ/, human /hjuːmən/
- view /vjuː/, vacuum /vækjuːm/
Yod-dropping after coronals is one of the cleanest AmE-vs-BrE markers. A Russian L1 speaker producing BrE-style /tjuːn/ in AmE context is immediately flagged.
8. Pre-/r/ vowel mergers
Several pre-/r/ vowel mergers characterize AmE:
- The Mary-marry-merry merger (/ɛr/, /ær/, /ɛr/): in most AmE the three vowels before /r/ collapse so Mary, marry, and merry are homophones — the canonical pre-/r/ merger to know.
- The hurry-furry split (a separate phenomenon): hairy /ˈhɛri/ ~ /ˈheɪri/ vs hurry /ˈhɜri/ ~ /ˈhʌri/ — note this is /ɛr/ vs /ɜr/ (or /ʌr/ in older AmE), NOT /ɪr/ vs /ɛr/. The hurry-furry split distinguishes /ɜr/ from /ʌr/ before /r/ in stressed syllables; merged speakers have only /ɜr/.
- /ɔr/ and /oʊr/ merge in most AmE: horse and hoarse are homophones.
- /ʊr/ and /ɔr/ distinguished in some words: poor /pʊr/ vs pour /pɔr/ (varies regionally).
The general direction is toward simplification — fewer pre-/r/ vowel distinctions in younger speakers.
9. The Russian L1 substitution pattern
Common Russian-substrate phonetic features at C2 residual level:
- /æ/ produced too close to Russian /э/ — cat /kɛt/ rather than /kæt/.
- /ɪ/ produced too close to Russian /и/ — bit /bit/ rather than /bɪt/.
- /ə/ schwa replaced with a full vowel — about /aˈbaʊt/ ~ /ɑˈbaʊt/ rather than /əˈbaʊt/. Russian lacks /æ/ entirely, so the substitution is the Russian /a/ ~ /ɑ/ on the first syllable, not /æ/.
- No flap on intervocalic /t/ — better /bɛtʰər/ rather than /bɛɾɚ/.
- No glottal stop before syllabic /n/ — button /bʌtən/ rather than /bʌʔn̩/.
- Aspirated final /t/, /p/, /k/ where AmE has unreleased or glottal.
- Final-obstruent devoicing carried over from Russian — bag /bæk/ rather than /bæg/.
- /r/ trilled rather than the AmE bunched or retroflex approximant.
Each of these is fixable individually with focused drill, but at C2 residuals tend to persist under stress and fatigue. The cure is sustained attention to the specific feature in the speaker’s own production.
Common Russian-speaker mistakes
- Wrong: Aspirated [tʰ] in intervocalic positions (better /bɛtʰər/). Right: Flap /ɾ/ in unstressed intervocalic (better /bɛɾɚ/). Why: The aspirated allophone is BrE; AmE uses flap, and this is one of the strongest AmE-vs-BrE markers.
- Wrong: Released [tʰ] before syllabic /n/ (button /bʌtən/). Right: Glottal stop with syllabic /n/ (button /bʌʔn̩/). Why: AmE uses glottal stop in this environment; released /t/ sounds careful, non-native, or affected.
- Wrong: Final-obstruent devoicing carried from Russian (bag /bæk/, dog /dɔk/). Right: Voiced final obstruents (bag /bæɡ/, dog /dɑɡ/). Why: English maintains voicing contrast in final position; Russian devoicing transfers and is highly noticeable.
- Wrong: /j/ glide after coronals before /u/ (tune /tjuːn/, student /stjuːdənt/). Right: Yod-dropped (tune /tuːn/, student /stuːdənt/). Why: AmE has dropped the /j/ after coronals; retaining it sounds British.
- Wrong: Russian /э/ for English /æ/ (cat /kɛt/). Right: Lowered, fronted /æ/ (cat /kæt/). Why: Russian lacks /æ/ entirely; the L1 substitution /э/ is recognizably non-native at C2.
- Wrong: Full-vowel reductions where AmE has schwa (about /æbaʊt/). Right: Schwa /ə/ in unstressed syllables (about /əbaʊt/). Why: AmE relies on schwa for stress-timing; full vowels destroy the rhythm.
- Wrong: Trilled or tapped /r/ from Russian phonology. Right: Bunched or retroflex approximant /ɹ/ in AmE. Why: Russian /р/ is acoustically very different from AmE /ɹ/; the trilled variant is the most diagnostic non-native cue.
Summary
- The cot-caught merger is universal in the West, Canada, Boston, Pittsburgh; non-merged in NYC, Philadelphia, Inland North, Deep South.
- The pin-pen merger is universal in Southern AmE and AAE; absent elsewhere.
- /æ/-tensing in NYC and Philadelphia splits /æ/ into tense and lax phonemes with complex conditioning environments.
- Intervocalic /t/ in AmE has three main allophones: flap /ɾ/ in unstressed positions, glottal stop /ʔ/ before syllabic /n/, aspirated [tʰ] in stressed-syllable onsets.
- AmE has dropped /j/ after coronals before /u/ — tune /tuːn/, student /stuːdənt/, news /nuːz/.
- Russian L1 residuals at C2 include aspirated intervocalic /t/, final-obstruent devoicing, /j/ after coronals, trilled /r/, and Russian /э/ for English /æ/.
Next lesson: prosodic disambiguation — the same words producing different meanings under different stress placements, and prosody as the sole resolver of attachment ambiguity.