Japanese is one of the easiest languages in the world to pronounce. It has a relatively small number of well-defined sounds, most of which are not that hard for the Western tongue. And Japanese is pronounced almost exactly as written (either in kana, or transliterated Roman characters).
Having said that, it's always best to hear a native speaker's pronunciation; you can do that here.
This SL page refers to a Japanese language teaching book [1] for the basics of Japanese pronunciation in regard to English speech.
There are several schemes for writing Japanese in the Roman alphabet (called roumaji in Japanese). Sensei's Library uses the Hepburn Romanization with the wapuro ("word processor") spelling of long vowels. (The Hepburn Romanization because it gives English speakers a better idea of pronunciation, and the wapuro long vowels because they are easy to type, require only ASCII characters, are hard to lose, correspond to the way they are written in kana, and correspond to how you type them into a Japanese word processor.)
Japanese has five vowels: a, i, u, e, o. They are short, and are pronounced clearly and crisply. If you pronounce the vowels in the following English sentence, making them all short, you will have their approximate sounds. The u is pronounced without pushing the lips forward.
Ah, we soon get old.
a i u e o
Long vowels are pronounced like the short vowels but held for twice as long. Care should be taken to pronounce them as a continuous sound, equal in value to two identical short vowels.
Long a is written aa; long i is written ii; long u is written uu; long e is written ei; and long o is written ou.
For example, ii ("good") has a long i; kyuu ("rank") has a long u; sensei ("teacher") has a long e; and jouseki ("established line of play") has a long o.
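The spelling rule above can be sketched in a few lines of Python. This is only an illustration: the function name find_long_vowels is made up for this page, and the sketch naively treats every aa, ii, uu, ei and ou as a long vowel, which is an assumption that does not always hold in real Japanese (for instance where two vowels merely happen to meet at a word boundary inside a compound).

```python
import re

# Wapuro spellings of the five long vowels, as listed above,
# mapped to the short vowel that is being lengthened.
LONG_VOWELS = {"aa": "a", "ii": "i", "uu": "u", "ei": "e", "ou": "o"}

_LONG = re.compile("|".join(LONG_VOWELS))


def find_long_vowels(word):
    """Return (spelling, vowel) pairs for each long vowel, left to right."""
    return [(m.group(), LONG_VOWELS[m.group()])
            for m in _LONG.finditer(word.lower())]


for example in ("ii", "kyuu", "sensei", "jouseki"):
    print(example, find_long_vowels(example))
```

A word with only short vowels, such as sente, gives an empty list.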
Every syllable in the Japanese language is either a vowel on its own, a consonant followed by a vowel, or n. The consonants are k, s, t, n, h, m, y, r, w, the voiced counterparts g, z, d, b (of k, s, t, h), and p, which is unvoiced but is written in kana as a marked form of h. In addition, a y sound may intervene between the initial consonant and the vowel, as is seen in words such as byou ("second").
Some sounds combine in ways you might not expect:
The interstitial y "palatalizes" the preceding consonant: that is, causes it to be pronounced with your tongue near the top of your mouth. For example, byou is not pronounced like the start of English "biology".
fu is pretty hard to pronounce for English speaking people. Don't bring your lower lip up to your upper teeth as in English f; instead purse your lips as if about to whistle and blow through them. Try to find an example in an anime show for the correct pronunciation of it. It's similar to the sound you make when you blow out a candle.
The r in Japanese is "flapped". Prepare your tongue as if about to say English l but instead touch the tip of the tongue quickly to the top of your mouth and bring it down again. If you get it right it sounds somewhere between English l, r and d.
The g as in ga, gi, etc. at the beginning of a word is hard (as in English "garden"), but when it occurs in the middle or in the last syllable of a word, it often becomes nasal, as in eiga ("movie"). (Some speakers always use the nasal g; this is a regional difference in pronunciation.)
n is the only independent consonant not combined with a vowel. It turns the preceding vowel into a nasal vowel. If it is followed by a syllable beginning with b, m or p, it is pronounced more like English m, for example shinbun ("newspaper") is pronounced more like shimbun. (In some Romanization schemes you would write m in these cases but in Hepburn you write n regardless of pronunciation.)
Distinguishing syllabic n from the ordinary consonant n can make a big difference. For example kinen means "no smoking" when pronounced ki + n + e + n but means "anniversary" when pronounced ki + ne + n.
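The rule for n before b, m or p can be written as a one-line substitution. The sketch below is hypothetical (the name approximate_n_sound is invented here) and only produces a rough pronunciation hint; as noted above, the Hepburn spelling itself keeps the n.

```python
import re


def approximate_n_sound(word):
    """Show syllabic n as m where it comes directly before b, m or p."""
    # An n directly followed by b, m or p cannot start a consonant + vowel
    # syllable, so it must be the syllabic n.
    return re.sub(r"n(?=[bmp])", "m", word)


print(approximate_n_sound("shinbun"))  # shimbun
print(approximate_n_sound("kinen"))    # kinen (no b, m or p follows)
```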
A double consonant, for example the kk in ikkentobi ("one-space jump"), indicates that the consonant is double length. Start to say the consonant, then stop the flow of air, pause for one syllable, and then finish saying the consonant.
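To see how the syllable rules fit together, here is a rough Python sketch that splits a romanized word into the timing units described on this page: a vowel, a consonant (optionally with y) plus a vowel, the syllabic n, or the held first half of a double consonant. The function name split_units and the handling of Hepburn spellings such as sh, ch, ts, f and j are assumptions of this sketch, not something the page defines. An apostrophe after n (as in kin'en) is taken to mark a syllabic n, which is one common way of removing the ambiguity in kinen.

```python
import re

_UNIT = re.compile(
    r"""
    n'                       # syllabic n marked with an apostrophe
  | (?:[kgsztdnhbpmr]y|sh|ch|ts|[kgsztdnhfbpmyrwj])?[aiueo]
                             # optional consonant (with optional y), then a vowel
  | n                        # syllabic n (no vowel follows)
  | ([kstpgzdb])(?=\1)       # first half of a double consonant
  | t(?=ch)                  # Hepburn spelling of a doubled ch
    """,
    re.VERBOSE,
)


def split_units(word):
    """Split a romanized word into units of roughly equal length."""
    word = word.lower()
    units = [m.group() for m in _UNIT.finditer(word)]
    if "".join(units) != word:
        # Something in the word did not fit the syllable rules.
        raise ValueError(f"cannot split {word!r}")
    return units


print(split_units("ikkentobi"))  # ['i', 'k', 'ke', 'n', 'to', 'bi']
print(split_units("kinen"))      # ['ki', 'ne', 'n']
print(split_units("kin'en"))     # ['ki', "n'", 'e', 'n']
```

Note that a long vowel comes out as two units (jouseki gives jo, u, se, ki), which matches the description of long vowels as equal in value to two short ones.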
Each word has a stereotypical stress pattern. Stressed syllables are indicated by higher pitch. This allows you to distinguish nihon ("Japan") from nihon ("two bottles").
Some Japanese Go terms have been adopted by English-speaking players with ad-hoc spellings. For example kou has become ko and jouseki joseki. (This probably happened because some people used a Romanization scheme in which long vowels are written with accented characters, for example kō or kô, and the accents were later lost.)
The English form "ko" is fine for writing English but remember to use kou when writing Japanese.
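The suspected loss of accents is easy to reproduce. The following Python sketch (strip_accents is a name made up for this example) decomposes each accented letter and throws away the accent, which is roughly what happens when text passes through software or typesetting that only handles plain ASCII letters.

```python
import unicodedata


def strip_accents(text):
    """Drop macrons, circumflexes and other combining accents."""
    # NFD splits an accented letter into the base letter plus a combining mark.
    decomposed = unicodedata.normalize("NFD", text)
    return "".join(ch for ch in decomposed if not unicodedata.combining(ch))


print(strip_accents("kō"))       # ko
print(strip_accents("kô"))       # ko
print(strip_accents("jōseki"))   # joseki
```

The wapuro spellings kou and jouseki contain no accents, so they pass through unchanged.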
Some adopted words have Anglicised pronunciations, for example sente may be pronounced in English with a diphthong at the end, like "sentay" but in Japanese it ends with a short e.
(Anonymous): I've always wondered, what is the correct pronunciation, if you are an English speaker, of the terms Tesuji and Tsumego? Thanks!
Please check the link to native speaker's recordings at the top of this page.
Karl: Some of the comments on "tricky pronunciation for English speakers" are problematic for me...
"The interstitial y "palatalizes" the preceding consonant: that is, causes it to be pronounced with your tongue near the top of your mouth. For example, byou is not pronounced like the start of English "biology"."
Well of course it isn't; the English long I is a completely different sound anyway (which would be rendered as an 'ai' diphthong in roumaji). But I can't figure out how to put my tongue anywhere near the roof of my mouth and produce a sound that remotely resembles 'b'. Is it sufficient to have a falling motion of the tongue between the 'b' and 'y', as opposed to keeping the tip of the tongue anchored under the lower teeth (as I normally would for a 'b')? Otherwise I sound rather as if I'm in the dentist's chair.
"fu is pretty hard to pronounce for English speaking people. Don't bring your lower lip up to your upper teeth as in English f; instead purse your lips as if about to whistle and blow through them. Try to find an example in an anime show for the correct pronunciation of it. It's similar to the sound you make when you blow out a candle."
I don't find it difficult to get a reasonable approximation; easier than that r/l/d sound, anyway. (It's worth noting that some systems of roumaji use l's instead, and others probably allow for either according to what looks better in context. And then, often you see imported English words given in English spelling instead of roumaji transliteration...)
"n is the only independent consonant not combined with a vowel. It turns the preceding vowel into a nasal vowel. If it is followed by a syllable beginning with b, m or p, it is pronounced more like English m, for example shinbun ("newspaper") is pronounced more like shimbun. (In some Romanization schemes you would write m in these cases but in Hepburn you write n regardless of pronunciation.)"
Should that perhaps be b, h and p, which are related (same kana except for dots/circle marking)? Or is this just one of those random things? Although I do suppose a 'slide' from an n sound to an m sound is rather odd.
"Distinguishing syllabic n from the ordinary consonant n can make a big difference. For example kinen means "no smoking" when pronounced ki + n + e + n but means "anniversary" when pronounced ki + ne + n."
Okay, but how do you make this distinction in pronunciation?
Oh - and what is the difference between the z+i "ji" and the d+i "ji"?
Rafael: Karl, the y is not a big deal. The explanation above just makes it look harder than it is. I believe you can pronounce it as you do in English "yard", "your", "Dubya". Your tongue does rise, but just a little. Insert a finger in your mouth and pronounce "ah". Your tongue doesn't have to touch the finger. Then pronounce "ya" (as in "yard").
Please listen to the recordings linked above; that's probably much easier than trying to understand written descriptions. The fact that English spelling is a mess doesn't help (I guess the "y" explanation is meant to tell people not to pronounce it as in "Yale").
The sounds "b" and "p" are indeed related: both are produced by joining the lips and interrupting the air flow. The difference is that "b" is voiced while "p" is unvoiced (vocal folds don't vibrate). The sound "m" is also bilabial (you join both lips) but the air flow goes thru the nose, like "n". So it's not a coincidence that we usually turn the "n" sound into an "m" sound before "b" and "p": it's hard not to do so.
I pronounce "kin'en" (no smoking) roughly like "king yen", but I'm not a native speaker (and again, I don't see much point in this kind of description unless you really find it hard to listen to the recordings and imitate the sounds).
There's no difference between "zi" and "di" as far as pronunciation is concerned.