F2開始周波数と調音位置　その２　（F2-onset frequency and place of articulation Part 2）

閉鎖子音の直後に母音が続く場合、子音の調音位置にしたがって、母音の第1および第2フォルマント(F1、F2)の遷移部の周波数が変わる、という様子は[K300 F2開始周波数と調音位置]で確認した通りです。
ここでは、先のページよりも多くのパラメータを変更することで、より自然な音声に近づける工夫をしてみます。

このデモンストレーションでは、フォルマント合成器を使い、上記のようにフォルマント周波数の軌跡を変化させることでCV（子音・母音）音節を合成しています。このとき、閉鎖子音の破裂に伴うバーストは合成していません。合成に関する各パラメータは次の通りです。[new]の記載があるパラメータが追加で変更されています。より詳細は [8] Tomaru and Arai (2016) をご覧ください。

[new] ・後続母音は/a/：F1＝790 Hz、F2＝1190 Hz、F3＝2640 Hz
        ・母音遷移部はF1とF2のみ変化させた
[new] ・F1およびF2の遷移時間：22 ms ～ 38msの区間を2ms刻みで変化（合計9パターン）
[new] ・F1の遷移開始周波数：385 Hzで一定
[new] ・F2の遷移開始周波数：885 Hzから1690 Hzの区間を100Hz刻みで変化
[new] ・基本周波数は以下のタイミングで変化
            0 ms ～　100 ms = 125 Hz
            150 ms = 130 Hz
            200 ms = 135 Hz
            300 ms = 130 Hz
            330 ms = 125 Hz
[new] ・F1～F5のBandwidth
            F1 = 60 Hz
            F2 = 105 Hz
            F3 = 150 Hz
            F4 = 200 Hz
            F5 = 1000 Hz
[new] ・OQ (Open Quotioent: voicing open-time/period） = 80%
[new] ・TL（Extra tilt of voicing spectrum) = 8 dB down @ 3 kHz

	Step 1	Step 2	Step 3	Step 4	Step 5	Step 6	Step 7	Step 8	Step 9
F1 & F2 transition time (ms)	22	24	26	28	30	32	34	36	38
F2 starting frequency (ms)	885	985	1085	1190	1290	1390	1490	1590	1690
Sounds

Kewley-Port, D., “Measurement of formant transitions in naturally produced stop consonant-vowel syllables,” J. Acoust. Soc. Am., 72(2), 379-389, 1982.
Kent, R. D. and Read, C., Acoustic Analysis of Speech, Singular Publishing, San Diego, CA, 2001. （荒井隆行, 菅原勉監訳, 音声の音響分析, 海文堂, 1996.）
Klatt, D. H. and L. C. Klatt, “Analysis, synthesis, and perception of voice quality variations among female and male talkers,” J. Acoust. Soc. Am., 87, 820–857 (1990).
Klatt,D. H. “The new MIT speech VAX computer facility,” in Speech Communication Group Working Papers IV, Research Laboratory of Electronics (MIT, Cambridge, MA, 1984), pp. 73–82.
Liberman, A. M., P. C. Delattre, F. S. Cooper and L. J. Gerstman, “The role of consonant-vowel transitions in the perception of the stop and nasal consonants,” Psychol. Monogr.: Gen. Appl., 68(8), 1–13 (1954).
Liberman, A. M., K. S. Harris, H. S. Hoffman and B. C. Griffith, “The discrimination of speech sounds within and across phoneme boundaries,” J. Exp. Psychol., 54, 358–368 (1957).
Liberman, A. M., K. S. Harris, J. A. Kinney and H. Lane, “The discrimination of relative onset-time of the components of certain speech and nonspeech patterns,” J. Exp. Psychol., 61, 379–388 (1961).
Tomaru, L. and T. Arai, “Role of labeling mediation in speech perception: Evidence from a voiced stop continuum perceived in different surrounding sound contexts,” Acoust. Sci. & Tech., 37(6), 303-314 (2016).