메뉴 건너뛰기




Volumn 19, Issue 3, 1996, Pages 221-244

Analysis/synthesis and modification of the speech aperiodic component

Author keywords

Analysis synthesis; Aperiodic component of speech; Random formant wave forms; Rice representation; Speech decomposition; Speech modifications; Speech noises

Indexed keywords

ACOUSTIC SIGNAL PROCESSING; RANDOM PROCESSES; SHOT NOISE; SPEECH ANALYSIS; SPEECH RECOGNITION; SPEECH SYNTHESIS;

EID: 0030232135     PISSN: 01676393     EISSN: None     Source Type: Journal    
DOI: 10.1016/0167-6393(96)00038-6     Document Type: Article
Times cited : (25)

References (63)
  • 2
    • 0025546887 scopus 로고
    • Time-frequency speech transformation based on an elementary waveform representation
    • C. d'Alessandro (1990), "Time-frequency speech transformation based on an elementary waveform representation", Speech Communication, Vol. 9, Nos. 5/6, pp. 419-431.
    • (1990) Speech Communication , vol.9 , Issue.5-6 , pp. 419-431
    • D'Alessandro, C.1
  • 4
    • 30244571354 scopus 로고
    • Evaluation of periodic/aperiodic decomposition for analysis of aperiodicities in the voice source
    • Dourdan, France
    • C. d'Alessandro, V. Darsinos and B. Yegnanarayana (1995b), "Evaluation of periodic/aperiodic decomposition for analysis of aperiodicities in the voice source", Proc. ISMA'95 Internat. Symp. on Music. Acoust., Dourdan, France, pp. 446-452.
    • (1995) Proc. ISMA'95 Internat. Symp. on Music. Acoust. , pp. 446-452
    • D'Alessandro, C.1    Darsinos, V.2    Yegnanarayana, B.3
  • 6
  • 7
    • 0014857228 scopus 로고
    • Adaptive predictive coding of speech signals
    • B.S. Atal and M.R. Schoeder (1970), "Adaptive predictive coding of speech signals", Bell Syst. Tech. J., Vol. 49, pp. 1973-1986.
    • (1970) Bell Syst. Tech. J. , vol.49 , pp. 1973-1986
    • Atal, B.S.1    Schoeder, M.R.2
  • 8
    • 0020543345 scopus 로고
    • Instantaneous frequency and energy distribution of a signal
    • C. Berthomier (1983), "Instantaneous frequency and energy distribution of a signal", Signal Processing., Vol. 5, No. 1, pp. 31-45.
    • (1983) Signal Processing , vol.5 , Issue.1 , pp. 31-45
    • Berthomier, C.1
  • 9
    • 0026368814 scopus 로고
    • Speech coding using nonstationary sinusoidal modelling and narrow-band basis functions
    • Toronto, Canada
    • H. Carl and B. Kolpatzik (1991), "Speech coding using nonstationary sinusoidal modelling and narrow-band basis functions", Proc. IEEE-ICASSP'91 Internat. Conf. Acoust. Speech Signal Process., Toronto, Canada, pp. 581-584.
    • (1991) Proc. IEEE-ICASSP'91 Internat. Conf. Acoust. Speech Signal Process , pp. 581-584
    • Carl, H.1    Kolpatzik, B.2
  • 10
    • 4243132821 scopus 로고
    • Revised recommendation P.80 - SQEG, COM XII-118 E, Internat. Telegraph and Telephone Consultative Commitee (CCITT) from Recommendation P.80, Blue Book, 1989
    • CCITT (1992), Revised recommendation P.80 - "Methods for subjective determination of transmission quality", SQEG, COM XII-118 E, Internat. Telegraph and Telephone Consultative Commitee (CCITT) from Recommendation P.80, Blue Book, Vol. V, 1989.
    • (1992) Methods for Subjective Determination of Transmission Quality , vol.5
  • 11
  • 12
    • 0025786649 scopus 로고
    • Vocal quality factors: Analysis, synthesis and perception
    • D.G. Childers and C.K. Lee (1991), "Vocal quality factors: Analysis, synthesis and perception", J. Acoust. Soc. Amer., Vol. 90, No. 5, pp. 2394-2410.
    • (1991) J. Acoust. Soc. Amer. , vol.90 , Issue.5 , pp. 2394-2410
    • Childers, D.G.1    Lee, C.K.2
  • 13
    • 84955030626 scopus 로고
    • An experimental study of speech wave probability distributions
    • W.B. Davenport Jr. (1952), "An experimental study of speech wave probability distributions", J. Acoust. Soc. Amer., Vol. 24, No. 4, pp. 390-399.
    • (1952) J. Acoust. Soc. Amer. , vol.24 , Issue.4 , pp. 390-399
    • Davenport W.B., Jr.1
  • 15
    • 0027285715 scopus 로고
    • A cepstrum-based technique for determining a harmonics-to noise ratio in speech signals
    • G. De Krom (1993), "A cepstrum-based technique for determining a harmonics-to noise ratio in speech signals", J. Speech Hearing Res., Vol. 36, pp. 254-266.
    • (1993) J. Speech Hearing Res. , vol.36 , pp. 254-266
    • De Krom, G.1
  • 17
    • 0027839344 scopus 로고
    • MBR-PSOLA: Text-To-Speech synthesis based on an MBE re-synthesis of the segments database
    • T. Dutoit and H. Leich (1993). "MBR-PSOLA: Text-To-Speech synthesis based on an MBE re-synthesis of the segments database", Speech Communication, Vol. 13, Nos. 3-4, pp. 435-440.
    • (1993) Speech Communication , vol.13 , Issue.3-4 , pp. 435-440
    • Dutoit, T.1    Leich, H.2
  • 20
    • 0019047655 scopus 로고
    • Parametric coding of speech spectra
    • J.L. Flanagan (1980), "Parametric coding of speech spectra", J. Acoust. Soc. Amer., Vol. 68, pp. 412-419.
    • (1980) J. Acoust. Soc. Amer. , vol.68 , pp. 412-419
    • Flanagan, J.L.1
  • 22
    • 0001654096 scopus 로고
    • Analysis-by-synthesis/overlap-add sinusoidal modeling applied to the analysis and synthesis of musical tones
    • E.B. George and M.J.T. Smith (1992), "Analysis-by-synthesis/overlap-add sinusoidal modeling applied to the analysis and synthesis of musical tones", J. Audio Eng. Soc., Vol. 40, No. 6, pp 497-516.
    • (1992) J. Audio Eng. Soc. , vol.40 , Issue.6 , pp. 497-516
    • George, E.B.1    Smith, M.J.T.2
  • 23
    • 30244520792 scopus 로고
    • Computationally efficient methods of calculating instantaneous frequency for auditory analysis
    • Berlin
    • I.R. Grandsten and S.W. Beet (1993), "Computationally efficient methods of calculating instantaneous frequency for auditory analysis", Proc. Eurospeech'93 European Conf. on Speech Comm. and Tech., Berlin, pp. 385-389.
    • (1993) Proc. Eurospeech'93 European Conf. on Speech Comm. and Tech. , pp. 385-389
    • Grandsten, I.R.1    Beet, S.W.2
  • 25
    • 0026394314 scopus 로고
    • Synthesis of breathy vowels: Some research methods
    • D.J. Hermes (1991), "Synthesis of breathy vowels: Some research methods", Speech Communication, Vol. 10, Nos. 5-6, pp. 497-502.
    • (1991) Speech Communication , vol.10 , Issue.5-6 , pp. 497-502
    • Hermes, D.J.1
  • 26
    • 0023587829 scopus 로고
    • A methodological study of perturbation and additive noise in synthetically generated voice signals
    • J. Hillenbrand (1987), "A methodological study of perturbation and additive noise in synthetically generated voice signals", J. Speech Hearing Res., Vol. 30, pp. 448-461.
    • (1987) J. Speech Hearing Res. , vol.30 , pp. 448-461
    • Hillenbrand, J.1
  • 28
    • 0015699693 scopus 로고
    • The influence of the glottal waveform on the naturalness of speech from a parallel formant synthesizer
    • J.N. Holmes (1973), "The influence of the glottal waveform on the naturalness of speech from a parallel formant synthesizer", IEEE Trans. Audio Electroacoust., Vol. AU-21, pp. 298-305.
    • (1973) IEEE Trans. Audio Electroacoust. , vol.AU-21 , pp. 298-305
    • Holmes, J.N.1
  • 29
    • 0020905802 scopus 로고
    • Research report - Formant synthesizers: Cascade or parallel?
    • J.N. Holmes (1983), "Research report - Formant synthesizers: Cascade or parallel?" Speech Communication, Vol. 2, No. 4, pp. 251-273.
    • (1983) Speech Communication , vol.2 , Issue.4 , pp. 251-273
    • Holmes, J.N.1
  • 30
    • 0018986665 scopus 로고
    • Software for a cascade/parallel formant synthesizer
    • D.H. Klatt (1980), "Software for a cascade/parallel formant synthesizer", J. Acoust. Soc. Amer., Vol. 67, No. 3, pp. 971-995.
    • (1980) J. Acoust. Soc. Amer. , vol.67 , Issue.3 , pp. 971-995
    • Klatt, D.H.1
  • 31
    • 0025321354 scopus 로고
    • Analysis, synthesis.And perception of voice quality variations among female and male talkers
    • D.H. Klatt and L.C. Klatt (1990), "Analysis, synthesis.and perception of voice quality variations among female and male talkers", J. Acoust. Soc. Amer., Vol. 87, No. 2, pp. 820-857.
    • (1990) J. Acoust. Soc. Amer. , vol.87 , Issue.2 , pp. 820-857
    • Klatt, D.H.1    Klatt, L.C.2
  • 32
    • 0023310877 scopus 로고
    • The measurement of the signal-to-noise ratio (SNR) in continuous speech
    • F. Klingholz (1987), "The measurement of the signal-to-noise ratio (SNR) in continuous speech", Speech Communication, Vol. 6, No. 1, pp. 15-26.
    • (1987) Speech Communication , vol.6 , Issue.1 , pp. 15-26
    • Klingholz, F.1
  • 38
    • 0011958880 scopus 로고
    • Complex representation of optical fields in coherence theory
    • L. Mandel (1967), "Complex representation of optical fields in coherence theory", J. Opt. Soc. Amer., Vol. 57, No. 5., pp. 613-617.
    • (1967) J. Opt. Soc. Amer. , vol.57 , Issue.5 , pp. 613-617
    • Mandel, L.1
  • 39
    • 0028445718 scopus 로고
    • Hybrid harmonic coding of speech at low bit rates
    • J.S. Marques and A.J. Abrantes (1994), "Hybrid harmonic coding of speech at low bit rates". Speech Comunication, Vol. 14, No. 3, pp. 231-247.
    • (1994) Speech Comunication , vol.14 , Issue.3 , pp. 231-247
    • Marques, J.S.1    Abrantes, A.J.2
  • 40
    • 0001944557 scopus 로고
    • Low-rate speech coding based on the sinusoidal model
    • S. Furui and M. M. Sondhi, Eds., Marcel Dekker, New York
    • R.J. McAulay and T.F. Quatieri (1992), "Low-rate speech coding based on the sinusoidal model", in: S. Furui and M. M. Sondhi, Eds., Advances in Speech Signal Processing (Marcel Dekker, New York), pp. 165-208.
    • (1992) Advances in Speech Signal Processing , pp. 165-208
    • McAulay, R.J.1    Quatieri, T.F.2
  • 41
    • 0025543906 scopus 로고
    • Pitch-synchronous waveform processing techniques for text-to-speech synthesis using diphones
    • E. Moulines and F. Charpentier (1990), "Pitch-synchronous waveform processing techniques for text-to-speech synthesis using diphones", Speech Communication, Vol. 9, Nos. 5/6, pp. 453-467.
    • (1990) Speech Communication , vol.9 , Issue.5-6 , pp. 453-467
    • Moulines, E.1    Charpentier, F.2
  • 42
    • 0015330634 scopus 로고
    • Minimum mean squared-error quantization in speech
    • M.D. Paez and T.H. Glisson (1972), "Minimum mean squared-error quantization in speech", IEEE Trans. Comm., Vol. Com-20, pp. 225-230.
    • (1972) IEEE Trans. Comm. , vol.COM-20 , pp. 225-230
    • Paez, M.D.1    Glisson, T.H.2
  • 45
    • 30244545527 scopus 로고
    • The analytical signal and related problem
    • G. Longo and B. Picinbono. Eds., Time and Frequency Representation of Signals and Systems, Springer, Wien
    • B. Picinbono (1989), "The analytical signal and related problem", in: G. Longo and B. Picinbono. Eds., Time and Frequency Representation of Signals and Systems, CISM Courses and Lectures No. 309 (Springer, Wien), pp. 1-9.
    • (1989) CISM Courses and Lectures No. 309 , vol.309 , pp. 1-9
    • Picinbono, B.1
  • 46
    • 0020747248 scopus 로고
    • Représentation des signaux par amplitude et phase instantanées
    • in French
    • B. Picinbono and W. Martin (1983), "Représentation des signaux par amplitude et phase instantanées", Ann. Télécommun., Vol. 38, No. 5-6, pp. 179-190 (in French).
    • (1983) Ann. Télécommun. , vol.38 , Issue.5-6 , pp. 179-190
    • Picinbono, B.1    Martin, W.2
  • 47
    • 0014272107 scopus 로고
    • Digital-formant synthesizer for speech-synthesis studies
    • L.R. Rabiner (1968), "Digital-formant synthesizer for speech-synthesis studies", J. Acoust. Soc. Amer., Vol. 49, No. 4, pp. 822-828.
    • (1968) J. Acoust. Soc. Amer. , vol.49 , Issue.4 , pp. 822-828
    • Rabiner, L.R.1
  • 48
    • 84939730902 scopus 로고    scopus 로고
    • Mathematical analysis of random noise
    • S.O. Rice (1944-1945), "Mathematical analysis of random noise", Bell Syst. Tech. J., Vol. 24, pp. 282-332, Vol. 25, pp. 46-156.
    • (1944) Bell Syst. Tech. J. , vol.24 , pp. 282-332
    • Rice, S.O.1
  • 49
    • 84939730902 scopus 로고    scopus 로고
    • S.O. Rice (1944-1945), "Mathematical analysis of random noise", Bell Syst. Tech. J., Vol. 24, pp. 282-332, Vol. 25, pp. 46-156.
    • Bell Syst. Tech. J. , vol.25 , pp. 46-156
  • 51
    • 30244436200 scopus 로고
    • Unvoiced speech analysis and synthesis using Poissonian random formant-wave-functions
    • Brussels, Belgium
    • G. Richard, C. d'Alessandro and S. Grau (1992), "Unvoiced speech analysis and synthesis using Poissonian random formant-wave-functions", Proc. EUSIPCO'92 European Sig. Process. Conf., Brussels, Belgium, pp. 347-350.
    • (1992) Proc. EUSIPCO'92 European Sig. Process. Conf. , pp. 347-350
    • Richard, G.1    D'Alessandro, C.2    Grau, S.3
  • 53
    • 0021499794 scopus 로고    scopus 로고
    • Time-domain formant-wave-function synthesis
    • J.C. Simon, Ed., Reidel, Dordrecht
    • X. Rodet (1980), "Time-domain formant-wave-function synthesis", in: J.C. Simon, Ed., Spoken Language Generation and Understanding (Reidel, Dordrecht). Reprinted in Computer Music J., Vol. 8, No. 3, pp. 9-14.
    • (1980) Spoken Language Generation and Understanding
    • Rodet, X.1
  • 54
    • 0021499794 scopus 로고    scopus 로고
    • X. Rodet (1980), "Time-domain formant-wave-function synthesis", in: J.C. Simon, Ed., Spoken Language Generation and Understanding (Reidel, Dordrecht). Reprinted in Computer Music J., Vol. 8, No. 3, pp. 9-14.
    • Computer Music J. , vol.8 , Issue.3 , pp. 9-14
  • 55
    • 85135065838 scopus 로고
    • Speech analysis and synthesis methods based on spectral envelopes and voiced/unvoiced functions
    • Edinburgh, UK
    • X. Rodet, P. Depalle and G. Poirot (1987), "Speech analysis and synthesis methods based on spectral envelopes and voiced/unvoiced functions", Proc. European Conf. on Speech Comm. and Tech., Edinburgh, UK.
    • (1987) Proc. European Conf. on Speech Comm. and Tech.
    • Rodet, X.1    Depalle, P.2    Poirot, G.3
  • 56
    • 0025544510 scopus 로고
    • Spectral modeling synthesis: A sound/synthesis system based on a deterministic plus stochastic decomposition
    • X. Serra and J. Smith (1990), "Spectral modeling synthesis: A sound/synthesis system based on a deterministic plus stochastic decomposition", Computer Music J., Vol. 14, No. 4.
    • (1990) Computer Music J. , vol.14 , Issue.4
    • Serra, X.1    Smith, J.2
  • 58
    • 0015142423 scopus 로고
    • Airflow and turbulence noise for fricative and stop consonants: Static considerations
    • K.N. Stevens (1971), "Airflow and turbulence noise for fricative and stop consonants: static considerations", J. Acoust. Soc. Amer., Vol. 50, No. 2, pp. 1180-1192.
    • (1971) J. Acoust. Soc. Amer. , vol.50 , Issue.2 , pp. 1180-1192
    • Stevens, K.N.1
  • 59
    • 84964182639 scopus 로고
    • Spectra of fricative noise in human speech
    • P. Strevens (1960), "Spectra of fricative noise in human speech", Language and Speech, Vol. 3. Reprinted in: Lehiste, Ed., Readings in Acoustic Phonetics (MIT press, Cambridge, MA, 1967), pp. 202-219.
    • (1960) Language and Speech , vol.3
    • Strevens, P.1
  • 60
    • 84964182639 scopus 로고
    • MIT press, Cambridge, MA
    • P. Strevens (1960), "Spectra of fricative noise in human speech", Language and Speech, Vol. 3. Reprinted in: Lehiste, Ed., Readings in Acoustic Phonetics (MIT press, Cambridge, MA, 1967), pp. 202-219.
    • (1967) Readings in Acoustic Phonetics , pp. 202-219
    • Lehiste1
  • 61
    • 30244538433 scopus 로고
    • Speech representation and analysis by the use of instantaneous frequency
    • M. Cooke, S. Beet and M. Crawford, Wiley, New York
    • A. Tsopanoglou, J. Mourjopoulos and G. Kokkinakis (1993), "Speech representation and analysis by the use of instantaneous frequency", in: M. Cooke, S. Beet and M. Crawford, Visual Representation of Speech Signals (Wiley, New York), pp. 341-346.
    • (1993) Visual Representation of Speech Signals , pp. 341-346
    • Tsopanoglou, A.1    Mourjopoulos, J.2    Kokkinakis, G.3
  • 63
    • 0020319209 scopus 로고
    • Harmonics-to-noise ratio as an index of the degree of hoarseness
    • E. Yumoto, W.J. Gould and T. Baer (1982), "Harmonics-to-noise ratio as an index of the degree of hoarseness", J. Acoust. Soc. Amer., Vol. 71, pp. 1544-1550.
    • (1982) J. Acoust. Soc. Amer. , vol.71 , pp. 1544-1550
    • Yumoto, E.1    Gould, W.J.2    Baer, T.3


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.