-
1
-
-
0023739214
-
Voice conversion through vector quantization
-
Abe, M., Nakanura, S., Shikano, K., Kuwabara, H., 1998. Voice conversion through vector quantization. In: Proceedings of IEEE International Conference on Acoustics, Speech, Signal Processing, pp. 655-658.
-
(1998)
Proceedings of IEEE International Conference on Acoustics, Speech, Signal Processing
, pp. 655-658
-
-
Abe, M.1
Nakanura, S.2
Shikano, K.3
Kuwabara, H.4
-
2
-
-
0033154052
-
Speaker transformation algorithm using segmental code books (STASC)
-
Arslan L.M. Speaker transformation algorithm using segmental code books (STASC). Speech Communication 28 (1999) 211-226
-
(1999)
Speech Communication
, vol.28
, pp. 211-226
-
-
Arslan, L.M.1
-
3
-
-
1942535983
-
Extraction of fixed dimension patterns from varying duration segments of consonant-vowel utterances
-
Chennai, India, pp
-
Gangashetty, S.V., Sekhar, C.C., Yegnanarayana, B., 2004. Extraction of fixed dimension patterns from varying duration segments of consonant-vowel utterances. In: Proceedings of IEEE International Conference on Intelligent Sensing and Information Processing, Chennai, India, pp. 159-164.
-
(2004)
Proceedings of IEEE International Conference on Intelligent Sensing and Information Processing
, pp. 159-164
-
-
Gangashetty, S.V.1
Sekhar, C.C.2
Yegnanarayana, B.3
-
5
-
-
0003962869
-
-
Macmillan Publishing Company, 866 Third Avenue, New York, USA
-
Hogg R.V., and Ledolter J. Engineering Statistics (1987), Macmillan Publishing Company, 866 Third Avenue, New York, USA
-
(1987)
Engineering Statistics
-
-
Hogg, R.V.1
Ledolter, J.2
-
7
-
-
4444285698
-
-
PhD Thesis, OGI School of Science and Engineering, Oregon Health and Science University, USA
-
Kain, A., 2001. High Resolution Voice Transformation. PhD Thesis, OGI School of Science and Engineering, Oregon Health and Science University, USA.
-
(2001)
High Resolution Voice Transformation
-
-
Kain, A.1
-
8
-
-
0034841948
-
Design and evaluation of a voice conversion algorithm based on spectral envelope mapping and residual prediction
-
Kain, A., Macon, M.W., 2001. Design and evaluation of a voice conversion algorithm based on spectral envelope mapping and residual prediction. In: Proceedings of IEEE International Conference of Acoustics, Speech, Signal Processing, vol. 2. pp. 813-816.
-
(2001)
Proceedings of IEEE International Conference of Acoustics, Speech, Signal Processing
, vol.2
, pp. 813-816
-
-
Kain, A.1
Macon, M.W.2
-
9
-
-
77950064323
-
Web-based listening test system for speech synthesis and speech conversion evaluation
-
Marrakech Morocco
-
Laurent Blin, O.B., Barreaud, V., 2008. Web-based listening test system for speech synthesis and speech conversion evaluation. In: Proceedings of LREC (Marrakech (Morocco)).
-
(2008)
Proceedings of LREC
-
-
Laurent Blin, O.B.1
Barreaud, V.2
-
11
-
-
0030365550
-
A new voice personality transformation based on both linear and nonlinear prediction analysis
-
Lee, K.S., Youn, D.H., Cha, I.W., 1996. A new voice personality transformation based on both linear and nonlinear prediction analysis. In: Proceedings of International Conference on Spoken Language Processing, pp. 1401-1404.
-
(1996)
Proceedings of International Conference on Spoken Language Processing
, pp. 1401-1404
-
-
Lee, K.S.1
Youn, D.H.2
Cha, I.W.3
-
12
-
-
77950024658
-
-
Xiao-dan Mei, Sheng-he Sun, 2000. An Efficient Method to Compute LSFs From LPC Coefficients, In: ICSP-2000, pp. 655-658.
-
Xiao-dan Mei, Sheng-he Sun, 2000. An Efficient Method to Compute LSFs From LPC Coefficients, In: ICSP-2000, pp. 655-658.
-
-
-
-
13
-
-
0000668614
-
Robustness of group-delay-based method for extraction of significant excitation from speech signals
-
Murthy P.S., and Yegnanarayana B. Robustness of group-delay-based method for extraction of significant excitation from speech signals. IEEE Transactions on Speech and Audio Processing 7 (1999) 609-619
-
(1999)
IEEE Transactions on Speech and Audio Processing
, vol.7
, pp. 609-619
-
-
Murthy, P.S.1
Yegnanarayana, B.2
-
15
-
-
0003513556
-
-
Prentice-Hall, Upper Saddle River, NJ
-
Oppenheim A.V., Schafer R.W., and Buck J.R. Discrete-time Signal Processing (1999), Prentice-Hall, Upper Saddle River, NJ
-
(1999)
Discrete-time Signal Processing
-
-
Oppenheim, A.V.1
Schafer, R.W.2
Buck, J.R.3
-
16
-
-
51449094434
-
Voice conversion with linear prediction residual estimation
-
Percybrooks, W.S., Moore II, E., 2008. Voice conversion with linear prediction residual estimation. In: Proceedings of IEEE International Conference on Acoustics, Speech, Signal Processing, pp. 4673-4676.
-
(2008)
Proceedings of IEEE International Conference on Acoustics, Speech, Signal Processing
, pp. 4673-4676
-
-
Percybrooks, W.S.1
Moore II, E.2
-
17
-
-
33745205178
-
-
PhD Thesis, Department of Computer Science and Engineering, Indian Institute of Technology Madras, Chennai, India
-
Prasanna, S.R.M., 2004. Event-Based Analysis of Speech. PhD Thesis, Department of Computer Science and Engineering, Indian Institute of Technology Madras, Chennai, India.
-
(2004)
Event-Based Analysis of Speech
-
-
Prasanna, S.R.M.1
-
18
-
-
0036288088
-
Detection of vowel onset point in speech
-
Orlando, Florida, USA
-
Prasanna, S.R.M., Zachariah, J.M., 2002. Detection of vowel onset point in speech. In: Proceedings of IEEE International Conference on Acoustics, Speech, Signal Processing, Orlando, Florida, USA.
-
(2002)
Proceedings of IEEE International Conference on Acoustics, Speech, Signal Processing
-
-
Prasanna, S.R.M.1
Zachariah, J.M.2
-
21
-
-
36048979629
-
Voice conversion by prosody and vocal tract modification
-
Bhubaneswar, Orissa, India
-
Rao, K.S., Yegnanarayana, B., 2006. Voice conversion by prosody and vocal tract modification. In: Ninth International Conference on Information Technology, Bhubaneswar, Orissa, India.
-
(2006)
Ninth International Conference on Information Technology
-
-
Rao, K.S.1
Yegnanarayana, B.2
-
22
-
-
33750713338
-
Modeling durations of syllables using neural networks
-
Rao K.S., and Yegnanarayana B. Modeling durations of syllables using neural networks. Computer Speech and Language 21 (2007) 282-295
-
(2007)
Computer Speech and Language
, vol.21
, pp. 282-295
-
-
Rao, K.S.1
Yegnanarayana, B.2
-
23
-
-
77950056334
-
Voice transformation by mapping the features at syllable level
-
Kolkata, India
-
Rao, K.S., Laskar, R.H., Koolagudi, S.G., 2007. Voice transformation by mapping the features at syllable level. In: Second International Conference on Pattern Recognition and Machine Intelligence (Premi-2007), Kolkata, India.
-
(2007)
Second International Conference on Pattern Recognition and Machine Intelligence (Premi-2007)
-
-
Rao, K.S.1
Laskar, R.H.2
Koolagudi, S.G.3
-
24
-
-
0029375490
-
Determination of instants of significant excitation in speech using group delay function
-
Smits R., and Yegnanarayana B. Determination of instants of significant excitation in speech using group delay function. IEEE Transactions on Speech and Audio Processing 3 (1995) 325-333
-
(1995)
IEEE Transactions on Speech and Audio Processing
, vol.3
, pp. 325-333
-
-
Smits, R.1
Yegnanarayana, B.2
-
28
-
-
33646767751
-
A study on residual prediction techniques for voice conversion
-
Sundermann, D., Bonafonte, A., Ney, H., 2005. A study on residual prediction techniques for voice conversion. In: Proceedings of IEEE International Conference on Acoustics, Speech, Signal Processing, pp. 13-16.
-
(2005)
Proceedings of IEEE International Conference on Acoustics, Speech, Signal Processing
, pp. 13-16
-
-
Sundermann, D.1
Bonafonte, A.2
Ney, H.3
-
29
-
-
77950036367
-
Tc-star: Evaluation plan for voice conversion technology
-
Munich, Germany
-
Sundermann, D., Bonafonte, A., Duxans, H., Hoege, H., 2005. Tc-star: evaluation plan for voice conversion technology. In: Proceedings of DAGA: 31st German Annual Conference on Acoustics, Munich, Germany.
-
(2005)
Proceedings of DAGA: 31st German Annual Conference on Acoustics
-
-
Sundermann, D.1
Bonafonte, A.2
Duxans, H.3
Hoege, H.4
-
30
-
-
0034842552
-
Voice conversion algorithm based on gaussian mixture model with dynamic frequency warping of STRAIGHT spectrum
-
Toda, T., Saruwatari, H., Shikano, K., 2001. Voice conversion algorithm based on gaussian mixture model with dynamic frequency warping of STRAIGHT spectrum. In: Proceedings of IEEE International Conference on Acoustics, Speech, Signal Processing, vol. 2. pp. 841-844.
-
(2001)
Proceedings of IEEE International Conference on Acoustics, Speech, Signal Processing
, vol.2
, pp. 841-844
-
-
Toda, T.1
Saruwatari, H.2
Shikano, K.3
-
31
-
-
77950029784
-
-
PhD Thesis, Institute for Graduate Studies in Science and Engineering, Bogaziti University, Berlin, Germany
-
Turk, O., 2007. Cross-lingual Voice Conversion. PhD Thesis, Institute for Graduate Studies in Science and Engineering, Bogaziti University, Berlin, Germany.
-
(2007)
Cross-lingual Voice Conversion
-
-
Turk, O.1
-
33
-
-
84863647359
-
Donor selection for voice conversion
-
Antalya, Turkey
-
Turk, O., Arslan, L.M., 2005. Donor selection for voice conversion. In: Proceedings of EUSIPCO, Antalya, Turkey.
-
(2005)
Proceedings of EUSIPCO
-
-
Turk, O.1
Arslan, L.M.2
-
34
-
-
33746653351
-
Robust processing techniques for voice conversion
-
Turk O., and Arslan L.M. Robust processing techniques for voice conversion. Computer Speech and Language 20 (2006) 441-467
-
(2006)
Computer Speech and Language
, vol.20
, pp. 441-467
-
-
Turk, O.1
Arslan, L.M.2
-
35
-
-
4544284652
-
High quality voice morphing
-
Ye, H., Young, S., 2004. High quality voice morphing. In: Proceedings of IEEE International Conference on Acoustics, Speech, Signal Processing, pp. 9-12.
-
(2004)
Proceedings of IEEE International Conference on Acoustics, Speech, Signal Processing
, pp. 9-12
-
-
Ye, H.1
Young, S.2
-
36
-
-
0035989168
-
AANN an alternative to GMM for pattern recognition
-
Yegnanarayana B., and Kishore S.P. AANN an alternative to GMM for pattern recognition. Neural Networks 15 (2002) 459-469
-
(2002)
Neural Networks
, vol.15
, pp. 459-469
-
-
Yegnanarayana, B.1
Kishore, S.P.2
-
38
-
-
0034856452
-
Source and system features for speaker recognition using AANN models
-
Salt Lake City, Utah, USA, pp
-
Yegnanarayana, B., Reddy, K.S., Kishore, S.P., 2001. Source and system features for speaker recognition using AANN models. In: Proceedings of IEEE International Conference on Acoustics, Speech, Signal Processing, Salt Lake City, Utah, USA, pp. 409-412.
-
(2001)
Proceedings of IEEE International Conference on Acoustics, Speech, Signal Processing
, pp. 409-412
-
-
Yegnanarayana, B.1
Reddy, K.S.2
Kishore, S.P.3
|